USPS_INT comparison

Compares an interval from a data source to an interval from a reference source for columns that contain address primary number. The results match if the interval in the data source overlaps any part of the interval from a reference source and the odd-even parity agrees. Parity control information such as the USPS ZIP+4 control column is required.

Frequency information is not taken into account when this match comparison is used but a two-source match requires four input streams. If you use this match comparison with a Two-source Match stage job, create two dummy file inputs instead of files that contain frequency information.

Required Columns

Both sources require an address primary low number, an address primary high number, and an address primary odd/even control.

The following data source and reference source columns are required.

  • Data. The beginning of the street address range from the data source.
  • Data. The ending of the street address range from the data source.
  • Reference. The beginning of the street address range from the reference source.
  • Reference. The ending of the street address range from the reference source.
  • Data. The odd/even parity for the data source.
  • Reference. The odd/even parity for the reference source.

The control information from the USPS ZIP + 4 code is:

  • O. The range represents only odd house numbers.
  • E. The range represents only even house numbers.
  • B. The range represents all numbers (both odd and even) in the interval.
  • U. The parity of the range is unknown.

How It Works

Agreement weight is assigned when:

  • The odd/even control is set to E, O, or B on both the data source and the reference source
  • The odd/even control is set to E or O on one file and to B on the other file (such as E on the data source and B on the reference source)

Disagreement weight is assigned when the parity on one source is set to E or O and on the other source is set to the opposite; that is, either the data source to E and the reference source to O or the data source to O and the reference source to E.

If all strings are numeric, the comparison performs an integer interval comparison; otherwise, the comparison performs an alphanumeric interval comparison.