Output columns

Each rule set contains a list of output columns. Standardized data is added to these columns.

Output columns are defined in the dictionary (previously called the .DCT file).

In the IBM® InfoSphere® QualityStage® Standardization Rules Designer, you can add or modify rules that map input data to output columns. You can also specify leading separators, which separate strings in an output column.

Most rule sets include the types of output columns that are shown in the following table.
Table 1. Types of output columns
Type Description Example columns
Business intelligence Contain the standardized values. These columns are often arranged in an order that is suited to the domain. For example, in the predefined rule set that is applied to US address data, the house number output column is before the street name output column.
  • HouseNumber
  • HouseNumberSuffix
  • StreetPrefixDirectional
Matching Contain information that is used exclusively for matching.
  • StreetNameSOUNDEX
  • MatchFirstNameNYSIIS
  • CityNameRVSNDX
Reporting Contain information that is used to evaluate the standardization results.
  • UnhandledData
  • InputPattern
  • ExceptionData