Quality stages
Use the QualityStage stages in DataStage® to investigate, cleanse, and manage your data.
With the QualityStage stages, also known as data quality stages, you can manipulate your data in the following ways:
- Resolve data conflicts and ambiguities.
- Uncover new or hidden attributes from free-form or loosely controlled source columns.
- Conform data by transforming data types into a standard format.
Note: You need DataStage Enterprise Plus to access the QualityStage stages.
Stage functions
The following table lists the available stages and gives details on their functions:
Stage | Function |
---|---|
Address Verification | Provides comprehensive address parsing, standardization, validation, geocoding, and reverse geocoding. |
Data rules | Checks data quality anywhere in the flow of a job. |
Investigate | The character investigation type of the Investigate stage analyzes and classifies data, parsing it into a single-pattern report. The word investigation type of the Investigate stage uses a set of rules for classifying data such as personal names, business names, and addresses. |
Match Frequency | Generates the frequency distribution of values for columns in the input data. You use the frequency distribution and the input data in match jobs. |
One-source Match | Matches records from a single source file. |
Standardize | Makes source data internally consistent, so each data type has the same kind of content and format. |
Survive | Groups duplicate records and constructs output records with the best available data for each group. |
Two-source Match | Compares two sources of input data (reference records and data records) for matches. |
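
To make two of the stage descriptions more concrete, the following is a minimal conceptual sketch in plain Python. It is not the DataStage API and does not reflect how the stages are implemented. The function names `value_frequencies` and `survive`, the `match_id` column, and the "longest non-empty value wins" rule are all assumptions chosen for illustration. In a real job, the rules for selecting the best data are configured in the Survive stage itself.

```python
from collections import Counter, defaultdict


def value_frequencies(records, column):
    """Count how often each value occurs in one column.

    Illustrates the idea behind Match Frequency (hypothetical helper).
    """
    return Counter(rec[column] for rec in records)


def survive(records, group_key):
    """Build one output record per group of duplicates.

    Illustrates the idea behind Survive. The rule used here (keep the
    longest non-empty string per column) is an assumption for this sketch.
    """
    groups = defaultdict(list)
    for rec in records:
        groups[rec[group_key]].append(rec)

    survivors = []
    for members in groups.values():
        # Start from empty values, then keep the longest value seen per column.
        best = dict.fromkeys(members[0], "")
        for rec in members:
            for col, val in rec.items():
                if len(val) > len(best[col]):
                    best[col] = val
        survivors.append(best)
    return survivors


# Hypothetical input: records 1 and 2 were already identified as duplicates.
records = [
    {"match_id": "1", "name": "J. Smith", "city": "Boston"},
    {"match_id": "1", "name": "John Smith", "city": ""},
    {"match_id": "2", "name": "Ann Lee", "city": "Austin"},
]

frequencies = value_frequencies(records, "match_id")
survivors = survive(records, "match_id")
```

In this sketch, the two records that share `match_id` "1" collapse into a single record that takes the fuller name from one input and the city from the other, which is the general idea of constructing an output record from the best available data in each group.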