Building InfoSphere DataStage and InfoSphere QualityStage jobs
You can use the IBM® InfoSphere® DataStage® and IBM InfoSphere QualityStage® Designer to build jobs that cleanse, transform, and write data to a database table or a comma-separated value (CSV) file. After the data is written, you can import the data into IBM Product Master.
Before you begin
- Identify the product data sources and ensure that you have access to the sources.
- Ensure that you understand the content, structure, and initial quality of the product data sources. You can use the Information Analyzer component of the IBM InfoSphere Information Server product to analyze the data sources.
About this task
Important: Ensure
that output file produces data in the correct pair format (column,
column-value).
Procedure
Open the InfoSphere DataStage Designer and create
a job that runs one or more of the following tasks:
- Compute a frequency distribution for:
- Eliminating duplicate records.
- Matching input records.
- Determining the clerical pairs.
- Validate field values.
- Validate that a manufacturer in the source data is recognized by the product.
- Correct misspellings.
- Specify which part of the hierarchy an item should be placed into based on business rules.
- Create the input file of cleansed master product data to load into the product.