IBM InfoSphere DataStage and QualityStage
IBM® InfoSphere® DataStage® and QualityStage® provides a graphical framework that you use to design and run the jobs that transform and cleanse your data.
Depending on which products you have licensed, you can develop parallel jobs to transform and cleanse data and server jobs to transform data. Parallel and server jobs are run on the IBM InfoSphere Information Server engine. Mainframe jobs produce COBOL code which runs on a mainframe computer.
You design jobs in the IBM InfoSphere DataStage and QualityStage Designer client and run them in the IBM InfoSphere DataStage and QualityStage Director client. Jobs are organized into projects, and you can administer these projects by using the IBM InfoSphere DataStage and QualityStage Administrator client. You can deploy your job designs and job design collateral by using the InfoSphere Information Server Manager.
This document lists the stages that are available in IBM
InfoSphere Information Server, as included with the base installation or with add-on installations.
IBM
InfoSphere
DataStage is a data integration tool for designing, developing, and running jobs that move and transform data.
Use these tutorials to learn the basic skills that you need to develop parallel jobs that transform data and parallel jobs that cleanse data.
You design IBM
InfoSphere
DataStage and QualityStage jobs by using the Designer client. The Designer client is like a workbench or a blank canvas that has a palette that contains the tools that form the basic building blocks of a job: stages, links, and annotations.
You design parallel jobs to transform and to cleanse data. Parallel jobs consist of individual stages. Each stage describes a particular process, this might be accessing a database or transforming data in some way. Parallel jobs brings the power of parallel processing to your data extraction and transformation applications.
Server jobs are compiled and run on the server engine. Such jobs connect to a data source, extract and transform data, and write data to a target database or file, such as a data warehouse.
QualityStage jobs
The cleansing process can include, but is not limited to, eliminating redundant, obsolete, or inaccurate data. Clean data is a critical component for accurate information, reports, and analyses. Throughout your organization, people make business decisions based on data that is provided to them. By cleansing data, you provide high-quality data.
Use the InfoSphere Information Server Manager to move IBM
InfoSphere
DataStage and QualityStage objects between projects on the same engine or on different engines. You can also use the InfoSphere Information Server Manager to move objects from one domain to another.
You can use the workload management queues to control the starting of parallel and server jobs.
You can use the Operations Database and the Operations Console to better monitor the job runs, services, and system resources on several InfoSphere DataStage engines.
IBM
InfoSphere
DataStage and QualityStage jobs are organized in projects, along with associated design items. Different users can be granted access to different projects.
The reference topics provide more in-depth information about IBM
InfoSphere
DataStage and QualityStage. You can use these topics to help you fine tune your jobs and to produce custom components to use in your jobs.
This glossary contains terms and definitions for InfoSphere Information Server.