IBM Information Server, Version 8.1

WebSphere DataStage

Data transformation and movement is the process by which source data is selected, converted, and mapped to the format required by target systems. The process manipulates data to bring it into compliance with business, domain, and integrity rules and with other data in the target environment.

Transformation can take some of the following forms:

Aggregation
Consolidating or summarizing data values into a single value. Collecting daily sales data to be aggregated to the weekly level is a common example of aggregation.
Basic conversion
Ensuring that data types are correctly converted and mapped from source to target columns.
Cleansing
Resolving inconsistencies and fixing the anomalies in source data.
Derivation
Transforming data from multiple sources by using an algorithm.
Enrichment
Combining data from internal or external sources to provide additional meaning to the data.
Normalizing
Reducing the amount of redundant and potentially duplicated data.
Pivoting
Converting each record in an input stream into many records in the appropriate table in the data warehouse or data mart.
Sorting
Sequencing data based on data or string values.
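
Several of the forms above can be illustrated with a short, generic sketch. The following Python code is not WebSphere DataStage code and uses no DataStage API. The sample rows and the function names (convert, aggregate_weekly, pivot) are hypothetical and chosen only for illustration. It assumes daily sales rows that arrive as text and shows basic conversion, cleansing, aggregation to the weekly level, pivoting, and sorting.

```python
from collections import defaultdict
from datetime import date

# Hypothetical source rows: (sale date as text, store name, amount as text).
daily_sales = [
    ("2008-09-01", " north ", "100.50"),
    ("2008-09-02", "North", "200.00"),
    ("2008-09-08", "south", "50.25"),
    ("2008-09-03", "SOUTH", "75.00"),
]


def convert(row):
    """Basic conversion and cleansing: typed values, consistent store names."""
    day, store, amount = row
    return date.fromisoformat(day), store.strip().lower(), float(amount)


def aggregate_weekly(rows):
    """Aggregation: sum daily amounts per (ISO year, ISO week, store)."""
    totals = defaultdict(float)
    for day, store, amount in rows:
        iso_year, iso_week, _ = day.isocalendar()
        totals[(iso_year, iso_week, store)] += amount
    return dict(totals)


def pivot(record, key, columns):
    """Pivoting: turn one wide record into one output record per column."""
    return [
        {key: record[key], "measure": column, "value": record[column]}
        for column in columns
    ]


clean = [convert(row) for row in daily_sales]
weekly = aggregate_weekly(clean)

# Sorting: sequence the weekly totals by year, week, and then store name.
weekly_sorted = sorted(weekly.items())

for (year, week, store), total in weekly_sorted:
    print(year, week, store, total)
```

In this sketch, the inconsistent spellings of each store name are cleansed into one value before aggregation, so that the weekly totals are grouped under a single key per store.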

WebSphere® DataStage™ supports the collection, transformation, and distribution of large volumes of data, with data structures that range from simple to highly complex. WebSphere DataStage manages data that arrives in real time and data that is received on a periodic or scheduled basis. WebSphere DataStage enables companies to solve large-scale business problems with high-performance processing of massive data volumes.

By leveraging the parallel processing capabilities of multiprocessor hardware platforms, WebSphere DataStage can scale to satisfy the demands of ever-growing data volumes, stringent real-time requirements, and ever-shrinking batch windows.


This topic is also in the IBM Information Server Introduction.

Last updated: 2008-09-15