IBM InfoSphere DataStage is an extract, transform, and load (ETL) solution that is part of IBM InfoSphere Information Server. DataStage moves and transforms large volumes of data at high speeds to feed data warehouses, data lakes, applications, business intelligence reports, cloud repositories, and many other target systems.
In this demo, you use DataStage to complete ETL data processing in a traditional enterprise data warehouse. You will complete the following steps:
- Run a traditional Data Warehouse ETL job
- Move data from Data Warehouse to Hadoop
- Run a Data Warehouse ETL by using Hadoop data
- Run ETL processing inside Hadoop by using YARN