Modify stage in DataStage

The Modify stage alters the record schema of its input data set and outputs the modified data set. You can drop or keep columns from the schema, or change the type of a column.
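The three schema operations described above can be illustrated outside DataStage with a small Python sketch. This is a conceptual model only, not the DataStage API or the stage's specification syntax. The function names (drop_columns, keep_columns, change_type, apply_schema) and the representation of a schema as a dictionary of column names to Python types are assumptions made for illustration.

```python
from typing import Any, Dict, Set

# Hypothetical model: a schema maps column names to Python types,
# and a record maps column names to values.
Schema = Dict[str, type]
Record = Dict[str, Any]


def drop_columns(schema: Schema, names: Set[str]) -> Schema:
    """Return a schema without the named columns."""
    return {col: typ for col, typ in schema.items() if col not in names}


def keep_columns(schema: Schema, names: Set[str]) -> Schema:
    """Return a schema containing only the named columns."""
    return {col: typ for col, typ in schema.items() if col in names}


def change_type(schema: Schema, name: str, new_type: type) -> Schema:
    """Return a schema in which one column has a new type."""
    if name not in schema:
        raise KeyError(name)
    return {col: (new_type if col == name else typ) for col, typ in schema.items()}


def apply_schema(record: Record, schema: Schema) -> Record:
    """Project a record onto the schema, converting each value to the column type."""
    return {col: typ(record[col]) for col, typ in schema.items()}


# Illustrative data, not taken from any real job.
input_schema: Schema = {"id": str, "name": str, "notes": str}
output_schema = change_type(drop_columns(input_schema, {"notes"}), "id", int)
row = apply_schema({"id": "42", "name": "Ada", "notes": "unused"}, output_schema)
```

In this sketch, row contains only the id and name columns, with id converted from a string to an integer. Each function returns a new schema rather than changing its argument, which mirrors the idea that the stage leaves its input data set untouched and writes a modified data set to its output.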

The Modify stage is a processing stage. It can have a single input link and a single output link.

The stage editor has three tabs:

  • Stage. This is always present and is used to specify general information about the stage.
  • Input. This is where you specify details about the input link.
  • Output. This is where you specify details about the modified data being output from the stage.

Input tab

The Columns section specifies the column definitions of incoming data. The Advanced section allows you to change the default buffering settings for the input link.

Output tab

The Columns section specifies the column definitions of the output data. The Maps from input column section, which appears when you click Edit in the Columns section, allows you to specify the relationship between the columns being input to the Modify stage and the output columns. The Advanced section allows you to change the default buffering settings for the output link.