Managing data sets

The parallel jobs use data sets to store data being operated on in a persistent form. You can create and read data sets using the Data Set stage.

InfoSphere® DataStage® parallel jobs use data sets to store data being operated on in a persistent form. Data sets are operating system files, each referred to by a descriptor file, usually with the suffix .ds.

You can create and read data sets using the Data Set stage, which is described in File set stage. InfoSphere DataStage also provides a utility for managing data sets from outside a job. This utility is available from the InfoSphere DataStage Designer and Director clients.