Single Processor and Multi-Processor Systems
The default behavior when compiling IBM® InfoSphere® DataStage® jobs is to run all adjacent active stages in a single process. This makes good sense when you are running the job on a single processor system. When you are running on a multiprocessor system it is better to run each active stage in a separate process so the processes can be distributed among available processors and run in parallel. The enhancements to server jobs at Release 6 of InfoSphere DataStage make it possible for you to stipulate at design time that jobs should be compiled in this way. There are two ways of doing this:
- Explicitly - by inserting InterProcess (IPC) stages between connected active stages.
- Implicitly - by turning on interprocess row buffering either project wide (using the InfoSphere DataStage and QualityStage Administrator) or for individual jobs (in the Job Properties dialog box)
The IPC facility can also be used to produce multiple processes where passive stages are directly connected. This means that an operation reading from one data source and writing to another can be divided into a reading process and a writing process able to take advantage of multiprocessor systems.




