Creating StreamSets Data Collector flows

Create a StreamSets Data Collector flow to define the movement of data from a source system to target systems and the transformation of the data along the way.

You can use a single source stage to represent the source system, multiple processor stages to transform data, and multiple target stages to represent target systems.
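The shape described above (one source, a chain of processors, one or more targets) can be pictured with a small conceptual sketch. This is plain Python and not the StreamSets API: the `Flow` class, the stage functions, and the sample records are all hypothetical names made up for illustration. In the product, you build flows from configured stages rather than from code like this.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# A record is modeled as a plain dictionary (an assumption for this sketch).
Record = Dict[str, object]


@dataclass
class Flow:
    """Conceptual model only: one source, ordered processors, one or more targets."""
    source: Callable[[], List[Record]]
    processors: List[Callable[[Record], Record]] = field(default_factory=list)
    targets: List[Callable[[Record], None]] = field(default_factory=list)

    def run(self) -> int:
        """Pass every source record through each processor, then write it to every target."""
        count = 0
        for record in self.source():
            for process in self.processors:
                record = process(record)
            for write in self.targets:
                write(record)
            count += 1
        return count


def sample_source() -> List[Record]:
    # Hypothetical stand-in for a source stage that reads from a source system.
    return [{"name": "ada", "amount": "10"}, {"name": "grace", "amount": "25"}]


def uppercase_name(record: Record) -> Record:
    # Hypothetical processor: normalize the name field.
    return {**record, "name": str(record["name"]).upper()}


def amount_to_int(record: Record) -> Record:
    # Hypothetical processor: convert the amount field from text to an integer.
    return {**record, "amount": int(str(record["amount"]))}


# Two hypothetical targets: a "warehouse" that keeps full records and an audit log of names.
warehouse: List[Record] = []
audit_log: List[str] = []

flow = Flow(
    source=sample_source,
    processors=[uppercase_name, amount_to_int],
    targets=[warehouse.append, lambda r: audit_log.append(str(r["name"]))],
)
processed = flow.run()
```

In this sketch each record visits the processors in order and is then delivered to every target, which mirrors the single-source, multiple-processor, multiple-target layout described above.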

When you develop a flow, you can use development stages to provide sample data and generate errors to test error handling.

You can use executor stages to run tasks in response to events or to save event information. To process large volumes of data, you can use multithreaded flows.

For more information about configuring a flow, see Configuring a flow.

When you finish developing a flow, run a job to execute the processing that the flow defines. For more information about jobs, see Running StreamSets jobs.
