To generate a new data flow or SQL script for a data preparation profile, select New Data Flow or SQL Script in the profile editor.
The New Data Flow or SQL Script wizard guides you through the following pages:
Specify a name for the flow or the script to be created. If a data flow or an SQL script with the same name already exists, it is replaced.
If you want to work with a database, you must select the database connection on the next page of the wizard.
In the data flow to be created, you can include a Target Table operator by selecting the appropriate check box.
If you include a Target Table operator in the data flow, you must specify the following parameters:
If you reuse an existing table, you must specify the name and the schema of the table. If you create the data flow in offline mode, you must also specify the data model.
You can view a list of schemas and tables in the database or in the selected data model by clicking Browse.
If you reuse a table, it must be compatible with the result table that is specified in the profile. This means that every column of the result table must also exist in the reused table with the same name and the same data type. If the reused table includes additional columns that do not exist in the result table, these columns are filled with NULL values. Therefore these columns must be nullable.
You can create a new table by specifying a name, a schema, and a table space for the table. If you create the data flow in offline mode, you must also specify the data model the table is to be created in.
In online mode, you can specify to create the new table after pressing Finish or when executing the generated flow. In offline mode, the table is created during the execution of the generated flow.
If you selected to create a Target Table operator, you can specify more parameters for the data flow. For example, you can specify whether the target table is not to be logged initially.
If the data flow is going to process thousands of rows, you can specify to run a commit operation every time after a particular amount of rows is processed. If you do not want to use this option, set the commit count to 0.
In the SQL script to be created, you can include a SELECT statement or you can additionally create a table in which the data is to be inserted. You can also create a view.
You can reuse an existing table by specifying the data model and the name and the schema of the table. You can view a list of schemas and tables in the selected data model by clicking Browse.
If you reuse a table, it must be compatible with the result table that is specified in the profile. This means that every column of the result table must also exist in the reused table with the same name and the same data type. If the reused table includes additional columns that do not exist in the result table, these columns are filled with NULL values. Therefore these columns must be nullable.
You can create a new table by specifying the data model and the table space the table is to be created in and a name and a schema for the new table.
To create the data flow, the SQL script, or both, click Finish. Depending on your specification, the Data Flow editor, the SQL Query Builder, or both are opened.