NLS Support environment variables in DataStage

These environment variables are concerned with DataStage®'s implementation of NLS.

Note: You should not change the settings of any of these environment variables other than APT_COLLATION _STRENGTH if NLS is enabled.

APT_COLLATION_SEQUENCE environment variable in DataStage

Set the APT_COLLATION_SEQUENCE environment variable to specify the global collation sequence to be used by sorts, compares, and other operations. This value of APT_COLLATION_SEQUENCE can be overridden at the stage level.

APT_COLLATION_STRENGTH environment variable in DataStage

Set the APT_COLLATION_STRENGTH environment variable to specify the specifics of the collation algorithm. APT_COLLATION_STRENGTH can be used to ignore accents, punctuation, or other details.

APT_COLLATION_STRENGTH is set to one of Identical, Primary, Secondary, Tertiary, or Quaternary. Setting it to Default unsets the environment variable.

APT_ENGLISH_MESSAGES environment variable in DataStage

Set the APT_ENGLISH_MESSAGES environment variable to 1 so that every message that is issued is output with its English equivalent.

APT_EXPORT_FIELD_NAMES_USING_IMPEXP_CHARSET environment variable in DataStage

Set the APT_EXPORT_FIELD_NAMES_USING_IMPEXP_CHARSET environment variable to allow the export operator to output the schema field names by using the import and export charset, which bypasses any schema charset that might be defined.

APT_IMPEXP_CHARSET environment variable in DataStage

Set the APT_IMPEXP_CHARSET environment variable to control the character encoding of ustring data that is imported and exported to and from DataStage, and the record and field properties applied to ustring fields.

Set APT_IMPEXP_CHARSET to an ICU character set:


APT_IMPEXP_CHARSET icu_character_set

APT_INPUT_CHARSET environment variable in DataStage

Set the APT_INPUT_CHARSET environment variable to control the character encoding of data input to schema and configuration files.

Set APT_INPUT_CHARSET to an ICU character set:


APT_INPUT_CHARSET icu_character_set

APT_OS_CHARSET environment variable in DataStage

Set the APT_OS_CHARSET environment variable to control the character encoding DataStage uses for operating system data such as the names of created files and the parameters to system calls.

Set APT_OS_CHARSET to an ICU character set:


APT_OS_CHARSET icu_character_set

If the APT_OS_CHARSET environment variable is not set, the operating system character set is used. The operating system character set is determined from the LANG or LC_ALL locale settings, which have the format language[_territory][.codeset]. If the ICU character set you want to use contains a codeset, the codeset must be specified.

APT_OUTPUT_CHARSET environment variable in DataStage

Set the APT_OUTPUT_CHARSET environment variable to control the character encoding of DataStage output messages and operators like peek that use the error logging system to output data input to the osh parser.

Set APT_OUTPUT_CHARSET to an ICU character set:


APT_OUTPUT_CHARSET icu_character_set

APT_STRING_CHARSET environment variable in DataStage

Set the APT_STRING_CHARSET environment variable to control the character encoding DataStage uses when it converts between string and ustring fields.

Set APT_STRING_CHARSET to an ICU character set:


APT_STRING_CHARSET icu_character_set

ICU_DATA environment variable in DataStage

The ICU_DATA environment variable corresponds to the ICU data directory, which is the default location for all ICU data.

In DataStage, any requests for user-defined character maps from the DataStage flows are resolved to files that are located in the ICU data directory.

The files must be accessible to the conductor and compute pods from a persistent volume. Set the ICU_DATA environment variable to point to the ICU data directory:
ICU_DATA path_to_icu_data

For more information about the ICU data directory, see ICU Data Directory.