Administering DataStage

If you have the required permissions, you can complete administrative tasks for IBM DataStage.

DataStage centralizes core administrative tasks across the platform. As a DataStage administrator, you can complete tasks in the following areas:
  • Creating DataStage runtime instances where jobs run
  • Managing persistent volume claims
  • Managing logging
  • Managing users

You complete administration tasks, such as managing users, managing pods, and collecting logs, by using the Cloud Pak for Data administration console. You can complete project-specific administrative tasks directly from analytics projects. You can manage DataStage runtime instances from the service instances page or by customizing the PX instance resource from the command line.

Service configuration

Task | Required permissions | Details
Creating a service instance for DataStage | Instance administrator | Create a DataStage instance.
Customizing resources in DataStage | Instance administrator | Customize instances for DataStage.
Setting DataStage environment definitions | Instance administrator | Set the environment definitions in DataStage.
Setting up an NFS mount in DataStage | Instance administrator | Set up an NFS mount in DataStage pods by using a persistent volume (PV).
Dynamic workload management in DataStage | Instance administrator | Use dynamic workload management to run DataStage workloads more quickly and efficiently across the available compute pod resources in each PX instance.
Creating and setting the APT_CONFIG_FILE environment variable in DataStage | Instance administrator | Set the environment variable APT_CONFIG_FILE to define the path to the configuration file.
Job log retention in DataStage | Instance administrator | Modify the log retention file on the px-runtime persistent volume to set the log cleanup policy.
Generating a core dump file for px-runtime image | Instance administrator | Generate a core dump file for your px-runtime image when a DataStage job fails.
Storing DataStage job run metrics in the DataStage repository | Instance administrator | Set up metrics storage with the metrics repository.
Installing third-party libraries and creating custom images in DataStage | Instance administrator | Install third-party libraries and use them as part of a DataStage flow.
Adding local scratch devices to DataStage pods | Instance administrator | Ensure that you do not run out of data storage by adding local scratch devices, which use the EmptyDir volume, to DataStage pods.
Enabling the Data service connector for DataStage Enterprise Plus | Instance administrator | Enable the Data service for DataStage Enterprise Plus.
Setting up license files to connect to Db2 for z/OS database with Db2 for DataStage connector | Instance administrator | Set up license certificate files to create the connection.
Setting the time zone in DataStage | Instance administrator | Set the time zone for DataStage.
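
The APT_CONFIG_FILE task in the table above points the parallel engine at a configuration file. As a rough illustration only, the following Python sketch renders text in the general node / fastname / pools / resource layout that parallel engine configuration files use. The function name render_apt_config, the node names, the host name, and the directory paths are all hypothetical placeholders, not values taken from a DataStage installation; the task documentation describes the values that apply to a given PX instance.

```python
def render_apt_config(node_names, fastname, disk_dir, scratch_dir):
    """Render a parallel engine configuration file as text.

    All arguments are illustrative placeholders (assumptions), not
    values from a real DataStage deployment.
    """
    lines = ["{"]
    for name in node_names:
        lines += [
            f'  node "{name}"',
            "  {",
            f'    fastname "{fastname}"',
            '    pools ""',
            f'    resource disk "{disk_dir}" {{pools ""}}',
            f'    resource scratchdisk "{scratch_dir}" {{pools ""}}',
            "  }",
        ]
    lines.append("}")
    return "\n".join(lines) + "\n"


# Hypothetical two-node layout; every name and path here is made up.
example = render_apt_config(
    ["node1", "node2"], "example-host", "/data/disk", "/data/scratch"
)
print(example)
```

The rendered text would then be saved to a file, and APT_CONFIG_FILE would be set to that file's path.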

Storage configuration

Task | Required permissions | Details
Disabling SELinux relabeling | Cluster administrator | Configure IBM® Software Hub pods to skip SELinux relabeling for all bulk operations if you're using a CSI-based storage type.
Runtime storage layout | Instance administrator | DataStage storage consists of a global persistent volume, a local persistent volume, and local image storage.

Security

Task | Required permissions | Details
Exporting IBM Software Hub audit records to a security information and event management solution | Instance administrator | Record user activities for DataStage.

Resource management

Task | Required permissions | Details
Manually scaling resources for services | Instance administrator | Adjust DataStage by scaling resources to support high availability or to increase processing capacity. Resources can be scaled based on predefined resource configurations.
Automatically scaling resources for services | Instance administrator | Enable autoscaling with the OpenShift® Horizontal Pod Autoscaler (HPA). The HPA changes the resource allocation of services by increasing or decreasing the number of pods in response to CPU or memory consumption.
Shutting down and restarting services | Instance administrator | Manually shut down DataStage to prevent it from using resources. You can restart the service when you need to use it again.
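
The way the HPA picks a pod count in response to CPU or memory consumption can be sketched with the formula from the Kubernetes documentation: desired replicas = ceil(current replicas × current metric / target metric), skipped when the ratio is within a tolerance of 1 and bounded by the configured minimum and maximum. The Python below only illustrates that arithmetic. The function name, the default bounds, and the sample utilization figures are assumptions for illustration, and the code does not call any OpenShift or DataStage API.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Approximate the replica count that the HPA formula would choose.

    The default bounds are illustrative assumptions; 0.1 matches the
    default tolerance described in the Kubernetes documentation.
    """
    ratio = current_metric / target_metric
    # Within the tolerance band, the HPA leaves the replica count alone.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp the result to the configured minimum and maximum.
    return max(min_replicas, min(max_replicas, desired))


# Hypothetical example: 2 pods at 90% average CPU against a 50% target.
print(desired_replicas(2, 90, 50))
```

Under these assumptions, sustained load above the target adds pods and sustained load below it removes them, which is the behavior that the autoscaling task enables.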

Backup and restore

Task | Required permissions | Details
Backing up and restoring IBM Software Hub | Cluster administrator | Choose the correct method to back up and restore IBM Software Hub.