DataStage repository
You can store DataStage job run metrics and indexed assets in a separate repository.
Prerequisites
PostgreSQL is the designated database for storing metrics data and is not intended as an operational data store. With PostgreSQL, you can run your own queries for insights into job performance and indexed assets. Depending on your preferences, you can host the PostgreSQL database within the same IBM Cloud Pak® for Data environment, operated on a virtual machine, or managed within a PostgreSQL service.
Creating a connection
Under the Manage tab of your Cloud Pak for Data project, go to DataStage > Repository. You can configure the connection to manage the DataStage repository and to enable persisting metrics and asset indexing. Specify a connection type, configure properties and security details, and test the connection to verify that it works.
ds-metrics. To clear any previous DataStage job run metrics data from the
ds-metrics schema, run the following
commands:drop schema if exists ds_metrics cascade;
drop table if exists public.databasechangelog;
drop table if exists public.databasechangeloglock;indexer schema, run the following
commands:drop schema if exists indexer cascade;The repository username that you specify must have both the permission to create schemas and
create tables in the public schema. To check the permissions, connect to the
database as the specified repository username and run the following query. If the query returns
true, true then the necessary permissions exist:
select has_database_privilege(current_database(), 'create'), has_schema_privilege('public', 'create');
Learn more
Storing DataStage job run metrics in the DataStage repository