Exporting assets from the source system (Watson Knowledge Catalog)
On your source system, export the data that you want to migrate to Cloud Pak for Data.
- Required roles in the source system
- To export assets from Information Server, you must have the Suite Administrator and Common Metadata User roles.
- Order when exporting asset types
- Export the asset types that you want to migrate to the target system in any order. However, it is important to import assets in a specific order. Therefore, the export procedures are also presented in the same order in which the import must be completed.
- Location of the istool command
- In many cases, you use the istool command to export assets. The command is in the installation_directory/Clients/istools/cli directory, where installation_directory is for example /opt/IBM/InformationServer.
- Dividing export files into chunks
- When you use the istool command to export assets, you can specify the
-maxArchiveSize
option to improve the performance. This option creates additional archive files when the size of the export file exceeds the specified number of megabytes. In the example commands, this parameter is set to 500, so the exported files don’t exceed the limit of 500 MB. This is a recommended value for the migration.
See the following sections for the export instructions.
- Bulk export of asset groups by using the export -all command
- Data classes
- Custom attribute definitions
- Bundle assets (open IGC)
- InfoSphere DataStage and QualityStage assets
- Glossary assets
- Lineage filters and report templates
- Automation rules
- Analysis database, automated discovery results, and data rule and rule set run history
- Quick scan results
- Data connections with mappings
Bulk export of asset groups by using the export -all command
You can use the istool export -all command to export many asset types at the same time. Alternatively, if you’d rather export asset groups one by one, you can use commands specific to the asset group. Exporting asset groups one by one might be the preferred solution when you want to use additional options with the export commands.
The following table provides a list of asset groups which are exported by the istool
export -all
command, the alternative commands for each asset group, and links to resources
with more information about each command.
Asset group | Alternative istool command | More information (Information Server documentation) |
---|---|---|
Common metadata assets:
|
export -commonmetadata Options automatically used by the
export
-all command:
|
Export command for common metadata assets |
Data quality assets:
|
export -ia Options automatically used by the
export -all command:
|
Export command for InfoSphere Information Analyzer assets |
Extension mapping documents and extended data sources | export -igc |
Extension mappings export command |
InfoSphere Metadata Asset Manager import areas and data connections (staging area assets are not exported) | export -imam |
Export command for import areas |
InfoSphere Streams assets | export -streamsEndpoint |
Export command for InfoSphere Streams assets |
- Export all assets at the same time
- To export all assets at the same time, run this
command:
./istool.sh export -dom host:port -username username -password password -archive "/tmp/iisassets.isx" -maxArchiveSize 500 -all -Xmx4096M
- Export each asset group individually
- To export each asset group individually, run these commands.
Data classes
./istool.sh export -dom host:port -username username -password password -archive "/tmp/dataclasses.isx" -dca '/*.dc /*.als -inccls -incca -incannot' -maxArchiveSize 500 -Xmx4096M
The
-Xmx
option specifies the max heap size to use when the command runs. For more
information about the command, see the Export command for data class and data collection analysis
assets topic in the Information Server documentation.Custom attribute definitions
- Log in to Information Governance Catalog classic. Access it by using this
URL:
https://source-host-name/ibm/iis/igc/
- Go to .
- Select all custom attribute definitions and, from the menu, select Export.
- Save the XML file.
Bundle assets (open IGC)
- Open the Information Governance Catalog REST API by using this
URL:
https://source-host-name/ibm/iis/igc-rest-explorer/
- Use the following commands to export bundle assets and flows. You must repeat the procedure for
all bundles and flows.
- To export bundle assets, expand the
bundles
section and use theGET/bundles/assets
command. Find the name of a bundle by usingGET/bundles
command and provide it in the family parameter. Click Try it out. Save the XML file with the results. - To export flows, expand the
flows
section and use theGET/flows/
command. In the asset parameter, specify the flowUnit IDs. If you use more than one value, separate them with commas. Click Try it out. Save the XML file with the results.
- To export bundle assets, expand the
InfoSphere DataStage and QualityStage assets
./istool.sh export -dom host:port -username username -password password -archive "/tmp/project_assets.isx" -ds '"dsServer/project/folder/asset_name.suffix"' -maxArchiveSize 500 -Xmx4096M
The
-Xmx
option specifies the max heap size to use when the command runs. The
"dsServer/project/folder/asset_name.suffix"
option specifies the paths of assets to
be added to the exported file. For example, to export all assets in the project
dstage1
, run this
command:./istool.sh export -dom host:port -username username -password password -archive "/tmp/dstage1_assets.isx" -ds 'IIDEMO5/dstage1/*/*.*' -maxArchiveSize 500 -Xmx4096M
Run
this command for each project that you want to export.For more information, see the Export command for InfoSphere DataStage and QualityStage assets and Asset paths for InfoSphere DataStage and QualityStage assets topics in the Information Server documentation.
Glossary assets
Glossary assets include terms, categories, information governance rules, information governance policies, and labels. You must export them to an XML file.
./istool.sh glossary export -dom host:port -username username -password password -filename "/tmp/glossaryassets.xml" -format XML -allcategories -allpoliciesrules -includeassignedassets -includestewardship -includelabeledassets -Xmx4096M
The
-Xmx
option specifies the max heap size to use when the command runs. For more
information about this command, see the Glossary export command topic in the Information Server
documentation.Lineage filters and report templates
- In Information Governance Catalog classic, go to .
- To export lineage filters:
- Open Manage Lineage Filters.
- Select the filters that you want to export.
- From the menu, select Export.
- Save the file.
- To export lineage report templates:
- Open Manage Lineage Report Templates.
- Select the templates that you want to export.
- From the menu, select Export.
- Save the file.
Automation rules
/opt/IBM/InformationServer/ASBServer/bin/IAAdmin.sh -user username -password password -url https://host:port -getDataQualityConfigurationRule
For
more information about the command, see the Commands to import, export, and delete automation rules topic
in the Information Server documentation.Analysis database, automated discovery results, and data rule and rule set run history
Data type | db_name | schema_name |
---|---|---|
Analysis database | IADB | IAUSER |
Automated discovery results | XMETA | AUDIT_TRAIL |
Data rule and rule set run history | XMETA | ASSET_RELATION |
- In the source system, log in to the database as the
db2inst1
user and set up the environment to run Db2 commands by running this command:. ~/sqllib/db2profile
- Create the database directory in the
/tmp
directory.mkdir /tmp/db_name chmod 777 /tmp/db_name cd /tmp/db_name
- Export database assets by running this
command:
db2move db_name export -aw -l lobs -sn schema_name
Quick scan results
- Log in to the solr pod on the source
system:
kubectl exec -it solr-0 bash
- Create temporary
directories:
mkdir /var/solr/data/solr_analysis_backup
mkdir /var/solr/data/solr_discovery_backup
- Run the following backup
commands:
curl -k 'http://localhost:8983/solr/analysis/replication?command=backup&location=/var/solr/data/solr_analysis_backup'
curl -k 'http://localhost:8983/solr/discovery/replication?command=backup&location=/var/solr/data/solr_discovery_backup'
- To display the status of the backup process, run these
commands:
curl -k 'http://localhost:8983/solr/analysis/replication?command=details&wt=xml'
curl -k 'http://localhost:8983/solr/discovery/replication?command=details&wt=xml'
- Copy the
solr_analysis_backup
andsolr_discovery_backup
directories from the solr pod to thetmp
directory on the source system.
Data connections with mappings
You must export data connections separately so that you can later import them with a mapping document. The mapping document is used to specify the source and target host names. It is required when the source and target host names differ.
./istool.sh export -dom host:port -username username -password password -archive "/tmp/dataconnections.isx" -cm '/*/*.dcn' -maxArchiveSize 500 -Xmx4096M
The
-Xmx
option specifies the max heap size to use when the command runs. For more
information about this command, see the Export command for common metadata assets topic in the
Information Server documentation.