Preparing to install IBM Knowledge Catalog
Plan and prepare to install IBM Knowledge Catalog.
Before you begin
Complete these tasks before you install one of the IBM Knowledge Catalog editions.
Determining the optional features to enable
For each of the IBM Knowledge Catalog editions, you can enable several optional features during installation, upgrade, or at any time later.
- IBM Knowledge Catalog
-
enableDataQuality: false enableKnowledgeGraph: false useFDB: false - IBM Knowledge Catalog Premium
-
enableDataQuality: false enableKnowledgeGraph: false enableSemanticAutomation: false enableSemanticEmbedding: false enableSemanticEnrichment: true enableAISearch: false enableTextToSql: false enableModelsOn: cpu useFDB: false - IBM Knowledge Catalog Standard
-
enableKnowledgeGraph: false enableSemanticAutomation: false enableSemanticEnrichment: true enableAISearch: false enableModelsOn: cpu useFDB: false
| Feature | Component needed | Entry in the custom resource |
|---|---|---|
| Lineage Availability:
You can install one of these lineage services:
|
Knowledge graph |
|
| Relationship explorer Availability:
|
Knowledge graph |
|
| Set FoundationDB as the database to use to store
the data generated by knowledge graph. By default, the Neo4j graph database is installed during installation or upgrade. This graph database is required if you want to use IBM Manta Data Lineage as your lineage service.
Availability:
|
FoundationDB graph database |
|
| Business-term relationship search Availability:
Business-term relationship search is available with both types of graph database. |
Knowledge graph |
|
| Data quality features Availability:
|
Data quality |
|
| Gen AI based capabilities in metadata
enrichment You must set both parameters. Availability:
If you enable this feature, you can work with the default
See Deployment mode. |
Gen AI based enrichment |
|
|
Deployment mode for the models that are used for the gen AI capabilities Availability:
This option determines where the foundation models run that are used with the gen AI capabilities in metadata enrichment.
You must also set If you switch from using models on CPU to a different deployment mode later, make sure to stop
the
semantic-text-generation pod where the CPU models run to avoid
conflicts. |
Model deployment mode |
|
| Generate embeddings Availability:
Enable computation of vectors for the metadata and the data during metadata enrichment. These vectors are used, for example, for generating SQL queries from natural-language text (Text to SQL feature). This settings is also required if you want to enable the Text to SQL feature. You must also set |
Embeddings generation |
|
| Text-To-SQL capabilities Availability:
Enable Text-to-SQL capabilities for generating SQL queries from natural language input. These capabilities can be used for creating query-based data assets, for example, for data products, SQL-based data quality rules, or in searches. If you enable this feature, you must also set
With You can set a custom model instead:
enableSemanticEnrichment: true.
|
Text-to-SQL capabilities |
|
| AI search Enable LLM-based search for assets and artifacts across all workspaces. Availability:
|
AI search |
|
- For more information about enabling these features during installation, see Installing IBM Knowledge Catalog .
- For more information about enabling these features during the upgrade, see the topic that is specific to your product version in the Upgrading IBM Knowledge Catalog section.
- For more information about enabling these features after the initial install or upgrade, see Enabling optional features after installation or upgrade for IBM Knowledge Catalog.
Generative AI capabilities per model deployment mode
Depending on your requirements, you must specify different combinations of installation parameter
for your generative AI setup in addition to enableSemanticAutomation: true. Some
further service-specific configuration might be required after the installation is complete. Check
the following table for possible configuration options:
| Model deployment mode | Capabilities | Additional settings in IBM Knowledge Catalog |
|---|---|---|
enableModelsOn: cpuFor details, see |
Generation of names and descriptions for tables and columns, and generation and assignment of business terms in metadata enrichment (Designing metadata enrichment) | Enable gen AI based capabilities in the metadata enrichment project settings. |
enableModelsOn: gpuFor details, see |
|
|
enableModelsOn: remoteFor details, see A connection to the remote watsonx.ai instance must be configured in . See Connecting to a remote watsonx.ai instance. |
Custom models must be set in . |
|
Certified foundation models for metadata enrichment and Text-to-SQL
- Metadata enrichment
-
- meta-llama/llama-3-3-70b-instruct 5.3.1 and later
- openai/ gpt-oss-120b 5.3.1 and later
- ibm/ granite-4-h-small
- ibm/granite-3-3-8b-instruct
- ibm/granite-8b-code-instruct
- Text-to-SQL
-
- meta-llama/llama-3-3-70b-instruct 5.3.1 and later
- openai/ gpt-oss-120b 5.3.1 and later
- meta-llama/llama-4-maverick-17b-128e-instruct-fp8
Models that are identified as certified models have undergone evaluation with the gen AI capabilities in IBM Knowledge Catalog. Models that are not certified are not guaranteed to work as expected, and their accuracy and performance can vary.
- Supported foundation models in watsonx.ai on IBM Software Hub
- Supported foundation models in watsonx.ai
- Billing details for generative AI assets in watsonx.ai Runtime for models in watsonx.ai as a Service