SSAO5N - Bot Index

Welcome
Overview
- What's new in watsonx.data
- watsonx.data deployment options and plans
- Platform UI and Console UI comparison
- Platform architecture
  - Asset types and properties
  - Object storage for workspaces
- AI assistants and agents
- Services
- FAQs
- Known issues for the platform UI
- Known issues for the console UI on IBM Cloud
- Known issues on AWS
- Getting help
Getting started and tutorials
- Signing up for the Lite plan
- Joining your organization's watsonx.data account
- Switching between experiences
- Creating task credentials
- Generating an API key and bearer token
- watsonx APIs and SDKs
- Tutorials
- AI solution accelerators
  - Q&A RAG accelerator
  - Medallion accelerator
Projects
- Shared projects across experiences
- Creating a project
  - Importing a project
  - Importing project assets
- Administering projects
- Managing assets in projects
- Downloading data assets
- Choosing compute resources for tools
- Managing compute resources
- Creating and managing jobs
- Adding catalog assets to a project
- Publishing assets to a catalog
- Leaving a project
- Markdown cheatsheet
Preparing data
- Adding data to a project
- Connectors
- Adding platform connections
  - Managing collaborators on platform connections
- Building custom connectors with Connector forge
  - Creating and deploying custom connectors
  - Troubleshooting custom connectors
- Parametrized connections
- Data protection with data source definitions
- Orchestrating tasks with Orchestration Pipelines
Managing watsonx.data infrastructure
- Presto (Java)
- Presto (C++)
- watsonx.data Spark engine
- Apache Gluten accelerated Spark engine
- Catalogs
- Metadata Service
- Data Access Service (DAS)
- Milvus
- Query Optimizer
- Presto (Java) mixed-case support
- API customization
- Data Gate
- Accessing data in external data platforms
- Metadata Service
- Resource groups
- Access management and governance
- Gathering diagnostics
- OpenTelemetry
- IBM Manta Data Lineage
- Customizing max pool size in Metadata Service
- Provisioning a Presto (Java) engine
- Provisioning a Presto (C++) engine
- Provisioning a serverless Spark engine for Lite plan
- Provisioning a Spark engine
- Provisioning Apache Gluten accelerated Spark engine
- Managing watsonx.data Spark
  - Customization overview
  - Managing Spark engine capacity
  - Managing native Spark engine details
- Registering an engine
- Managing engines
- Associating a catalog with an engine
- Exploring the catalog objects
- Dissociating a catalog from an engine
- Configuring Presto resource groups
- Adding storage
  - IBM Cloud Object Storage
  - Amazon S3
  - IBM Storage Ceph
  - MinIO
  - Hadoop Distributed File System (HDFS)
  - Google Cloud Storage
  - Azure Data Lake Storage
  - Apache Ozone
  - Custom S3 storage
- Adding multiple Apache Iceberg catalogs to a single storage
- Exploring the storage details and objects
- Editing storage details
- Deleting a storage-catalog pair
- Setting up GlusterFS replicated storage with MinIO
- Disabling or enabling ACL on an ACL-enabled storage
- Registering external data into
- Adding data source
  - Apache Druid
  - Apache Kafka
  - Apache Phoenix
  - Apache Pinot
  - Amazon Redshift
  - BigQuery
  - Apache Cassandra
  - ClickHouse
  - HANA
  - IBM Db2 for i
  - Elasticsearch
  - IBM Data Virtualization Manager
  - IBM Db2
  - IBM Netezza
  - IBM Db2 for z/OS
  - IBM Informix
  - MongoDB
  - MySQL
  - Oracle
  - PostgreSQL
  - Prometheus
  - Redis
  - SingleStore
  - Snowflake
  - SQL Server
  - Teradata
  - Custom
  - Arrow Flight service
    - Apache Derby
    - Greenplum
    - MariaDB
    - Salesforce
- Updating data source credentials
- Editing data source details
- Deleting a data source-catalog pair
- Managing IAM access for
- Managing user access
- Managing roles and privileges
- Managing data policy rules
- Common Policy Gateway (CPG) connector
- Enabling or disabling common policy gateway engines
- Protecting your lakehouse with context-based restrictions
- Introduction to OpenRAG
- Quick start: Provision OpenRAG and OpenSearch
- Adding an OpenRAG service
- Astra DB in watsonx.data
  - Adding an Astra DB service
  - Terminating an Astra DB service
  - Viewing Astra DB database details
  - Creating an application token
  - Creating a custom role in Astra DB
  - Managing access control for Astra DB service
- Semantic automation for data enrichment
  - Registering and activating semantic layer
  - Enriching data with semantic automation layer
  - Performing semantic searches
- Driver manager
- Billing and usage
- Connecting to Presto server
- Account‑scoped metadata model
Engineering data
- Engineering structured data
- Working with Spark
- Querying data with Data workbench
  - Creating a data product
  - Table Optimizer
    - Table Optimizer configuration options
- Interacting with data through an MCP server
  - Setting up the remote MCP server
  - Setting up a local MCP server
- Finding and querying data in metastores
- Getting connection information
- SQL statements, data types and mixed-case behavior supported by Presto
- IBM Cloud Pak for Data Command Line Interface (IBM cpdctl)
  - Downloading and installing IBM Cloud Pak for Data Command Line Interface (IBM cpdctl)
  - Supporting commands and usage for watsonx.data in IBM cpdctl
Analyzing data
- Notebooks and scripts
- Analyzing and processing data with Spark
  - Manage your Spark jobs
Building a RAG solution
- Terms of use
- Tokens
- Supported foundation models
- Curating and integrating unstructured data
- Building prompts
- Adding Milvus service
- Connecting to Milvus service
- Working with Milvus
- Pause and resume Milvus service
- Connecting watsonx Assistant to Milvus for custom search
- Using the Milvus backup tool
- Using the Vector Transport Service
- Optimizing your RAG knowledge base
- Retrieval service
- Integrating your RAG pipeline with AI agents
  - IBM watsonx.data local Model Context Protocol (MCP) server
    - Integrating with watsonx Orchestrate
    - Integrating with other agentic framework
  - IBM watsonx.data remote Model Context Protocol (MCP) server
    - Integrating with watsonx Orchestrate
    - Integrating with LangChain agentic framework
Data governance
- Catalogs
  - Administering a catalog
  - Catalog assets
- Categories
- Business Terms
- Classifications
  - Designing classifications
  - Predefined classifications
- Data Classes
- Reference Data
- Policies
  - Designing policies
- Governance rules
  - Designing governance rules
- Data protection rules
- Data quality SLAs
  - Designing data quality SLAs
  - Managing data quality SLAs
- Data lineage
  - Lineage for unstructured data
- Managing IBM watsonx.data intelligence
Administration
- Administration on IBM Cloud
- Administration on AWS
- Administration on Azure
  - Setting up watsonx.data on Azure with a user-managed data plane
- Integrations
- Troubleshooting
- Managing the user API key
- Managing your settings on IBM Cloud
- Activity Tracker Event Routing
- Managing your cloud account
Auditing events for watsonx.data
Logging for watsonx.data
Monitoring Presto engine JMX metrics with Sysdig on IBM Cloud
Milvus metrics
Metering and usage experience
Default limits and quotas for Spark engine
Default instance limits for engines and services
IBM watsonx.data pricing plans
Architecture and concepts in serverless instances
Best practices
Getting connection information
Presto exposed JMX metrics
Metrics exposed by Milvus
Mixed-case behavior
Understanding your responsibilities when using watsonx.data
High availability and disaster recovery
Disaster scenarios in watsonx.data
Presto update process for watsonx.data
Configuration properties for Presto (Java) - coordinator and worker nodes
JVM properties for Presto (Java) - coordinator and worker nodes
Catalog properties for Presto (Java)
Event listener properties for Presto (Java)
Configuration properties for Presto (C++) - worker nodes
Configuration properties for Presto (C++) - coordinator nodes
Catalog properties for Presto (C++)
Velox properties for Presto (C++)
JVM properties for Presto (C++) - coordinator nodes
Global properties for Presto (C++)
LogConfig worker properties for Presto (C++)
LogConfig coordinator properties for Presto (C++)
Resource group properties
Glossary

SSAO5N - Documentation Index

Table of Contents