Indexing object text with Content Search Services

Content Search Services provides full-text search capabilities for Content Platform Engine object stores. Configuration involves setting up index areas, enabling classes for indexing, selecting text languages, and tuning the indexing process.

Overview

Content Search Services indexing enables users to search for documents based on their text content. Configuration involves setting up the infrastructure and policies for indexing content, processing text, and executing search queries.

Configuration is performed at the object store level using the Administration Console for Content Platform Engine. After initial configuration, Content Search Services automatically indexes new and modified documents in CBR-enabled classes.

Key configuration areas

Configuring Content Search Services indexing involves these main areas:

Index areas
Index areas define the connection between your object store and Content Search Services. You must create at least one Content Search Services index area before content can be indexed. Index areas store the full-text indexes and can be partitioned for large-scale deployments.
CBR-enabled classes
Content-based retrieval (CBR) must be enabled for each document class that you want to index. You can enable CBR for individual classes or for all classes in the object store. Only documents in CBR-enabled classes are added to the full-text index.
Text languages
Select the text languages that reflect the languages of your documents. Language selection affects how text is analyzed and indexed, improving search accuracy and relevance. You can specify multiple languages and set a default language for the object store.
Indexing process
The indexing process consists of preparation and pipeline phases. During preparation, documents are queued for indexing and text is extracted. During the pipeline phase, extracted text is processed and added to the index. You can configure workload distribution, text extraction settings, and pipeline parameters to optimize indexing performance.
Index maintenance
After initial configuration, you maintain indexes by managing index areas, controlling resource status, configuring partitioning, and performing reindexing operations when needed. Regular monitoring helps ensure optimal search performance.

Configuration workflow

Follow this general workflow when configuring Content Search Services indexing:

  1. Verify that Content Search Services is installed and the Content Platform Engine is configured to connect to it
  2. Create one or more Content Search Services index areas for your object store
  3. Enable CBR for the document classes that you want to index
  4. Select text languages for the object store
  5. Configure the indexing process parameters (workload, text extraction, pipeline settings)
  6. Enable the indexing queue sweep to begin processing documents
  7. Monitor indexing status and tune performance as needed

The topics in this section provide detailed procedures for each configuration area.

Migration considerations

If you are planning to migrate from Content Search Services to Elasticsearch or OpenSearch, you can configure dual mode indexing to index content to both engines simultaneously during the migration period. This allows you to validate Elasticsearch or OpenSearch functionality while maintaining Content Search Services search capabilities.