NVIDIA Multimodal Processing Engine settings
This section enables you to configure the NVIDIA Multimodal pipeline.
DocumentProcessor custom resource (CR) supports the following configuration
options for controlling the NVIDIA Multimodal
file processing engine. Each option affects content chunking, processing performance, and search
quality. When you select a processing approach, consider the tradeoffs among these factors.- NVIDIA Multimodal
DocumentProcessorCR configuration options and considerations -
Option Valid option values Description Considerations spec>tasks>name>nvidiaExtract>extractTextTrue/False(Default:
True)Extracts the actual text from a document, including the text in tables, charts, and infographics. Extract text is the most performant for ingestion. spec>tasks>name>nvidiaExtract>extractTablesTrue/False(Default:
True)Extracts text by summarizing each table. A second chunk is created if extractTextis also selected.spec>tasks>name>nvidiaExtract>extractChartsTrue/False(Default:
True)Extracts text by summarizing each chart. A second chunk is created if extractTextis also selected.spec>tasks>name>nvidiaExtract>extract>infographicsTrue/False(Default:
True)Extracts text by summarizing each infographic. A second chunk is created if extractTextis also selected.spec>tasks>name>nvidiaExtract>extractImagesTrue/False(Default:
True)Extracts caption by summarizing each image. Impacts the time of ingestion. spec>tasks>name>nvidiaExtract>tableOutputFormatEnumeration:
markdown,pseudo_markdown,simple(Default:
pseudo_markdown)Extracted tables are returned in a particular format to be returned as the text that is vectorized. N/A spec>tasks>name>nvidiaExtract>extractTextDepthEnumeration:
page,document(Default:
page)Extracts text at levels of granularity. Impacts the number of chunks that are stored and the size of text returned. spec>tasks>name>nvidiaSplit>chunkSizeNumeric
(Default:
1024)Size of each chunk in number of bytes. For more information on all splitting options, see Split Documents.
spec>tasks>name>nvidiaSplit>chunkOverlapNumeric
(Default:
150)Number of bytes to overlap between consecutive chunks. spec>tasks>name>nvidiaSplit>tokenizerString
(Default:
meta-llama/Llama-3.2-1B)Identifies the embedding model to split the content with. spec>tasks>name>nvidiaSplit>splitSourceTypesArray of Strings
(Default:
empty)Identifies the file types to apply the prior nvidiaSplitconfiguration.During the Creating a domain and connecting with data source procedure, the CAS user interface creates a
Domaincustom resource (CR) and an associated NVIDIA MultimodalDocumentProcessorCR. TheDocumentProcessorCR configures the processing engine to use the default settings, which include extracting text only and processing content on a per‑page basis. To apply more configuration options, you must create or modify the NVIDIA MultimodalDocumentProcessorCR by using the OpenShift console or the CLI.
Configuring NVIDIA Multimodal pipeline
Domain CR in the CAS user interface, a corresponding
DocumentProcessor CR is generated for that domain. Update the spec
section of the DocumentProcessor CR with the required configuration options.
- Find the
DocumentProcessorCR that is associated with theDomain.Tip: TheDocumentProcessorCR has the same name as theDomain.Example:-
oc get domains.cas.isf.ibm.com -n ibm-cas -
oc get documentprocessors.cas.isf.ibm.com -n ibm-cas
-
- View the current configuration of the
DocumentProcessorCR.Example:oc get documentprocessors.cas.isf.ibm.com mydomain -n ibm-cas -o yaml - Ensure that the type is
nvidia_multimodal. Example:type: nvidia_multimodal - Modify the
DocumentProcessorCR according to the options added in the table. - Save the
DocumentProcessorCR. - For the changes to take effect, the associated processing engine pods must be restarted. To
restart the processing engine pods, follow these steps:
- List the
Deploymentsin theibm-casnamespace.Example:oc get deployment -n ibm-cas - Find the
Deploymentthat has the same name as theDomain. Example:mydomain - Delete the
Deployment.Example:oc delete deployment/mydomain -n ibm-cas - List the
Podsin theibm-casnamespace.Example:oc get pods -n ibm-cas - Find the CAS Operator Controller
Podwith a name that begins withibm-isf-cas-operator-controller-manage. Example:ibm-isf-cas-operator-controller-manager-6dd8f5dc86-pjdrs - Delete the
Pod.Example:oc delete pod/ibm-isf-cas-operator-controller-manager-6dd8f5dc86-pjdrs -n ibm-cas
- List the
- Results
-
- The CAS Operator Controller Pod restarts and reconciles the updated
DocumentProcesssorCR. - During reconciliation, the operator re-creates the
Deploymentassociated with theDomainandDocumentProcessorCRs by using the updated configuration. The corresponding processing engine pods are then started with the new settings.
- The CAS Operator Controller Pod restarts and reconciles the updated