Enabling enhanced extraction

Enable enhanced extraction to extract text and metadata from complex document types.

Before you begin

Make sure that you complete the following prerequisites:

About this task

By default, document classes are not enabled for enhanced extraction. When you enable enhanced extraction for a document class, the feature applies to the class and all subclasses. Documents are automatically processed by the configured handlers when they are added to the object store.

Procedure

Complete the following steps to enable enhanced extraction for a document class.

  1. In the administration console, navigate to the object store where you want to enable enhanced extraction.
    1. In the domain navigation pane, select the object store.
    2. In the object store navigation pane, select Data Design > Classes.
  2. Select Document or any document subclass for which you want to enable enhanced extraction.
    To process all incoming documents with enhanced extraction handlers, enable enhanced extraction for the base Document class.
  3. On the General tab, select the Enhanced Extraction enabled checkbox.
    The Enhanced Extraction enabled checkbox appears below the Persistent Text Extracts enabled checkbox. If the checkbox is already selected and appears unavailable, the option is inherited from the parent class.
    Note: You can enable enhanced extraction for a document class only if persistent text extraction is enabled for that class.
  4. Click Save to apply the changes to the document class.

Results

Enhanced extraction is enabled for the document class and its subclasses. When documents are uploaded or updated, the configured handlers process them.

What to do next

Verify that the extraction handlers are working correctly by uploading a test document and checking that the appropriate annotations are created.