IBM Content Analytics with Enterprise Search, Version 3.0.0

Annotators

IBM® Content Analytics with Enterprise Search provides a number of UIMA annotators for advanced text analysis.

When documents are processed through the document processing pipeline, the annotators extract concepts, words, phrases, classifications, and named entities from unstructured content and mark these extractions as annotations. The annotations are added to the index as tokens or facets and are used as the source for content analysis. Some annotators support user-defined dictionaries, user-defined rules, and custom configurations.
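As a rough illustration of the idea of marking extractions as annotations, the following sketch uses plain Java and deliberately avoids the UIMA API. The class DictionaryAnnotatorSketch, its nested Annotation type, and the facet name "Part" are hypothetical names chosen for this example; they are not part of the product or of UIMA. The sketch assumes a simple user-defined dictionary that maps terms to facet names and records each match as a character span.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Toy stand-in for a dictionary-based annotator. This is NOT the UIMA API;
// all names here are hypothetical and exist only for illustration.
public class DictionaryAnnotatorSketch {

    /** A marked span of text: character offsets plus the facet it is filed under. */
    public static final class Annotation {
        public final int begin;          // inclusive start offset
        public final int end;            // exclusive end offset
        public final String facet;       // assumed facet name from the dictionary
        public final String coveredText; // the text between begin and end

        Annotation(int begin, int end, String facet, String coveredText) {
            this.begin = begin;
            this.end = end;
            this.facet = facet;
            this.coveredText = coveredText;
        }

        @Override
        public String toString() {
            return facet + "[" + begin + "," + end + ")=" + coveredText;
        }
    }

    /** Marks every case-insensitive occurrence of each dictionary term in the text. */
    public static List<Annotation> annotate(String text, Map<String, String> dictionary) {
        List<Annotation> annotations = new ArrayList<>();
        for (Map.Entry<String, String> entry : dictionary.entrySet()) {
            String term = entry.getKey();
            if (term.isEmpty()) {
                continue; // an empty term would match everywhere
            }
            int i = 0;
            while (i + term.length() <= text.length()) {
                if (text.regionMatches(true, i, term, 0, term.length())) {
                    int end = i + term.length();
                    annotations.add(new Annotation(i, end, entry.getValue(), text.substring(i, end)));
                    i = end; // skip past the match so the same term does not overlap itself
                } else {
                    i++;
                }
            }
        }
        return annotations;
    }

    public static void main(String[] args) {
        Map<String, String> dictionary = Map.of("brake", "Part");
        String text = "The brake pedal failed. Brake fluid leaked.";
        for (Annotation a : annotate(text, dictionary)) {
            System.out.println(a);
        }
    }
}
```

In a real pipeline the annotator would write its annotations into the shared analysis structure that UIMA passes between annotators, and the product would decide how those annotations are indexed; the sketch only shows the span-plus-label shape of an annotation.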

When configuring the document processing pipeline for a collection, an administrator selects the annotators to be used. Some of the key functions the annotators support include:



Last updated: May 2012

© Copyright IBM Corporation 2004, 2012.