IBM Content Analytics with Enterprise Search, Version 3.0.0                  

Linguistic support for semantic search

IBM® Content Analytics with Enterprise Search provides linguistic support for semantic search in most Indo-European languages and Asian languages, including Japanese.

You can use the linguistic support to improve the quality of search results.

Linguistic processing is performed in two stages: when a text document is processed to be added into the index, and when a user enters a query.

IBM Content Analytics with Enterprise Search includes basic linguistic functions that are used to determine the language of an input document and to segment the document input stream into words or tokens.

If you know that your searches will be restricted primarily to basic facet value searches or native XML searches that uses the document structure, the included linguistic processing adequately covers your needs.

Most information in text documents is unstructured, which makes it difficult to use effectively because it is not easy to access the meaning of the information.

Searching for keywords is simple, but it is not always satisfactory if you want to search beyond the mere words in the document, as is illustrated in the following examples:

Feedback

Last updated: May 2012

© Copyright IBM Corporation 2004, 2012.
This information center is powered by Eclipse technology. (http://www.eclipse.org)