Linguistic support in Watson Content Analytics

The linguistic analysis functions that are provided with Watson Content Analytics include document language detection and segmentation.

When a document is processed, parsing and tokenization functions determine the language of that document and break up the stream of input text into distinct units, or tokens.

During a search, the user or search application must specify the query language. The query string is segmented and analyzed, and then the index is searched.
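
The flow described above (segment documents into tokens, index them, then segment the query the same way before searching the index) can be illustrated with a minimal Python sketch. This is not the Watson Content Analytics API. The names tokenize, build_index, and search are hypothetical, the regular-expression tokenizer is a stand-in for real language-specific segmentation, and the language parameter is only a placeholder for the query language that the user or search application must specify.

```python
import re
from collections import defaultdict


def tokenize(text, language="en"):
    """Split text into lowercase word tokens.

    Assumption: a simple regex stands in for language-specific
    segmentation. The language argument is accepted but not used
    in this toy version.
    """
    return [token.lower() for token in re.findall(r"\w+", text)]


def build_index(documents, language="en"):
    """Build a toy inverted index that maps each token to document IDs."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for token in tokenize(text, language):
            index[token].add(doc_id)
    return index


def search(index, query, language="en"):
    """Segment the query, then return IDs of documents that contain every query token."""
    tokens = tokenize(query, language)
    if not tokens:
        return set()
    # Copy the first posting set so the index itself is never modified.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


documents = {
    1: "Watson analyzes text.",
    2: "Text is split into tokens.",
}
index = build_index(documents)
print(search(index, "text tokens", language="en"))  # prints {2}
```

Using the same tokenize function for both documents and queries is the key design point in this sketch: a query token can only match an index entry if both were segmented and normalized in the same way.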

Document and query string analysis includes: