Content Analytics Studio for advanced text analytics

Content Analytics Studio is a development environment for building and testing custom text analysis engines for target business domains. You do not need specialist knowledge of the underlying natural language processing technologies or of the Unstructured Information Management Architecture (UIMA). By using Content Analytics Studio, you can develop text analysis engines without writing any code.

You can use Content Analytics Studio for the following tasks:
  • Build language-specific and domain-specific terms into dictionaries.
  • Develop character rules to recognize patterns of characters that represent specific types of information, such as telephone numbers or email addresses.
  • Develop parsing rules to identify patterns of text that represent particular concepts, such as person names.
  • Annotate text and browse the contents of each annotation.
  • Export a UIMA pipeline directly to Watson Explorer Content Analytics and automatically create fields and facets that are mapped to the annotations.
  • Analyze documents with a Watson Explorer Content Analytics pipeline and view the resulting annotations in Content Analytics Studio.
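
Content Analytics Studio itself requires no code, so the following Python sketch is only a conceptual illustration of what dictionaries, character rules, and parsing rules do. It is not the Content Analytics Studio or UIMA API. Every name in it (Annotation, DICTIONARY, CHARACTER_RULES, and the functions) is hypothetical, and the patterns are deliberately simplified assumptions.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    kind: str    # annotation type, for example "PhoneNumber"
    begin: int   # start offset in the text
    end: int     # end offset (exclusive)
    text: str    # covered text


# Hypothetical domain dictionary: lowercase term -> annotation type.
DICTIONARY = {"alice": "FirstName", "smith": "LastName"}

# Hypothetical character rules: annotation type -> simplified character pattern.
CHARACTER_RULES = {
    "PhoneNumber": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
    "EmailAddress": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
}


def dictionary_annotations(text):
    """Annotate every word that appears in the dictionary."""
    found = []
    for match in re.finditer(r"\w+", text):
        kind = DICTIONARY.get(match.group().lower())
        if kind:
            found.append(Annotation(kind, match.start(), match.end(), match.group()))
    return found


def character_rule_annotations(text):
    """Annotate every span that matches a character pattern."""
    found = []
    for kind, pattern in CHARACTER_RULES.items():
        for match in pattern.finditer(text):
            found.append(Annotation(kind, match.start(), match.end(), match.group()))
    return found


def parsing_rule_annotations(text, words):
    """Parsing rule: a FirstName followed by a LastName forms a PersonName."""
    ordered = sorted(words, key=lambda a: a.begin)
    found = []
    for first, second in zip(ordered, ordered[1:]):
        gap = text[first.end:second.begin]
        if first.kind == "FirstName" and second.kind == "LastName" and gap.isspace():
            found.append(Annotation("PersonName", first.begin, second.end,
                                    text[first.begin:second.end]))
    return found


def annotate(text):
    """Run the dictionary, character rules, and parsing rule over the text."""
    words = dictionary_annotations(text)
    names = parsing_rule_annotations(text, words)
    patterns = character_rule_annotations(text)
    return sorted(words + names + patterns, key=lambda a: a.begin)


for annotation in annotate("Contact Alice Smith at 555-123-4567 or asmith@example.com."):
    print(annotation.kind, annotation.begin, annotation.end, repr(annotation.text))
```

The parsing rule builds on the dictionary annotations rather than on raw text. This mirrors the layering described above, where higher-level concepts such as person names are recognized from lower-level matches.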

Use Content Analytics Studio with Watson Explorer Content Analytics to iteratively develop custom annotators:
  1. In Content Analytics Studio, develop and test a UIMA pipeline that includes linguistic resources such as dictionaries, character rules, and parsing rules.
  2. Export the pipeline to Watson Explorer Content Analytics and verify the annotation results by exploring facets in the content analytics miner or enterprise search application.
  3. In Content Analytics Studio, fine-tune the linguistic resources.
  4. Export the updated UIMA pipeline to Watson Explorer Content Analytics and verify your changes in the content analytics miner or enterprise search application.