Toolkit structure

The Text Toolkit provides tools that help you process unstructured text.

The toolkit contains the following predefined directories:
  • bin: Contains the createTypes.pl script, which generates type definitions for the output views. The script can also generate a composite operator and a main file that calls that operator.
  • com.ibm.streams.text.analytics: Contains the TextExtract operator model file and icons.
  • doc: Contains the documentation that is used by IBM Streams Studio.
  • impl: Contains the support library (lib) directory, which holds the file TextAnalyticsForStreams.jar. This file contains the implementation of the TextExtract operator.
  • lib: Contains the Text Analytics engine. The lib/TextAnalytics/data/tam directory includes the pre-compiled extractor libraries.
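To confirm that an installed copy of the toolkit matches the layout listed above, a small check along the following lines may help. It is only a sketch: the function name missing_entries is hypothetical, the toolkit root location depends on your installation, and the path impl/lib/TextAnalyticsForStreams.jar is inferred from the description of the impl directory rather than stated explicitly.

```python
from pathlib import Path

# Relative paths taken from the directory list above. The location of the
# JAR file (impl/lib) is inferred from the description and is an assumption.
EXPECTED_PATHS = [
    "bin/createTypes.pl",
    "com.ibm.streams.text.analytics",
    "doc",
    "impl/lib/TextAnalyticsForStreams.jar",
    "lib/TextAnalytics/data/tam",
]


def missing_entries(toolkit_root):
    """Return the expected relative paths that do not exist under toolkit_root."""
    root = Path(toolkit_root)
    return [rel for rel in EXPECTED_PATHS if not (root / rel).exists()]
```

Calling missing_entries with the path to your toolkit directory returns an empty list when every listed entry is present, and otherwise returns the entries that could not be found.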