File system monitor overview

The file system monitor is the client interface of the unstructured correlation text engine. It handles unstructured text files such as electronic mail, automatic speech recognition transcripts, comments, chat-logs, and text that is scanned with optical character recognition.

The file system monitor supports all of the file formats supported by Apache Tika 1.0 that are listed at http://tika.apache.org/1.0/.

Illustrates the flow of information the file system monitor and MDM.
The file system monitor process has five steps:
  1. The user starts the file system monitor by running the TextCorrelation.sh or TextCorrelation.bat script.
  2. The file system monitor reads the documents in the TEXTCORRELATION_HOME/work/documents/toprocess folder.
  3. The file system monitor starts the unstructured text correlation engine to process the documents. If update handling is set to on in the config.xml file, the file system monitor starts the update handler to process operational server changes.
  4. Documents that are successfully processed by the unstructured text correlation engine are saved in the operational server.
  5. The unstructured text correlation engine identifies the correlation between the document and operational server records and saves the extracted records and relationships in the operational server.


Last updated: November 6, 2015