IBM Content Analytics with Enterprise Search, Version 3.0.0                  

Monitoring the parser

Monitor the parser when you need to view information about documents that are parsed and analyzed before they are added to the index. Options enable you to review statistics and administer parser activity.

If your administrative role limits you to monitoring collections, you can view the status of the parser, but you cannot start or stop the parser.

When you monitor parser details, you see a snapshot of parser activity that provides statistics about parsing activities at a specific moment in time. For example, the statistics show you the number of documents that were parsed and added to the index and the total size of the index.

When the parser is active, messages provide you with additional information about the state of the parser. For example:
  • The parser might be actively parsing documents.
  • The parser might be idle. The parser sleeps until more documents are available to parse.

To monitor the parser for a collection:

  1. On the Collections view, expand the collection that you want to monitor.
  2. If the parser is running and you want to see detailed status information about parsing activity, click Details in the Parse and Index pane.

    If your administrative role allows you to administer processes for a collection, you can start and stop the parser while you view details about parsing activities.

    When you first create a collection, start the parser only after the crawler begins crawling data. This ensures that the parser has data to analyze and tokenize. Unless you make changes to parsing rules, you can let the parser run continuously.

    You need to stop and restart the parser when you make changes to parsing rules. For example, if you change the parser configuration, you must stop and restart the parser before your changes become effective.

    If you monitor an enterprise search collection that does not use a document cache to support rebuilding the index from cached documents, you must recrawl documents to apply parsing changes to documents that are already stored in the index.

    If global processes or analytic resources are configured for the collection, you can monitor those activities. For example, you can see how many documents have been processed. If a schedule was configured for link analysis or thumbnail generation, you can see when the process is scheduled to run again.


Feedback

Last updated: May 2012

© Copyright IBM Corporation 2004, 2012.
This information center is powered by Eclipse technology. (http://www.eclipse.org)