Step 3: Express Tagging

After you create, fine-tune, and extend the classes that organize the documents in your search collection, the last step is to preserve that class hierarchy. To do so, use the Express Tagging capability to tag each document with information about the classes and subclasses in which it is located. As explained in the next section, this tag is used to produce structured navigation for the class hierarchy when search results are displayed for collections that have been auto-classified.

The final step in the auto-classification process is shown in Figure 1.

Figure 1. Step 3 of the Auto-Classification Process

This portion of the auto-classification process lets you optionally identify other sources to which you want to apply the same classifications. It also requires that you provide the information that links the class hierarchy that you've created to the Tag annotation where it will be stored. You can specify the following fields:

  • Sources - Check any source that you want to tag with the current auto-classification hierarchy. This is typically the source or sources that were used to generate that hierarchy.

    The text area below the source list enables you to specify the names of other sources to which you want to apply the current classification hierarchy. This field enables you to apply an existing auto-classification taxonomy to other Watson™ Explorer Engine sources, helping you standardize taxonomies across those sources.

  • Project - The name of the project that uses the display in which a Tag annotation has been created to hold the auto-classification information. This tutorial suggested the project name auto-class-tutorial.
  • Tag Content - The name of the Tag annotation that was created to hold the auto-classification information. This tutorial used an annotation named tags to hold this information.
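
The values collected by these fields can be pictured as a small record. The following Python sketch is only an illustration and is not part of Watson Explorer Engine: the class name ExpressTaggingSettings, its field names, the validation rule, and the source name example-source are all assumptions made for this example. Only the project name auto-class-tutorial and the annotation name tags come from this tutorial.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ExpressTaggingSettings:
    """Hypothetical record of the values entered in the Express Tagging fields."""

    project: str      # project whose display holds the Tag annotation
    tag_content: str  # name of the Tag annotation
    checked_sources: List[str] = field(default_factory=list)  # sources checked in the list
    other_sources: List[str] = field(default_factory=list)    # names typed into the text area

    def all_sources(self) -> List[str]:
        """Return every source to tag, without duplicates, in entry order."""
        return list(dict.fromkeys(self.checked_sources + self.other_sources))

    def problems(self) -> List[str]:
        """List the fields that would need attention before tagging starts."""
        issues = []
        if not self.project.strip():
            issues.append("Project is empty")
        if not self.tag_content.strip():
            issues.append("Tag Content is empty")
        if not self.all_sources():
            issues.append("no sources selected")
        return issues


settings = ExpressTaggingSettings(
    project="auto-class-tutorial",
    tag_content="tags",
    checked_sources=["example-source"],  # hypothetical source name
)
print(settings.problems())  # an empty list means no field is missing
```

Keeping the checked sources and the typed source names separate mirrors the two ways the Sources field accepts input, while all_sources() treats them as one list of sources to tag.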

Once you have specified these values, click Start Express Tagging to begin the tagging operation. A dialog like the one shown in Figure 2 displays.

Figure 2. The Final Auto-Classification Dialog

Once this dialog displays, you can select the Close and Refresh option to return to the Auto-Classification interaction screen, but you may find it more useful to click the check tagging status link to ensure that the tagging phase of the auto-classification process is working correctly. After clicking this link, a screen like the one shown in Figure 3 displays.

Figure 3. The Express Tagging Status Screen

If express tagging is working correctly, you should see positive values in both fields of the Docs Processed column for each tagging operation. The first value in this column is the number of documents that have been tagged so far, and the second value is the total number of documents that should eventually be tagged for that query. If both values are non-zero, everything is working correctly, and you should proceed to Using the Auto-Classified Results for information about identifying and using the taxonomy that you created in this tutorial. If either value is zero:

  • Double-check that you specified the correct Project name and Tag Content in the final section of the Auto-Classification interaction screen, as explained earlier in this section. You may need to revisit the display for that project to verify the Tag Content setting.
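
The check described above can be sketched in a few lines of Python. This is a minimal illustration, not a Watson Explorer Engine API: it assumes that you have copied a Docs Processed value from the status screen as text, and that the two numbers are separated by a slash (for example, "120 / 500"). The separator, the function names, and the sample values are assumptions made for this example.

```python
from typing import NamedTuple


class DocsProcessed(NamedTuple):
    tagged: int  # documents tagged so far
    total: int   # documents that should eventually be tagged


def parse_docs_processed(cell: str) -> DocsProcessed:
    """Parse text such as "120 / 500" (assumed format) into two integers."""
    tagged_text, total_text = cell.split("/")
    # Remove surrounding spaces and thousands separators before converting.
    tagged = int(tagged_text.strip().replace(",", ""))
    total = int(total_text.strip().replace(",", ""))
    return DocsProcessed(tagged, total)


def tagging_looks_healthy(cell: str) -> bool:
    """Apply the rule from the text: both values must be non-zero."""
    status = parse_docs_processed(cell)
    return status.tagged > 0 and status.total > 0


# Sample values are invented for illustration.
for cell in ["120 / 500", "0 / 500", "0 / 0"]:
    verdict = "ok" if tagging_looks_healthy(cell) else "check Project and Tag Content"
    print(f"{cell}: {verdict}")
```

A zero in either position is treated the same way here because the text gives a single remedy for both cases: verify the Project name and Tag Content values.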

At any time after documents in your sources have begun to be tagged, you can perform a search against your application to see and use the auto-classified taxonomy that you have generated, as explained in the next section.

To proceed to the next section, click Using the Auto-Classified Results.