Importing a Taxonomy

About this task

To import a taxonomy and classify the documents in your search collection according to the hierarchy that it specifies, the taxonomy must be defined in terms of labels and associated queries. The supported XML taxonomy format looks like the following:

<?xml version="1.0"?>
<taxonomy>
   <entity>
     <label>Meetings</label>
     <query>meeting OR conference OR committee</query>
   </entity>
   <entity>
     <label>Enron Officers</label>
     <query>"jeffrey skilling" OR "kenneth lay" OR "ken lay"</query>
   </entity>
   <entity>
     <label>Accounting</label>
     <query>"arthur andersen"</query>
   </entity>
   <entity>
     <label>Agreements</label>
     <query>agreement OR contract</query>
   </entity>
</taxonomy>

Entries in a taxonomy file need not be in alphabetical order; they will be automatically organized during the import process.
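
If your categories live somewhere other than a hand-written file, a file in the format above can be generated with a short script. The following is a minimal sketch using only the Python standard library; the function name build_taxonomy and the shape of its input (a mapping of label to query) are assumptions made for illustration and are not part of the product.

```python
import xml.etree.ElementTree as ET


def build_taxonomy(categories):
    """Return taxonomy XML text for a mapping of label -> query.

    Hypothetical helper: it mirrors the <taxonomy>/<entity>/<label>/<query>
    layout shown in the sample and is not a product API.
    """
    root = ET.Element("taxonomy")
    for label, query in categories.items():
        entity = ET.SubElement(root, "entity")
        ET.SubElement(entity, "label").text = label
        ET.SubElement(entity, "query").text = query
    # ET.indent exists only in Python 3.9+; skip pretty-printing otherwise.
    if hasattr(ET, "indent"):
        ET.indent(root, space="   ")
    return '<?xml version="1.0"?>\n' + ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_taxonomy({
        "Meetings": "meeting OR conference OR committee",
        "Agreements": "agreement OR contract",
    }))
```

The entries are written in whatever order the mapping supplies them, which is acceptable because the import process organizes them.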

Before you can import a taxonomy, you must configure the auto-classify collection to understand it.

Procedure

  1. Select the auto-classify search collection.
  2. Add the Auto-Classify - import Taxonomy converter to the collection.

    After adding the source form, you will be required to identify the XPath of each component. In our example each class is contained in an entity tag, and each class contains a label tagged as label and a query tagged as query.

    To finish adding the taxonomy converter to your collection:

  3. Enter //entity as the XPath to each top-level entry.
  4. Enter label as the Relative XPath to Category Label.
  5. Enter query as the Relative XPath to Category Query.
  6. Click OK to save the converter.
  7. Return to the Auto-Classification page.
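
To check a taxonomy file before importing it, you can apply the same three XPaths yourself. The following is a rough sketch, not the converter's actual implementation: the function name extract_categories and the embedded sample are assumptions for illustration. Python's ElementTree does not accept an absolute path such as //entity on an element, so the sketch uses the equivalent .//entity.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the sample taxonomy, embedded so the sketch is self-contained.
SAMPLE = """<?xml version="1.0"?>
<taxonomy>
   <entity>
     <label>Meetings</label>
     <query>meeting OR conference OR committee</query>
   </entity>
   <entity>
     <label>Accounting</label>
     <query>"arthur andersen"</query>
   </entity>
</taxonomy>"""


def extract_categories(xml_text, entry_xpath=".//entity",
                       label_xpath="label", query_xpath="query"):
    """Return a list of (label, query) pairs selected by the given XPaths.

    Hypothetical helper: entry_xpath selects each top-level entry, and the
    other two paths are evaluated relative to that entry.
    """
    root = ET.fromstring(xml_text)
    categories = []
    for entry in root.findall(entry_xpath):
        label = entry.findtext(label_xpath)
        query = entry.findtext(query_xpath)
        if not label or not query:
            raise ValueError("entry is missing a label or a query")
        categories.append((label.strip(), query.strip()))
    return categories


if __name__ == "__main__":
    for label, query in extract_categories(SAMPLE):
        print(label, "=>", query)
```

An entry that lacks either element raises ValueError, which makes a malformed file easy to spot before it reaches the import page.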

Results

You are now ready to import the taxonomy:

  1. Select the Import a Taxonomy radio button.
  2. Enter the URL of the file containing the taxonomy that you want to use in the Taxonomy Location field.
  3. Select the format of that taxonomy from the Taxonomy Format drop-down list.
  4. Click Import to proceed.

Note: Auto-classification can easily be extended to support importing other structured taxonomy definition formats. Contact your product support representative for more information about this process.