Category Model Nugget: Model Tab

For category models, the model tab displays the list of categories in the category model on the left and the descriptors for a selected category on the right. Each category is made up of a number of descriptors. For each category you select, the associated descriptors appear in the table. These descriptors can include concepts, category rules, types, and TLA patterns. The type of each descriptor, as well as some examples of what each descriptor represents, is also shown.

On this tab, the objective is to select the categories you want to use for scoring. For a category model, documents and records are scored into categories. If a document or record contains one or more of the descriptors in its text or any underlying terms, then that document or record is assigned to the category to which the descriptor belongs. These underlying terms include the synonyms defined in the linguistic resources (regardless of whether they were found in the text or not) as well as any extracted plural/singular terms found in the text used to generate the model nugget, permuted terms, terms from fuzzy grouping, and so on.

Note: If you generated a concept model nugget instead, this tab will contain different results. See the topic Concept Model: Model Tab for more information.

Category Tree

To learn more about each category, select that category and review the information that appears for the descriptors in that category. For each descriptor, you can review the following information:

  • Descriptor name. This field contains an icon representing what kind of descriptor it is, as well as the descriptor name.
    Table 1. Descriptor icons
     
    Concepts
    TLA Patterns
     
    Types
    Category Rules
  • Type. This field contains the type name for the descriptor. Types are collections of similar concepts (semantic groupings), such as organization names, products, or positive opinions. Rules are not assigned to types.
  • Details. This field contains a list of what is included in that descriptor. Depending on the number of matches, you may not see the entire list for each descriptor due to size limitations in the dialog box.

Selecting and Copying Categories

All top categories are selected for scoring by default, as shown in the check boxes in the left pane. A checked box means that the category will be used for scoring. An unchecked box means that the category will be excluded from scoring. You can check multiple rows by selecting them and clicking one of the check boxes in your selection. Also, if a category or subcategory is selected but one of its subcategories is not selected, then the checkbox shows a blue background to indicate that there is only a partial selection in the children of the selected category.

By right-clicking a category in the tree, you can display a context menu from which you can:

  • Check Selected. Checks all check boxes for the selected rows in the table.
  • Uncheck Selected. Unchecks all check boxes for the selected rows in the table.
  • Check All. Checks all check boxes in the table. This results in all categories being used in the final output. You can also use the corresponding checkbox icon on the toolbar.
  • Uncheck All. Unchecks all check boxes in the table. Unchecking a category means that it will not be used in the final output.You can also use the corresponding empty checkbox icon on the toolbar.

By right-clicking a cell in the descriptor table, you can display a context menu in which you can:

  • Copy. The selected concept(s) are copied to the clipboard.
  • Copy With Fields. The selected descriptor is copied to the clipboard along with the column headings.
  • Select All. All rows in the table will be selected.