Filtering Extraction Results

When you are working with very large datasets, the extraction process could produce millions of results. For many users, this amount can make it more difficult to review the results effectively. Therefore, in order to zoom in on those that are most interesting, you can filter these results through the Filter dialog available in the Extraction Results pane.

Keep in mind that all of the settings in this Filter dialog are used together to filter the extraction results that are available for categories.

Filter by Frequency You can filter to display only those results with a certain global or document frequency value.

  • Global frequency is the total number of times a concept appears in the entire set of documents or records and is shown in the Global column.
  • Document frequency is the total number of documents or records in which a concept appears and is shown in the Docs column.

For example, if the concept nato appeared 800 times in 500 records, we would say that this concept has a global frequency of 800 and a document frequency of 500.

And by Type You can filter to display only those results belonging to certain types. You can choose all types or only specific types.

And by Match Text You can also filter to display only those results that match the rule you define here. Enter the set of characters to be matched in the Match text field and then select the condition in which to apply the match.

Table 1. Match text conditions
Condition Description
Contains The text is matched if the string occurs anywhere. (Default choice)
Starts with Text is matched only if the concept or type starts with the specified text.
Ends with Text is matched only if the concept or type ends with the specified text.
Exact match The entire string must match the concept or type name.

Results Displayed in Extraction Result Pane

Here are some examples of how the results might be displayed, in English, in the Extraction Results pane toolbar based on the filters.

Table 2. Examples of filter feedback
Filter feedback Description
The toolbar shows the number of results. Since there was no text matching filter and the maximum was not met, no additional icons are shown.
The toolbar shows results were limited to the maximum specified in the filter, which in this case was 300. If a purple icon is present, this means that the maximum number of concepts was met. Hover over the icon for more information.
The toolbar shows results were limited using a match text filter. This is shown by the magnifying glass icon.

To Filter the Results

  1. From the menus, choose Tools > Filter. The Filter dialog box opens.
  2. Select and refine the filters you want to use.
  3. Click OK to apply the filters and see the new results in the Extraction Results pane.