Configuring character rules graphically
You can configure character rule expressions based on sample text that contains a particular character sequence.
About this task
Before you create character rules, you must create a character rules database and include the compiled dictionary file in the lexical analysis stage of your UIMA pipeline. You can then analyze sample text that contains the patterns to use as the basis for the rules.
After Content Analytics Studio analyzes the sample text, the pattern of character classes that represents the selected text is displayed in a tree format. You can modify the character sequence that the rule matches, for example by generalizing the pattern so that it also matches similar sequences of characters. You can then define one or more annotations to create when matching text is found in a document, and you can create features for those annotations. After you add the rule to the database, rebuild the character rules file.
Alternatively, you can manually add character class elements to a character rule by using the Add Character option. With this approach, you can create a character rule without dragging any sample text. For example, this approach might be easier when the exact pattern that you want to match is not available in the text.
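To make the idea of a character-class pattern more concrete, the following Python sketch shows one way that a sample string could be reduced to a sequence of character classes and then generalized to match similar text. It is an illustration only: it is not the algorithm or rule syntax that Content Analytics Studio uses, and the class names (UPPER, LOWER, DIGIT, SPACE, PUNCT) and the functions char_class, class_sequence, and to_regex are hypothetical names chosen for this example.

```python
import re
from itertools import groupby

# Hypothetical, coarse character classes; the classes that the product
# actually uses may differ.
CLASS_PATTERNS = {
    "UPPER": "[A-Z]",
    "LOWER": "[a-z]",
    "DIGIT": r"\d",
    "SPACE": r"\s",
}


def char_class(ch):
    """Map a single character to a coarse class name."""
    if ch.isdigit():
        return "DIGIT"
    if ch.isalpha():
        return "UPPER" if ch.isupper() else "LOWER"
    if ch.isspace():
        return "SPACE"
    return "PUNCT"


def class_sequence(text):
    """Collapse runs of the same class into (class, run_text) pairs."""
    return [(cls, "".join(run)) for cls, run in groupby(text, key=char_class)]


def to_regex(sequence, exact_counts=True):
    """Build a regular expression from a class sequence.

    With exact_counts=True, each run must have the same length as in the
    sample. With exact_counts=False, each run may have any length of one
    or more, which generalizes the pattern to similar text.
    Punctuation runs are always matched literally.
    """
    parts = []
    for cls, run in sequence:
        if cls == "PUNCT":
            parts.append(re.escape(run))
        elif exact_counts:
            parts.append(f"{CLASS_PATTERNS[cls]}{{{len(run)}}}")
        else:
            parts.append(f"{CLASS_PATTERNS[cls]}+")
    return "".join(parts)


sample = "AB-1234"
sequence = class_sequence(sample)
print(sequence)

strict = to_regex(sequence)
relaxed = to_regex(sequence, exact_counts=False)
print(bool(re.fullmatch(strict, "CD-5678")))
print(bool(re.fullmatch(strict, "XYZ-7")))
print(bool(re.fullmatch(relaxed, "XYZ-7")))
```

In this sketch, the strict pattern plays the role of a rule derived directly from the sample text, and the relaxed pattern corresponds to modifying the rule so that it matches similar sequences of characters.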
Procedure
To create character rules: