If the content to be analyzed includes XML documents with meaningful markup, and you want to use this markup to annotate text, you can map the XML elements to the common analysis structure.
To enable custom text analysis processes to access specific XML elements, or to map several XML elements to a common Type for use in semantic search, you can create custom mapping files. The mapping files must adhere to the UIMA framework for text analysis.
When you add mapping files to a collection that uses a custom text analysis engine, you enable XML elements in source documents to be mapped to annotations in the common analysis structure. These annotations can then be used by your custom text analysis engine.
For example, you can map the content of <addressee> and <customer> elements to Person annotations in the common analysis structure. These annotations can then be accessed by your custom annotators, which might detect additional information (for example, they might detect the gender of the Person).
To map XML elements to the common analysis structure: