The document hierarchy defines both the document types that your application supports and the page types that are associated with each document type.
For example, the TravelDocs document hierarchy defines three document types, four associated page types, and the generic page type Other.
After page identification assigns the correct page type to each incoming page, your application uses information in the document hierarchy to determine the corresponding document. For example, a page of type Rental_Agreement is part of a car rental document. A page of type Air_Ticket is part of a flight document.
IBM® Datacap Taskmaster Capture then uses the information in the document hierarchy to assemble individual pages into multi-page documents. Each page has three variables that define the structure of the parent document:
For example, here are the variables that you specified earlier for each of the TravelDocs pages.
Max | Min | Order | Description | |
---|---|---|---|---|
Rental Agreement | 1 | 1 | 1 | One per document; required; must be first |
Optional Insurance | 1 | 0 | 2 | One per document; optional; must be second |
Air_Ticket | 1 | 1 | 1 | One per document; required; must be first |
Room_Receipt | 1 | 1 | 1 | One per document; required; must be first |
The variables determine that each car rental document must contain one rental agreement page, and that the page might be followed by an optional insurance page. If Taskmaster identifies a rental agreement page that is immediately followed by an optional insurance page, it groups the two pages as a single document. The following example is a portion of the runtime data file (PageID.xml) that is generated after document creation.
<?xml-stylesheet type="text/xsl" href="..\..\dco.xsl"?>
<B id="20100321.011">
<V n="TYPE">TravelDocs</V>
<V n="LAST_RR_TPROFILE">Rulerunner:m:eRun</V>
<D id="20100321.011.01"> <-- Document ID
<V n="TYPE">Car_Rental</V> <-- Document type
<V n="STATUS">0</V>
<P id="TM000001"> <-- Page ID
<V n="TYPE">Rental_Agreement</V> <-- Page type
<V n="STATUS">49</V>
<V n="IMAGEFILE">tm000001.tif</V>
etc.
</P>
<P id="TM000002"> <-- Page ID
<V n="TYPE">Optional_Insurance</V> <-- Page type
<V n="STATUS">49</V>
<V n="IMAGEFILE">tm000002.tif</V>
etc.
</P>
</D>
etc.