IBM Datacap, Version 8.1            

General Taskmaster application architecture

Taskmaster applications are designed to scan, process, and verify the data in your documents.

Although each Taskmaster application is different, most include seven basic steps.
Table 1. Flow chart of seven basic steps of an application, from page input to data export
Application step Description
Page input Scan a batch of hardcopy pages or import electronic documents into your application. The output from this stage is a batch of individual TIFF image files. Each page is initially assigned the page type Other.
Page identification Perform image enhancement to improve the image quality. Then, determine each page type, automatically or by displaying it to an operator for manual identification if necessary. The goal is to identify the page type, but not a variant (for example, an airline ticket, but not a ticket from a specific airline).
Document assembly Organize the individual page files into a document according to predefined document definitions (for example, a form might have two required pages and an optional attachment). Run document integrity confirmation to ensure that each document satisfies the rules for that document type.
Data recognition On each page, locate the data fields for that page type (for example, an airline ticket contains a passenger name, a departure airport). Then, use a Taskmaster recognition engine to obtain the character data for each field. The recognition engine indicates the degree of confidence for each character.
Data validation Check the validity of specific fields. For example, you can check for valid dates, valid field formats, and correct totals. You can also complete searches to ensure that a state abbreviation is valid, or a purchase order number matches an item in a purchase order database.
Data verification Display low-confidence data and fields that failed validation to an operator for verification, correction, and exception handling. When the operator submits the batch, the application runs the validation rules again to ensure that all data satisfies the validation criteria.
Data export Export the data or document images to a text file, an XML file, a database, a Document Management system, or the next stage in a workflow.


Feedback

Last updated: November 2013
dcadg265.htm

© Copyright IBM Corporation 2013.