IBM Datacap, Version 8.1            

Page Identification

Page identification is one of the first steps in any Taskmaster application. All incoming pages are initially assigned the default page type Other. Before Taskmaster can assemble those pages into documents and extract data from the pages, it must determine the correct type for each page.

Page identification methods include fingerprint recognition, structure-based identification, text matching, and manual page identification. Image enhancement is typically done before page identification to remove lines, shading, and other graphic elements that might interfere with the recognition process.



Feedback

Last updated: November 2013
dcadg365.htm

© Copyright IBM Corporation 2013.