How the pat_RecogMatch_Id action works
When Datacap runs
the pat_RecogMatch_Id action, it gathers all the
anchor objects from the fingerprint library and looks for a match
on the current page.
For each anchor object, Datacap searches
the current page in a region 400 pixels greater in each direction
than the text zone defined in the fingerprint. If it finds a match
that meets the required confidence level, it sets the page type and
computes the offset values.
Restriction: The METRIC variable
does not change the size of the search region that is used by
pat_RecogMatch_Id.The following RRS log entries illustrate how pat_RecogMatch_Id works.
Created PatternMatch Object
Aquired PM lock
Opening 'provider=microsoft.jet.oledb.4.0;
data source=C:\Datacap\TravelDocs\TravelDocsFingerprint.mdb;persist security info=false'
Fingerprint/Rules Database connection established.
#572, path:'C:\Datacap\TravelDocs\fingerprint\572.cco'
FPZone:'1384,313,1529,392' TXTZone:'1408,334,1498,359' Value:'Room' 1
FPZone:'697,1988,1006,2074' TXTZone:'758,2018,811,2036' Value:'Hotel #3'
----------------------------
ANCHOR TEXT:'Room'--->'R[oO0][oO0]m' METRIC:'400,400' 2
SEARCH AREA:'Hotel #3
Room
Check out Wed Nov 24 2010
speed internet microwave fridge 3
$109 95
$329 85'
Matched Value >>Room<< 4
Check FingerPrintID# 572 Match Confidence: 9. Search Area: 1008,0,1700,759
Offset(-80,-100) 5
----------------------------
ANCHOR TEXT:'Hotel #3' --->
'[Z2][oO0][\(\)iItl1][oO0][\ ]*H[\(\)iItl1][\(\)iItl1][\(\)iItl1][\(\)iItl1][oO0]p[\ ]*H[oO0]
[\(\)iItl1]e[\(\)iItl1]s' METRIC:'400,400'
SEARCH AREA:'Hotel #3' 6
Matched Value >>Hotel[SPACE CHARACTER]#3<<
Check FingerPrintID# 572 Match Confidence: 9. Search Area: 358,1618,1211,2200
Offset(-100,-101)
RecogMatch FingerPrint#:572 PAGETYPE:Room_Receipt 7
| RRS log entry | Description |
|---|---|
| 1 | The action finds two anchor zones that are defined in fingerprint
572: Room and Hotel #3. |
| 2 | It computes a bounding region 400 pixels greater in each direction
than the region (TXTZone) defined in the fingerprint
CCO file for the first anchor value (Room). |
| 3 | It identifies all the text within that bounding region on the current page. |
| 4 | It locates the anchor value Room within the
search region. |
| 5 | It computes the offset by comparing the word's position on the page to the position in the fingerprint. |
| 6 | It repeats the process for the second anchor value. |
| 7 | It sets the page's template ID and type because at least one of the zones matched. |