Product Documentation
Abstract
This document provides details about the languages that are supported by the different IBM Datacap Version 9.1.6 components.
Content
The following tables show the languages that are supported in the corresponding Datacap 9.1.6 component.
Notes
- OCR-M: Mobile
- Legal Dict.: OCR-S Legal Dictionary
- Financial Dict.: OCR-S Financial Dictionary
- Medical Dict.: OCR-S Medical Dictionary
- ICR-P: Advanced Handwriting
- Admin/Install doc.: Administration/installation documentation
Languages:
Important:
- For Chinese (traditional) OCR-S/OCR-SR support, HKSCS extensions are not supported.
- For Chinese (simplified) and Chinese (traditional), OCR-A is recommended instead of OCR-S/OCR-SR, because OCR-S confidence calculation might return high confidence for replaced characters. OCR-A also supports more than a thousand glyphs of the most commonly used HKSCS extensions of traditional Chinese.
- ICR/C is deprecated. The recommended engines are OCR-A and OCR-SR.
Table 1
| Language | Data Entry | Datacap Desktop | FastDoc | Datacap Web | Datacap Navigator | Mobile | OCR-SR Machine Print | Legal Dict. | Financial Dict. | OCR-A Hand Print | OCR-SR Hand Print |
| Afrikaans | |||||||||||
| Albanian | |||||||||||
| Arabic | |||||||||||
| Austrian | |||||||||||
| AzeriLatin | |||||||||||
| Bashkir | |||||||||||
| Belgian | |||||||||||
| Bosnian (Latin) | |||||||||||
| Bulgarian | |||||||||||
| Catalan | |||||||||||
| Chinese (simplified) | |||||||||||
| Chinese (traditional) | |||||||||||
| Croatian | |||||||||||
| Czech |
Table 1 continued
| Language | Medical Dict | OCR-A Machine Print | ICR-C | ICR-P | OCR-M | Accounts Payble application | Medical Claims application | IBM Content Classification | Admin/Install doc. | Online Help |
| Afrikaans | ||||||||||
| Albanian | ||||||||||
| Arabic | ||||||||||
| Agul | ||||||||||
| Bosnian (Latin) | ||||||||||
| Catalan | ||||||||||
| Chinese (simplified) | ||||||||||
| Chinese (traditional) | ||||||||||
| Croatian | ||||||||||
| Czech |
Table 2 Danish through Estonian
| Language | Data Entry | Datacap Desktop | FastDoc | Datacap Web | Datacap Navigator | Mobile | OCR-SR Machine Print | Legal Dict. | Financial Dict. | Medical Dict. | OCR-A Hand Print | OCR-SR Hand Print |
| Danish | ||||||||||||
| Dutch | ||||||||||||
| Dutch Belgian | ||||||||||||
| English | ||||||||||||
| Esperanto | ||||||||||||
| Estonian |
Table 2 Danish through Estonian continued
| Language | OCR-A Machine Print | ICR-C | ICR-P | OCR-M | Accounts Payble application | Medical Claims application | IBM Content Classification | Admin/Install doc. | Online Help |
| Danish | |||||||||
| Dutch | |||||||||
| Dutch Belgian | |||||||||
| English | |||||||||
| Esperanto | |||||||||
| Estonian |
| Language | Data Entry | Datacap Desktop | FastDoc | Datacap Web | Datacap Navigator | Mobile | OCR-SR Machine Print | Legal Dict. | Financial Dict. | Medical Dict. | OCR-A Hand Print | OCR-SR Hand Print |
| Faroese | ||||||||||||
| Finnish | ||||||||||||
| French | ||||||||||||
| Gaelic Irish | ||||||||||||
| Gaelic Scottish | ||||||||||||
| German | ||||||||||||
| Greek |
Table 3 Faroese through Greek continued
| Language | OCR-A | ICR-C | ICR-P | OCR-M | Accounts Payble application | Medical Claims application | IBM Content Classification | Admin/Install doc. | Online Help |
| Faroese | |||||||||
| Finnish | |||||||||
| French | |||||||||
| Gaelic Irish | |||||||||
| Gaelic Scottish | |||||||||
| German | |||||||||
| Greek |
Table 4 Hebrew through Norwegian
For Japanese, OCR-A is recommended instead of OCR-S/OCR-SR, because OCR-S confidence calculation might return high confidence for replaced characters.
| Language | Data Entry | Datacap Desktop | FastDoc | Datacap Web | Datacap Navigator | Mobile | OCR-SR Machine Print | Legal Dict. | Financial Dict. | Medical Dict. | OCR-A Hand Print | OCR-SR Hand Print |
| Hebrew | ||||||||||||
| Hungarian | ||||||||||||
| Icelandic | ||||||||||||
| Indonesian | ||||||||||||
| Irish | ||||||||||||
| Italian | ||||||||||||
| Japanese | ||||||||||||
| Korean | ||||||||||||
| Latvian | ||||||||||||
| Latin | ||||||||||||
| Lithuanian | ||||||||||||
| Maltese | ||||||||||||
| Norwegian |
Table 4 Hebrew through Norwegian continued
| Language | OCR-A | ICR-C | ICR-P | OCR-M | Accounts Payble application | Medical Claims application | IBM Content Classification | Admin/Install doc. | Online Help |
| Hebrew | |||||||||
| Hungarian | |||||||||
| Icelandic | |||||||||
| Italian | |||||||||
| Japanese | |||||||||
| Korean | |||||||||
| Latvian | |||||||||
| Lithuanian | |||||||||
| Maltese | |||||||||
| Norwegian |
Table 5 Polish through Sami Southern
| Language | Data Entry | Datacap Desktop | FastDoc | Datacap Web | Datacap Navigator | Mobile | OCR-SR Machine Print | Legal Dict. | Financial Dict. | Medical Dict. | OCR-A Hand Print | OCR-SR Hand Print |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Polish | ||||||||||||
| Portuguese (Brazil) | ||||||||||||
| Portuguese (Portugal) | ||||||||||||
| Rhaeto-Romanic | ||||||||||||
| Romanian | ||||||||||||
| Russian | ||||||||||||
| Sami | ||||||||||||
| Sami Northern | ||||||||||||
| Sami Southern |
Table 5 Polish through Sami Southern continued
| Language | OCR-A | ICR-C | ICR-P | OCR-M | Accounts Payble application | Medical Claims application | IBM Content Classification | Admin/Install doc. | Online Help |
| Polish | |||||||||
| Portuguese (Brazil) | |||||||||
| Portuguese (Portugal) | |||||||||
| Rhaeto-Romanic | |||||||||
| Romanian | |||||||||
| Russian | |||||||||
| Sami | |||||||||
| Sami Northern | |||||||||
| Sami Southern |
Table 6 Serbian through Vietnamese
| Language | Data Entry | Datacap Desktop | FastDoc | Datacap Web | Datacap Navigator | Mobile | OCR-SR Machine Print | Legal Dict. | Financial Dict. | Medical Dict. | OCR-A Hand Print | OCR-SR Hand Print |
| Serbian (Cyrillic)* | ||||||||||||
| Serbian (Latin) | ||||||||||||
| Slovak | ||||||||||||
| Slovenian | ||||||||||||
| Spanish | ||||||||||||
| Swahili | ||||||||||||
| Swedish | ||||||||||||
| Thai** | ||||||||||||
| Turkish | ||||||||||||
| Vietnamese |
Table 6 Serbian through Vietnamese continued
| Language | OCR-A | ICR-C | ICR-P | OCR-M | Accounts Payble application | Medical Claims application | IBM Content Classification | Admin/Install doc. | OCR-A Hand Print | OCR-SR Hand Print | Online Help |
| Serbian (Cyrillic)* | |||||||||||
| Serbian (Latin) | |||||||||||
| Slovak | |||||||||||
| Slovenian | |||||||||||
| Spanish | |||||||||||
| Swahili | |||||||||||
| Swedish | |||||||||||
| Swiss | |||||||||||
| Thai** | |||||||||||
| Turkish | |||||||||||
| Ukrainian | |||||||||||
| Vietnamese |
*Important: Datacap Version 9.1.6 does not expose a user interface to select the Serbian Cyrillic recognition option, but support for Serbian (Cyrillic) is invoked through the implementation of actions in Datacap Studio. See the technical document, Setting the OCR/S recognition language to Serbian (Cyrillic).
**Restriction: Some action libraries may be incompatible with certain Thai characters.
Restriction: The data entry in Thai language is only supported in Datacap Navigator.
The Datacap Web, Datacap Desktop and FastDoc do not support data entry in Thai language.
Was this topic helpful?
Document Information
Modified date:
08 September 2020
UID
ibm10886537