IBM Support

IBM Datacap Taskmaster Capture 8.1.0 Language Support

Product Documentation


Abstract

This document provides details about the languages that are supported by the different IBM Datacap Taskmaster Capture Version 8.1.0 components.

Content

The following tables show the languages that are supported in the corresponding Datacap Taskmaster 8.1.0 component.

Notes

  • OCR-S/OCR-SR: Nuance engine
  • OCR-A: ABBYY engine
  • OCR-N: NovoDynamics engine
  • ICR-C: RecoStar engine
  • Legal Dict.: OCR-S Legal Dictionary
  • Financial Dict.: OCR-S Financial Dictionary
  • Medical Dict.: OCR-S Medical Dictionary
  • ICR-P: Parascript engine
  • Admin/Install doc.: Administration/installation documentation

Languages:

Afrikaans through Czech

Important:

Support for Arabic requires that customers license NovoDynamics NovoVarus separately and install it on the Rulerunner machine where the Datacap Studio actions for Arabic (Datacap.Libraries.NovoDynamics) will be running.

For Chinese (traditional) OCR-S/OCR-SR support, HKSCS extensions are not supported.

For Chinese (simplified) and Chinese (traditional), OCR-A is recommended instead of OCR-S/OCR-SR, because OCR-S confidence calculation might return high confidence for replaced characters.

Table 1

LanguageData EntryDotEdit and DotScanFastDocTaskmaster WebOCR-NOCR-S OCR-SRLegal Dict.Financial Dict.Medical Dict.
AfrikaansSupported



Supported

AlbanianSupported



Supported

ArabicSupportedSupported

Supported</td><td width=



Bosnian (Latin)Supported




CatalanSupported



Supported

Chinese (simplified)
Supported
SupportedSupportedSupported
Supported


Chinese (traditional)




Supported


CroatianSupportedSupportedSupportedSupported
Supported

CzechSupportedSupportedSupportedSupported
Supported

Table 1 continued

LanguageOCR-A ICR-C ICR-PIBM Content ClassificationAdmin/Install doc.Online Help
Afrikaans SupportedSupported
Albanian SupportedSupported
Arabic





Bosnian (Latin)
Supported
Catalan SupportedSupported
Chinese (simplified) Supported

Supported

Chinese (traditional) Supported




Croatian SupportedSupported
CzechSupportedSupported

Back to top

Table 2 Danish through Estonian

LanguageData EntryDotEdit and DotScanFastDocTaskmaster WebOCR-S OCR-SRLegal Dict.Financial Dict.Medical Dict.
DanishSupported


Supported
DutchSupportedSupportedSupportedSupportedSupportedSupported Supported
Dutch BelgianSupported



EnglishSupportedSupportedSupportedSupportedSupportedSupportedSupportedSupported
EsperantoSupported


Supported
EstonianSupported


Supported

Table 2 Danish through Estonian continued

LanguageOCR-AICR-C ICR-PIBM Content ClassificationAdmin/Install doc.Online Help
Danish SupportedSupported
DutchSupportedSupported Supported
Dutch BelgianSupported
EnglishSupportedSupportedSupportedSupportedSupportedSupported
Esperanto Supported
Estonian SupportedSupported

Back to top

Table 3 Faroese through Greek

LanguageData EntryDotEdit and DotScanFastDocTaskmaster WebOCR-S OCR-SRLegal Dict.Financial Dict.Medical Dict.
FaroeseSupported


Supported
FinnishSupported


Supported
FrenchSupportedSupportedSupportedSupportedSupportedSupported Supported
Gaelic IrishSupported


Supported
Gaelic ScottishSupported


Supported
GermanSupportedSupportedSupportedSupportedSupportedSupported Supported
GreekSupportedSupportedSupportedSupportedSupported

Table 3 Faroese through Greek continued

LanguageOCR-A ICR-CICR-PIBM Content ClassificationAdmin/Install doc.Online Help
Faroese SupportedSupported
FinnishSupportedSupported
FrenchSupportedSupported Supported
Gaelic Irish SupportedSupported
Gaelic Scottish Supported
GermanSupportedSupported Supported
GreekSupportedSupported

Back to top

Table 4 Hebrew through Norwegian

Important: OCR-A support for Hebrew and Japanese requires the IBM Datacap Taskmaster Capture interim fix, 8.1.0.2-Datacap-Taskmaster-WIN-IF-OCRA:0609577, which is available at IBM Support Fix Central.

For Japanese, OCR-A is recommended instead of OCR-S/OCR-SR, because OCR-S confidence calculation might return high confidence for replaced characters.

LanguageData EntryDotEdit and DotScanFastDocTaskmaster WebOCR-S OCR-SR Legal Dict.Financial Dict.Medical Dict.
HebrewSupportedSupported



HungarianSupportedSupportedSupportedSupportedSupported
IcelandicSupported


Supported
ItalianSupportedSupportedSupportedSupportedSupported
JapaneseSupportedSupportedSupportedSupportedSupported


LatvianSupported


Supported
LithuanianSupported


Supported
MalteseSupported


Supported
NorwegianSupported


Supported

Table 4 Hebrew through Norwegian continued

LanguageOCR-A ICR-CICR-PIBM Content ClassificationAdmin/Install doc.Online Help
HebrewSupported
HungarianSupportedSupported
Icelandic SupportedSupported
ItalianSupportedSupported Supported
Japanese Supported

Supported

Latvian SupportedSupported
Lithuanian
Supported
Maltese Supported
Norwegian SupportedSupported Supported

Back to top

Table 5 Polish through Sami Southern

LanguageData EntryDotEdit and DotScanFastDocTaskmaster WebOCR-S OCR-SR Legal Dict.Financial Dict.Medical Dict.
PolishSupportedSupportedSupportedSupportedSupported
Portuguese (Brazil)SupportedSupportedSupportedSupportedSupported
Portuguese (Portugal)Supported


Supported
Rhaeto-RomanicSupported


Supported
RomanianSupportedSupportedSupportedSupportedSupported
Russian SupportedSupportedSupportedSupported Supported


SamiSupported


Supported
Sami NorthernSupported


Supported
Sami SouthernSupported


Supported

Table 5 Polish through Sami Southern continued

LanguageOCR-AICR-CICR-PIBM Content ClassificationAdmin/Install doc.Online Help
PolishSupportedSupported
Portuguese (Brazil)SupportedSupported Supported
Portuguese (Portugal) SupportedSupported Supported
Rhaeto-Romanic SupportedSupported
RomanianSupportedSupported
RussianSupported Supported
Supported

Sami
Sami Northern
Sami Southern

Back to top

Table 6 Serbian through Turkish

LanguageData EntryDotEdit and DotScanFastDocTaskmaster WebOCR-S OCR-SR Legal Dict.Financial Dict.Medical Dict.
Serbian (Cyrillic)*Supported


Supported
Serbian (Latin)Supported


Supported
SlovakSupportedSupportedSupportedSupportedSupported
SlovenianSupported


Supported
SpanishSupportedSupportedSupportedSupportedSupported
SwahiliSupported


Supported
SwedishSupportedSupportedSupportedSupportedSupported
TurkishSupportedSupportedSupportedSupportedSupported

Table 6 Serbian through Turkish continued

LanguageOCR-A ICR-CICR-PIBM Content ClassificationAdmin/Install doc.Online Help
Serbian (Cyrillic)*

Serbian (Latin)
Supported
SlovakSupportedSupported
Slovenian
Supported
SpanishSupportedSupported Supported
Swahili SupportedSupported
Swedish SupportedSupported Supported
TurkishSupportedSupported

*Important: Datacap Taskmaster Version 8.1.0 does not expose a user interface to select the Serbian Cyrillic recognition option, but support for Serbian (Cyrillic) is invoked through the implementation of actions in Datacap Studio. See the technical document, Setting the OCR/S recognition language to Serbian (Cyrillic).

Back to top

[{"Product":{"code":"SSZRWV","label":"IBM Datacap"},"Business Unit":{"code":"BU053","label":"Cloud & Data Platform"},"Component":"Not Applicable","Platform":[{"code":"PF033","label":"Windows"}],"Version":"8.1.0","Edition":"","Line of Business":{"code":"LOB45","label":"Automation"}}]

Document Information

Modified date:
17 June 2018

UID

swg27035841