Supported languages

On this topic, see what are the supported languages available for the supported OCR providers of IBM Robotic Process Automation and learn how to override a supported language if needed.

IBM Robotic Process Automation has three OCR providers, each with different language support. The tables in the following sections show each of the languages ​​supported by each of the OCR providers.

You can select languages that are not listed as the default supported languages for an OCR provider by using a language code that the OCR provider supports. See Overriding OCR languages for details.

Abbyy®

The following table displays the supported languages when using Abbyy™

Designer mode label Script mode name
Chinese (zh-CN, zh-TW, zh-Hans, zh-Hant) zh-CN, zh-TW, zh-Hans, zh-Hant
English (en-US) en-US
French (fr-FA, fr-CA) fr-FA, fr-CA
German (de-DE) de-DE
Italian (it-IT) it-IT
Japanese (ja-JP) ja-JP
Korean (ko-KR) ko-KR
Portuguese (pt-PT, pt-BR) pt-PT, pt-BR
Russian (ru-RU) ru-RU
Spanish (es-ES) es-ES

Google Cloud Vision™

The following table displays the supported languages when using Google Cloud Vision™

Designer mode label Script mode name
Chinese (zh-CN, zh-TW, zh-Hans, zh-Hant) zh-CN, zh-TW, zh-Hans, zh-Hant
English (en-US) en-US
French (fr-FA, fr-CA) fr-FA, fr-CA
German (de-DE) de-DE
Italian (it-IT) it-IT
Japanese (ja-JP) ja-JP
Korean (ko-KR) ko-KR
Portuguese (pt-PT, pt-BR) pt-PT, pt-BR
Russian (ru-RU) ru-RU
Spanish (es-ES) es-ES

Google Tesseract™

The following table displays the supported languages when using Google Tesseract™

Designer mode label Script mode name
English (en-US) en-US
Portuguese (pt-PT, pt-BR) pt-PT, pt-BR
Spanish (es-ES) es-ES

Overriding OCR languages

If you want OCR results for a language that is not listed on a command's input parameter as a supported language, you can override the OCR language by setting the input parameter with a variable that defines a language code.

A language code is a code that assigns letters or numbers as identifiers or classifiers for languages. You can use a language code that is supported by the chosen OCR provider to override a pre-defined language code from IBM RPA Studio.

Important:Tesseract is embedded in IBM RPA Studio's code, so it only supports the languages listed in Supported languages. Trying to override the languages might lead to errors if you choose Tesseract as the OCR provider.

Procedure

  1. In IBM RPA Studio, enter the OCR command.
  2. Select the OCR provider.
  3. In the Language parameter, enter the language code according to the OCR provider patterns.

See the language support for the OCR provider that you are using: