Supported languages

You can specify the language in which text documents are processed. The text search feature supports linguistic processing of text documents for the 26 language codes that are listed in Table 1.

You specify the language of the indexed text data when you create the text search index with the SYSPROC.SYSTS_CREATE administrative stored procedure. If you set the value to AUTO, the text search server tries to determine the language automatically.

Automatic language detection is more accurate for longer documents and is not recommended for very short documents that consist of only a few words. The default language for linguistic processing is English (en_US).
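The following Python sketch shows one way the language value might be passed when a text search index is created. It only assembles the text of a CALL statement and does not connect to a database. The argument order of SYSPROC.SYSTS_CREATE and the `LANGUAGE <value>` form of the options string are assumptions that are not stated in this topic, so verify them against the procedure reference before use. The helper names (`sql_literal`, `build_create_call`) and the schema, index, and column names in the usage lines are hypothetical.

```python
DEFAULT_LANGUAGE = "en_US"  # default language stated in this topic


def sql_literal(value: str) -> str:
    """Quote a Python string as an SQL string literal (doubles single quotes)."""
    return "'" + value.replace("'", "''") + "'"


def build_create_call(index_schema: str,
                      index_name: str,
                      text_source: str,
                      language: str = DEFAULT_LANGUAGE,
                      message_locale: str = "en_US") -> str:
    """Build the text of a CALL to SYSPROC.SYSTS_CREATE.

    Assumption: the procedure takes (schema, index name, text source,
    options, message locale, message) and the language is passed in the
    options string as 'LANGUAGE <value>'. The trailing '?' is a
    placeholder for the output message parameter.
    """
    options = f"LANGUAGE {language}"
    args = ", ".join(
        sql_literal(v)
        for v in (index_schema, index_name, text_source, options, message_locale)
    )
    return f"CALL SYSPROC.SYSTS_CREATE({args}, ?)"


if __name__ == "__main__":
    # Explicit language for a collection of short documents.
    print(build_create_call("db2ts", "myTextIndex",
                            "myschema.mytable (mycolumn)", "de_DE"))
    # Automatic detection, better suited to longer documents.
    print(build_create_call("db2ts", "myTextIndex",
                            "myschema.mytable (mycolumn)", "AUTO"))
```

Setting an explicit code is the safer choice when most documents contain only a few words, because automatic detection is less reliable for such input.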

The following table shows the five-character language codes for the supported languages.

Table 1. The five-character language codes for the supported languages
Language code   Language
ar_AA           Arabic
cs_CZ           Czech
da_DK           Danish
de_CH           German (Switzerland)
de_DE           German (Germany)
el_GR           Greek
en_AU           English (Australia)
en_GB           English (United Kingdom)
en_US           English (United States)
es_ES           Spanish (Spain)
fi_FI           Finnish
fr_CA           French (Canada)
fr_FR           French (France)
it_IT           Italian
ja_JP           Japanese
ko_KR           Korean
nb_NO           Norwegian Bokmål
nl_NL           Dutch
nn_NO           Norwegian Nynorsk
pl_PL           Polish
pt_BR           Brazilian Portuguese
pt_PT           Portuguese (Portugal)
ru_RU           Russian
sv_SE           Swedish
zh_CN           Simplified Chinese
zh_TW           Traditional Chinese
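
As an illustration, an application could check a language value against Table 1 before it passes the value to the text search feature. The Python sketch below is one possible approach, not part of the product: the codes are copied from the table, while the function name `normalize_language` and the lenient handling of case and hyphens (for example, accepting `en-us` as `en_US`) are assumptions made for convenience.

```python
# The 26 five-character codes from Table 1.
SUPPORTED_LANGUAGE_CODES = frozenset({
    "ar_AA", "cs_CZ", "da_DK", "de_CH", "de_DE", "el_GR", "en_AU",
    "en_GB", "en_US", "es_ES", "fi_FI", "fr_CA", "fr_FR", "it_IT",
    "ja_JP", "ko_KR", "nb_NO", "nl_NL", "nn_NO", "pl_PL", "pt_BR",
    "pt_PT", "ru_RU", "sv_SE", "zh_CN", "zh_TW",
})


def normalize_language(value: str) -> str:
    """Return 'AUTO' or a supported code from Table 1.

    Case and a hyphen separator are tolerated (a hypothetical
    convenience). Raises ValueError for any other value.
    """
    candidate = value.strip()
    if candidate.upper() == "AUTO":
        return "AUTO"
    parts = candidate.replace("-", "_").split("_")
    if len(parts) == 2:
        # Canonical form: lowercase language, uppercase territory.
        canonical = f"{parts[0].lower()}_{parts[1].upper()}"
        if canonical in SUPPORTED_LANGUAGE_CODES:
            return canonical
    raise ValueError(f"Unsupported language value: {value!r}")


if __name__ == "__main__":
    for raw in ("en-us", "ZH_tw", "auto"):
        print(raw, "->", normalize_language(raw))
```

Rejecting unknown values early gives a clearer error than a failed index creation, but the table in this topic remains the authoritative list.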