UTF-8 interchange converters

This section will discuss conversions provided in both directions for each code set and UTF-8.

UTF-8 is a universal, multibyte encoding. Conversions for each code set are provided in both directions, between the code set and UTF-8.

UTF-8 conversions are usually done by using the Universal_UCS_Conv and /usr/lib/nls/loc/uconv/UTF-8 converter.

Converter Description
ISO8859-1 UTF-8 <—> ISO Latin-1
ISO8859-2 UTF-8 <—> ISO Latin-2
ISO8859-3 UTF-8 <—> ISO Latin-3
ISO8859-4 UTF-8 <—> ISO Baltic
ISO8859-5 UTF-8 <—> ISO Cyrillic
ISO8859-6 UTF-8 <—> ISO Arabic
ISO8859-7 UTF-8 <—> ISO Greek
ISO8859-8 UTF-8 <—> ISO Hebrew
ISO8859-9 UTF-8 <—> ISO Turkish
JISX0201.1976-0 UTF-8 <—> Japanese JISX0201-0
JISX0208.1983-0 UTF-8 <—> Japanese JISX0208-0
CNS11643.1986-1 UTF-8 <—> Chinese CNS11643-1
CNS11643.1986-2 UTF-8 <—> Chinese CNS11643-2
KSC5601.1987-0 UTF-8 <—> Korean KSC5601-0
IBM-eucCN UTF-8 <—> Simplified Chinese EUC
IBM-eucJP UTF-8 <—> Japanese EUC
IBM-eucKR UTF-8 <—> Korean EUC
IBM-eucTW UTF-8 <—> Traditional Chinese EUC
IBM-udcJP UTF-8 <—> Japanese user-defined characters
IBM-udcTW UTF-8 <—> Traditional Chinese user-defined characters
IBM-sbdTW UTF-8 <—> Traditional Chinese IBM-specific characters
UCS-2 UTF-8 <—> UCS-2
IBM-437 UTF-8 <—> USA PC data code
IBM-850 UTF-8 <—> Latin-1 PC data code
IBM-852 UTF-8 <—> Latin-2 PC data code
IBM-857 UTF-8 <—> Turkish PC data code
IBM-860 UTF-8 <—> Portuguese PC data code
IBM-861 UTF-8 <—> Icelandic PC data code
IBM-863 UTF-8 <—> French Canadian PC data code
IBM-865 UTF-8 <—> Nordic PC data code
IBM-868 UTF-8 <—> Urdu IBM-868
IBM-869 UTF-8 <—> Greek PC data code
IBM-918 UTF-8 <—> Urdu IBM-918
IBM-921 UTF-8 <—> Baltic Multilingual data code
IBM-922 UTF-8 <—> Estonian data code
IBM-932 UTF-8 <—> Japanese PC data code
IBM-943 UTF-8 <—> Japanese PC data code
IBM-934 UTF-8 <—> Korea PC data code
IBM-935 UTF-8 <—> Simplified Chinese EBCDIC
IBM-936 UTF-8 <—> People's Republic of China PC data code
IBM-938 UTF-8 <—> Taiwanese PC data code
IBM-942 UTF-8 <—> Extended Japanese PC data code
IBM-944 UTF-8 <—> Korean PC data code
IBM-946 UTF-8 <—> People's Republic of China SAA data code
IBM-948 UTF-8 <—> Traditional Chinese PC data code
IBM-1006 UTF-8 <—> Urdu IBM-1006
IBM-1124 UTF-8 <—> Ukrainian PC data code
IBM-1129 UTF-8 <—> Vietnamese PC data code
TIS-620 UTF-8 <—> Thailand PC data code
IBM-037 UTF-8 <—> USA, Canada EBCDIC
IBM-273 UTF-8 <—> Germany, Austria EBCDIC
IBM-277 UTF-8 <—> Denmark, Norway EBCDIC
IBM-278 UTF-8 <—> Finland, Sweden EBCDIC
IBM-280 UTF-8 <—> Italy EBCDIC
IBM-284 UTF-8 <—> Spain, Latin America EBCDIC
IBM-285 UTF-8 <—> United Kingdom EBCDIC
IBM-297 UTF-8 <—> France EBCDIC
IBM-500 UTF-8 <—> International EBCDIC
IBM-875 UTF-8 <—> Greek EBCDIC
IBM-930 UTF-8 <—> Japanese Katakana-Kanji EBCDIC
IBM-933 UTF-8 <—> Korean EBCDIC
IBM-937 UTF-8 <—> Traditional Chinese EBCDIC
IBM-939 UTF-8 <—> Japanese Latin-Kanji EBCDIC
IBM-1026 UTF-8 <—> Turkish EBCDIC
IBM-1112 UTF-8 <—> Baltic Multilingual EBCDIC
IBM-1122 UTF-8 <—> Estonian EBCDIC
IBM-1124 UTF-8 <—> Ukranian EBCDIC
IBM-1129 UTF-8 <—> Vietnamese EBCDIC
IBM-1381 UTF-8 <—> Simplified Chinese PC data code
GB18030 UTF-8<—> Simplified Chinese
TIS-620 UTF-8 <—> Thailand EBCDIC