UTF-8 interchange converters
UTF-8 is a universal, multibyte encoding. Conversions are provided in both directions between UTF-8 and each of the code sets listed in the following table.
UTF-8 conversions are usually performed by using Universal_UCS_Conv and the /usr/lib/nls/loc/uconv/UTF-8 converter.
Converter | Description |
---|---|
ISO8859-1 | UTF-8 <—> ISO Latin-1 |
ISO8859-2 | UTF-8 <—> ISO Latin-2 |
ISO8859-3 | UTF-8 <—> ISO Latin-3 |
ISO8859-4 | UTF-8 <—> ISO Baltic |
ISO8859-5 | UTF-8 <—> ISO Cyrillic |
ISO8859-6 | UTF-8 <—> ISO Arabic |
ISO8859-7 | UTF-8 <—> ISO Greek |
ISO8859-8 | UTF-8 <—> ISO Hebrew |
ISO8859-9 | UTF-8 <—> ISO Turkish |
JISX0201.1976-0 | UTF-8 <—> Japanese JISX0201-0 |
JISX0208.1983-0 | UTF-8 <—> Japanese JISX0208-0 |
CNS11643.1986-1 | UTF-8 <—> Chinese CNS11643-1 |
CNS11643.1986-2 | UTF-8 <—> Chinese CNS11643-2 |
KSC5601.1987-0 | UTF-8 <—> Korean KSC5601-0 |
IBM-eucCN | UTF-8 <—> Simplified Chinese EUC |
IBM-eucJP | UTF-8 <—> Japanese EUC |
IBM-eucKR | UTF-8 <—> Korean EUC |
IBM-eucTW | UTF-8 <—> Traditional Chinese EUC |
IBM-udcJP | UTF-8 <—> Japanese user-defined characters |
IBM-udcTW | UTF-8 <—> Traditional Chinese user-defined characters |
IBM-sbdTW | UTF-8 <—> Traditional Chinese IBM-specific characters |
UCS-2 | UTF-8 <—> UCS-2 |
IBM-437 | UTF-8 <—> USA PC data code |
IBM-850 | UTF-8 <—> Latin-1 PC data code |
IBM-852 | UTF-8 <—> Latin-2 PC data code |
IBM-857 | UTF-8 <—> Turkish PC data code |
IBM-860 | UTF-8 <—> Portuguese PC data code |
IBM-861 | UTF-8 <—> Icelandic PC data code |
IBM-863 | UTF-8 <—> French Canadian PC data code |
IBM-865 | UTF-8 <—> Nordic PC data code |
IBM-868 | UTF-8 <—> Urdu IBM-868 |
IBM-869 | UTF-8 <—> Greek PC data code |
IBM-918 | UTF-8 <—> Urdu IBM-918 |
IBM-921 | UTF-8 <—> Baltic Multilingual data code |
IBM-922 | UTF-8 <—> Estonian data code |
IBM-932 | UTF-8 <—> Japanese PC data code |
IBM-943 | UTF-8 <—> Japanese PC data code |
IBM-934 | UTF-8 <—> Korean PC data code |
IBM-935 | UTF-8 <—> Simplified Chinese EBCDIC |
IBM-936 | UTF-8 <—> People's Republic of China PC data code |
IBM-938 | UTF-8 <—> Taiwanese PC data code |
IBM-942 | UTF-8 <—> Extended Japanese PC data code |
IBM-944 | UTF-8 <—> Korean PC data code |
IBM-946 | UTF-8 <—> People's Republic of China SAA data code |
IBM-948 | UTF-8 <—> Traditional Chinese PC data code |
IBM-1006 | UTF-8 <—> Urdu IBM-1006 |
IBM-1124 | UTF-8 <—> Ukrainian PC data code |
IBM-1129 | UTF-8 <—> Vietnamese PC data code |
TIS-620 | UTF-8 <—> Thailand PC data code |
IBM-037 | UTF-8 <—> USA, Canada EBCDIC |
IBM-273 | UTF-8 <—> Germany, Austria EBCDIC |
IBM-277 | UTF-8 <—> Denmark, Norway EBCDIC |
IBM-278 | UTF-8 <—> Finland, Sweden EBCDIC |
IBM-280 | UTF-8 <—> Italy EBCDIC |
IBM-284 | UTF-8 <—> Spain, Latin America EBCDIC |
IBM-285 | UTF-8 <—> United Kingdom EBCDIC |
IBM-297 | UTF-8 <—> France EBCDIC |
IBM-500 | UTF-8 <—> International EBCDIC |
IBM-875 | UTF-8 <—> Greek EBCDIC |
IBM-930 | UTF-8 <—> Japanese Katakana-Kanji EBCDIC |
IBM-933 | UTF-8 <—> Korean EBCDIC |
IBM-937 | UTF-8 <—> Traditional Chinese EBCDIC |
IBM-939 | UTF-8 <—> Japanese Latin-Kanji EBCDIC |
IBM-1026 | UTF-8 <—> Turkish EBCDIC |
IBM-1112 | UTF-8 <—> Baltic Multilingual EBCDIC |
IBM-1122 | UTF-8 <—> Estonian EBCDIC |
IBM-1124 | UTF-8 <—> Ukrainian EBCDIC |
IBM-1129 | UTF-8 <—> Vietnamese EBCDIC |
IBM-1381 | UTF-8 <—> Simplified Chinese PC data code |
GB18030 | UTF-8 <—> Simplified Chinese |
TIS-620 | UTF-8 <—> Thailand EBCDIC |
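
The two-directional nature of these conversions can be illustrated with a short sketch. It uses Python's built-in codecs rather than the converters listed above, so it only demonstrates the round trip between a code set and UTF-8. The `PYTHON_CODEC` mapping and the `to_utf8` and `from_utf8` helpers are hypothetical names introduced for this example, and the Python codecs are assumed to be close equivalents of the named code sets; individual character mappings may differ from the system converters.

```python
# Illustration only: Python's built-in codecs stand in for the converters in
# the table. The mapping below is an assumption made for this example.
PYTHON_CODEC = {
    "ISO8859-1": "iso8859-1",  # ISO Latin-1
    "IBM-850": "cp850",        # Latin-1 PC data code
    "IBM-eucJP": "euc_jp",     # Japanese EUC
    "IBM-037": "cp037",        # USA, Canada EBCDIC
}


def to_utf8(data: bytes, code_set: str) -> bytes:
    """Convert bytes encoded in code_set to UTF-8."""
    return data.decode(PYTHON_CODEC[code_set]).encode("utf-8")


def from_utf8(data: bytes, code_set: str) -> bytes:
    """Convert UTF-8 bytes to the encoding named by code_set."""
    return data.decode("utf-8").encode(PYTHON_CODEC[code_set])


if __name__ == "__main__":
    samples = {
        "ISO8859-1": "caf\u00e9",
        "IBM-850": "caf\u00e9",
        "IBM-eucJP": "\u65e5\u672c\u8a9e",
        "IBM-037": "Hello",
    }
    for code_set, text in samples.items():
        native = text.encode(PYTHON_CODEC[code_set])
        utf8 = to_utf8(native, code_set)
        # Converting back must reproduce the original bytes.
        assert from_utf8(utf8, code_set) == native
        print(code_set, native.hex(), "<->", utf8.hex())
```

The same character can occupy a different number of bytes on each side of the conversion: a single-byte Latin-1 character above 0x7F becomes two bytes in UTF-8, so buffers for converted output should not be sized from the input length alone.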