UTF-8 interchange converters
UTF-8 is a universal, multibyte encoding. For each code set listed in this section, converters are provided in both directions: from the code set to UTF-8 and from UTF-8 to the code set.
UTF-8 conversions are usually performed with the Universal_UCS_Conv and /usr/lib/nls/loc/uconv/UTF-8 converters.
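The two directions of conversion can be illustrated with a minimal sketch. It assumes Python's built-in codecs as a stand-in for the system converters; the helper names `to_utf8` and `from_utf8` are hypothetical and are not part of any converter interface described here.

```python
# Illustration only: Python's built-in codecs stand in for the system
# converters. The helper names below are hypothetical.

def to_utf8(data: bytes, code_set: str) -> bytes:
    """Convert bytes encoded in code_set to UTF-8."""
    return data.decode(code_set).encode("utf-8")


def from_utf8(data: bytes, code_set: str) -> bytes:
    """Convert UTF-8 bytes to code_set."""
    return data.decode("utf-8").encode(code_set)


# "café" in ISO8859-1: the accented letter is the single byte 0xE9.
latin1_bytes = b"caf\xe9"

# Code set -> UTF-8: the accented letter becomes the two bytes 0xC3 0xA9.
utf8_bytes = to_utf8(latin1_bytes, "iso8859-1")

# UTF-8 -> code set: the round trip restores the original bytes.
round_trip = from_utf8(utf8_bytes, "iso8859-1")

print(utf8_bytes)                  # b'caf\xc3\xa9'
print(round_trip == latin1_bytes)  # True
```

The single-byte input grows by one byte in UTF-8 because characters outside ASCII need a multibyte sequence, which is why buffers for the UTF-8 side are typically sized larger than the source data.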
| Converter | Description |
|---|---|
| ISO8859-1 | UTF-8 <—> ISO Latin-1 |
| ISO8859-2 | UTF-8 <—> ISO Latin-2 |
| ISO8859-3 | UTF-8 <—> ISO Latin-3 |
| ISO8859-4 | UTF-8 <—> ISO Baltic |
| ISO8859-5 | UTF-8 <—> ISO Cyrillic |
| ISO8859-6 | UTF-8 <—> ISO Arabic |
| ISO8859-7 | UTF-8 <—> ISO Greek |
| ISO8859-8 | UTF-8 <—> ISO Hebrew |
| ISO8859-9 | UTF-8 <—> ISO Turkish |
| JISX0201.1976-0 | UTF-8 <—> Japanese JISX0201-0 |
| JISX0208.1983-0 | UTF-8 <—> Japanese JISX0208-0 |
| CNS11643.1986-1 | UTF-8 <—> Chinese CNS11643-1 |
| CNS11643.1986-2 | UTF-8 <—> Chinese CNS11643-2 |
| KSC5601.1987-0 | UTF-8 <—> Korean KSC5601-0 |
| IBM-eucCN | UTF-8 <—> Simplified Chinese EUC |
| IBM-eucJP | UTF-8 <—> Japanese EUC |
| IBM-eucKR | UTF-8 <—> Korean EUC |
| IBM-eucTW | UTF-8 <—> Traditional Chinese EUC |
| IBM-udcJP | UTF-8 <—> Japanese user-defined characters |
| IBM-udcTW | UTF-8 <—> Traditional Chinese user-defined characters |
| IBM-sbdTW | UTF-8 <—> Traditional Chinese IBM-specific characters |
| UCS-2 | UTF-8 <—> UCS-2 |
| IBM-437 | UTF-8 <—> USA PC data code |
| IBM-850 | UTF-8 <—> Latin-1 PC data code |
| IBM-852 | UTF-8 <—> Latin-2 PC data code |
| IBM-857 | UTF-8 <—> Turkish PC data code |
| IBM-860 | UTF-8 <—> Portuguese PC data code |
| IBM-861 | UTF-8 <—> Icelandic PC data code |
| IBM-863 | UTF-8 <—> French Canadian PC data code |
| IBM-865 | UTF-8 <—> Nordic PC data code |
| IBM-868 | UTF-8 <—> Urdu IBM-868 |
| IBM-869 | UTF-8 <—> Greek PC data code |
| IBM-918 | UTF-8 <—> Urdu IBM-918 |
| IBM-921 | UTF-8 <—> Baltic Multilingual data code |
| IBM-922 | UTF-8 <—> Estonian data code |
| IBM-932 | UTF-8 <—> Japanese PC data code |
| IBM-943 | UTF-8 <—> Japanese PC data code |
| IBM-934 | UTF-8 <—> Korean PC data code |
| IBM-935 | UTF-8 <—> Simplified Chinese EBCDIC |
| IBM-936 | UTF-8 <—> People's Republic of China PC data code |
| IBM-938 | UTF-8 <—> Taiwanese PC data code |
| IBM-942 | UTF-8 <—> Extended Japanese PC data code |
| IBM-944 | UTF-8 <—> Korean PC data code |
| IBM-946 | UTF-8 <—> People's Republic of China SAA data code |
| IBM-948 | UTF-8 <—> Traditional Chinese PC data code |
| IBM-1006 | UTF-8 <—> Urdu IBM-1006 |
| IBM-1124 | UTF-8 <—> Ukrainian PC data code |
| IBM-1129 | UTF-8 <—> Vietnamese PC data code |
| TIS-620 | UTF-8 <—> Thailand PC data code |
| IBM-037 | UTF-8 <—> USA, Canada EBCDIC |
| IBM-273 | UTF-8 <—> Germany, Austria EBCDIC |
| IBM-277 | UTF-8 <—> Denmark, Norway EBCDIC |
| IBM-278 | UTF-8 <—> Finland, Sweden EBCDIC |
| IBM-280 | UTF-8 <—> Italy EBCDIC |
| IBM-284 | UTF-8 <—> Spain, Latin America EBCDIC |
| IBM-285 | UTF-8 <—> United Kingdom EBCDIC |
| IBM-297 | UTF-8 <—> France EBCDIC |
| IBM-500 | UTF-8 <—> International EBCDIC |
| IBM-875 | UTF-8 <—> Greek EBCDIC |
| IBM-930 | UTF-8 <—> Japanese Katakana-Kanji EBCDIC |
| IBM-933 | UTF-8 <—> Korean EBCDIC |
| IBM-937 | UTF-8 <—> Traditional Chinese EBCDIC |
| IBM-939 | UTF-8 <—> Japanese Latin-Kanji EBCDIC |
| IBM-1026 | UTF-8 <—> Turkish EBCDIC |
| IBM-1112 | UTF-8 <—> Baltic Multilingual EBCDIC |
| IBM-1122 | UTF-8 <—> Estonian EBCDIC |
| IBM-1124 | UTF-8 <—> Ukrainian EBCDIC |
| IBM-1129 | UTF-8 <—> Vietnamese EBCDIC |
| IBM-1381 | UTF-8 <—> Simplified Chinese PC data code |
| GB18030 | UTF-8 <—> Simplified Chinese |
| TIS-620 | UTF-8 <—> Thailand EBCDIC |
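
Because UTF-8 is universal and each code set in the table covers only part of its repertoire, the two directions are not symmetric: any text in a listed code set can be represented in UTF-8, but not every UTF-8 character exists in a given code set. The sketch below illustrates this under stated assumptions. It uses Python's built-in codecs rather than the system converters, and the `PYTHON_CODEC` mapping and the helper names are hypothetical. How the system converters handle an unrepresentable character is not covered here.

```python
# Hypothetical mapping from converter names in the table to Python codec
# names. Python's codecs are a stand-in, not the system converters.
PYTHON_CODEC = {
    "ISO8859-7": "iso8859-7",  # ISO Greek
    "IBM-850": "cp850",        # Latin-1 PC data code
    "IBM-037": "cp037",        # USA, Canada EBCDIC
}


def convert_to_utf8(data: bytes, converter: str) -> bytes:
    """Code set -> UTF-8."""
    return data.decode(PYTHON_CODEC[converter]).encode("utf-8")


def convert_from_utf8(data: bytes, converter: str) -> bytes:
    """UTF-8 -> code set.

    Python raises UnicodeEncodeError when a character has no mapping
    in the target code set.
    """
    return data.decode("utf-8").encode(PYTHON_CODEC[converter])


# "HELLO" in IBM-037 (EBCDIC) converts to plain ASCII bytes in UTF-8.
ebcdic_hello = b"\xc8\xc5\xd3\xd3\xd6"
utf8_hello = convert_to_utf8(ebcdic_hello, "IBM-037")

# Greek capital omega exists in ISO8859-7 but not in IBM-037.
omega_utf8 = "\u03a9".encode("utf-8")
omega_greek = convert_from_utf8(omega_utf8, "ISO8859-7")

try:
    convert_from_utf8(omega_utf8, "IBM-037")
    representable = True
except UnicodeEncodeError:
    representable = False

print(utf8_hello)     # b'HELLO'
print(omega_greek)    # b'\xd9'
print(representable)  # False
```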