Multibyte code set converters

This section describes the code sets that multibyte code set converters use to convert.

Multibyte code set converters convert characters among the following code sets:

  • PC multibyte code sets
  • EUC multibyte code sets (ISO-based)
  • EBCDIC multibyte code sets

The following table lists code set names that are compatible. Each line defines to/from strings that may be used when requesting a converter.

Table 1. Code Set Compatibility
Language PC ISO EBCDIC
Japanese IBM-932 IBM-eucJP IBM-930, IBM-939
Japanese (MS compatible) IBM-943 IBM-eucJP IBM-930, IBM-939
Korean IBM-934 IBM-eucKR IBM-933
Traditional Chinese IBM-938, big-5 IBM-eucTW IBM-937
Simplified Chinese IBM-1381 IBM-eucCN IBM-935
  1. Conversions between Simplified and Traditional Chinese are provided (IBM-eucTW <—> IBM-eucCN and big5 <—> IBM-eucCN).
  2. UTF-8 is an additional code set.