IBM-eucJP

The EUC for Japanese is an encoding consisting of single-byte and multibyte characters. The encoding is based on ISO2022, Japanese Industrial Standard (JIS), and EUC definitions.

The IBM-eucJP code set consists of the following character sets:

Character set Description
JISCII JISX0201 Graphic Left character set
JISX0201.1976 Katakana/Hiragana Graphic Right character set
JISX0208.1983 Kanji level 1 and 2 character sets
IBM-udcJP IBM-user definable characters

The IBM-eucJP code set is also capable of supporting the following:

Character set Description
JISX0212.1990 Supplemental Kanji

The IBM-eucJP code set is encoded as follows:

  • CS0 maps JISX0201 Graphic Left characters starting at the 0x00 position.
  • CS1 maps the JISX0208 character set starting at the 0xa1xa1 position. The positions 0xf5a1 through 0xfefe (940 characters) in CS1 are reserved as primary user-definable character areas.
  • CS2 maps the JISX0201 Graphic Right starting at the 0x8ea1 position.
  • CS3 is capable of mapping JISX0212 starting at the 0x8fa1a1 position. The positions 0x8ff5a1 through 0x8ffefe in CS3 (940 characters) are reserved as secondary user-definable character areas. The positions 0x8feea1 through 0x8ff4fe in CS3 (658 characters) are reserved for future system use. Therefore, users should not use this area.