IBM-eucJP
Extended UNIX Code (EUC) for Japanese is an encoding that consists of single-byte and multibyte characters. The encoding is based on the ISO2022, Japanese Industrial Standard (JIS), and EUC definitions.
The IBM-eucJP code set consists of the following character sets:
Character set | Description |
---|---|
JISCII | JISX0201 Graphic Left character set |
JISX0201.1976 | Katakana Graphic Right character set |
JISX0208.1983 | Kanji level 1 and 2 character sets |
IBM-udcJP | IBM-user definable characters |
The IBM-eucJP code set is also capable of supporting the following:
Character set | Description |
---|---|
JISX0212.1990 | Supplemental Kanji |
The IBM-eucJP code set is encoded as follows:
- CS0 maps JISX0201 Graphic Left characters starting at the 0x00 position.
- CS1 maps the JISX0208 character set starting at the 0xa1a1 position. The positions 0xf5a1 through 0xfefe (940 characters) in CS1 are reserved as primary user-definable character areas.
- CS2 maps JISX0201 Graphic Right characters starting at the 0x8ea1 position.
- CS3 is capable of mapping JISX0212 starting at the 0x8fa1a1 position. The positions 0x8ff5a1 through 0x8ffefe (940 characters) in CS3 are reserved as secondary user-definable character areas. The positions 0x8feea1 through 0x8ff4fe (658 characters) in CS3 are reserved for future system use, so users should not assign characters in this area.
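
The layout above can be sketched as a small classifier that looks at the lead byte of a character to decide which code set it belongs to. This is an illustrative sketch only, not an IBM-supplied routine. The function names `classify`, `area`, and `cells` are hypothetical, and two details are assumptions drawn from general EUC conventions rather than from the text above: CS0 is taken to occupy the single bytes 0x00 through 0x7f, and every non-lead byte of a multibyte character is taken to lie in 0xa1 through 0xfe (94 positions per row).

```python
from typing import Tuple

# Assumption: non-lead bytes of multibyte characters fall in 0xa1..0xfe.
TRAIL_MIN, TRAIL_MAX = 0xA1, 0xFE
# Lead bytes that introduce CS2 and CS3 (taken from 0x8ea1 and 0x8fa1a1 above).
CS2_LEAD, CS3_LEAD = 0x8E, 0x8F


def classify(data: bytes, pos: int = 0) -> Tuple[str, int]:
    """Return (code set name, length in bytes) of the character at data[pos]."""
    lead = data[pos]
    if lead < 0x80:  # assumption: CS0 covers 0x00..0x7f
        return "CS0", 1
    if lead == CS2_LEAD:
        name, length = "CS2", 2
    elif lead == CS3_LEAD:
        name, length = "CS3", 3
    elif TRAIL_MIN <= lead <= TRAIL_MAX:
        name, length = "CS1", 2
    else:
        raise ValueError(f"invalid lead byte 0x{lead:02x} at offset {pos}")
    rest = data[pos + 1 : pos + length]
    if len(rest) != length - 1 or not all(TRAIL_MIN <= b <= TRAIL_MAX for b in rest):
        raise ValueError(f"truncated or invalid character at offset {pos}")
    return name, length


def area(char: bytes) -> str:
    """Name the area of the code set that a single encoded character falls in."""
    name, _ = classify(char)
    if name == "CS1" and 0xF5 <= char[0] <= 0xFE:
        return "primary user-definable"
    if name == "CS3" and 0xF5 <= char[1] <= 0xFE:
        return "secondary user-definable"
    if name == "CS3" and 0xEE <= char[1] <= 0xF4:
        return "reserved for system use"
    return "standard"


def cells(first_row: int, last_row: int) -> int:
    """Count the positions in an inclusive range of rows, 94 positions per row."""
    return (last_row - first_row + 1) * 94
```

Under these assumptions, `cells(0xF5, 0xFE)` gives 940 and `cells(0xEE, 0xF4)` gives 658, which matches the character counts stated for the user-definable and reserved areas. A caller could walk a byte string by repeatedly calling `classify(data, pos)` and advancing `pos` by the returned length.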