IBM-eucCN
The EUC for the Simplified Chinese language is an encoding consisting of characters that contain 1 or 2 bytes. The EUC encoding is based on ISO2022, GB2312 as defined by the People's Republic of China, and multibyte character definitions unique to the manufacturer.
The current GB2312 defines 6,763 Simplified Chinese characters and 682 symbols. The IBM-eucCN is based upon a concept of one plane containing up to 94x94 characters. The encoding values of these characters range from 0xa1a1 to 0xfefe.
The GB2312 is mapped into the CS1 of EUC. Specifically, the IBM-eucCN consists of the following character sets:
Character set | Description |
---|---|
ISO0646-IRV | 7-bit ASCII character set, Graphic Left. |
GB2312.1980 | Contains 7445 characters. It occupies positions 0xa1a1 to 0xfedf (some user-defined characters scattered in 0xa1a1 to 0xfedf). |
IBM-udcCN | Scattered in GB. It occupies positions Oxa1a1 to Oxfedf. The
actual values are:
|
IBM-sbdCN | Scattered in GB. It occupies positions 0xfee0 to 0xfefe. |