UTF-32

UTF-32 is an encoding of Unicode in which each character is composed of 4 bytes.

The IBM® i operating system does not support UTF-32 encoding with a CCSID value.

Unicode was originally designed as a pure 16-bit encoding, aimed at representing all modern scripts. Over time, and especially after the addition of over 14 500 composite characters for compatibility with established sets, it became clear that 16 bits were not sufficient for many users. Out of this arose UTF-32.

UTF-32 allows characters to be encoded as 4 bytes at any code point from 00000000 to 0010FFFF. For example, the string ABC in UTF-32 is encoded as x"000000410000004200000043".