Code page and Coded Character Set Identifier (CCSID) numbers for Unicode graphic data

Within IBM®, the UTF-16 code page has been registered as code page 1200, with a growing character set. When new characters are added to a code page, the code page number does not change. Code page 1200 always refers to the current version of Unicode.

A specific version of the Unicode standard, as defined by Unicode 2.0 and ISO/IEC 10646-1, has also been registered within IBM as CCSID 13488. This CCSID has been used internally by Db2® for storing graphic string data in IBM eucJP (Japan) and IBM eucTW (Taiwan) databases. CCSID 13488 and code page 1200 both refer to UTF-16, and are handled the same way, except for the value of their "double-byte" (DBCS) space:
CP/CCSID Single-byte (SBCS) space Double-byte (DBCS) space
1200 N/A U+0020
13488 N/A U+3000
Note: In a Unicode database, U+3000 has no special meaning.

The same conversion tables are used for both code page 1200 and CCSID 13488.

Within IBM, UTF-8 has been registered as CCSID 1208 with growing character set (sometimes also referred to as code page 1208). As new characters are added to the standard, this number (1208) will not change.

The MBCS code page number is 1208, which is the database code page number, and the code page of character string data within the database. The double-byte code page number for UTF-16 is 1200, which is the code page of graphic string data within the database.