Multibyte character code data representation
A multibyte character code is an external representation of data, regardless of whether it is character input from a keyboard or a file on a disk. Within the same code set, the number of bytes that represent the multibyte code of a character can vary. You must use multicultural support functions for character processing to ensure code set independence.
For example, a code set may specify the following character encodings:
C = 0x43
* = 0x81 0x43
*C = 0x81 0x43& 0x43A program searching for C, not accounting for
multibyte characters, finds the second byte of the *C string
and assumes it found C when in fact it found the
second byte of the * (asterisk) character.