Conventions describing Unicode data

When a specific Unicode code point is referenced, it is expressed as U+n where n is four to six hexadecimal digits, using the digits 0-9 and uppercase letters A-F.

Leading zeros are omitted unless the code point would have fewer than four hexadecimal digits. The space character, for example, is expressed as U+0020. In most cases, the n value is the same as the UTF-16BE encoding.