language data type

The xs:language data type represents a natural language identifier as defined by RFC 3066. It is derived from the xs:token data type.

The lexical form of xs:language is a string of one or more tags connected by hyphens. Each tag contains between one and eight characters. The first tag can contain only ASCII alphabetic characters (a-z, A-Z); subsequent tags can contain ASCII alphabetic and numeric characters. For example, the value en-US represents the English language as used in the United States. A valid string conforms to the pattern [a-zA-Z]{1,8}(-[a-zA-Z0-9]{1,8})*.
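
The pattern above can be checked with an ordinary regular expression. The following Python sketch is illustrative only: the helper name is_valid_language is hypothetical, and the function tests the lexical form alone. It does not verify that a tag is a registered language code, and it assumes any whitespace normalization inherited from xs:token has already been applied to the input.

```python
import re

# Pattern quoted from the description above. fullmatch() requires the
# entire string to match, so no explicit ^ and $ anchors are needed.
LANGUAGE_PATTERN = re.compile(r"[a-zA-Z]{1,8}(-[a-zA-Z0-9]{1,8})*")


def is_valid_language(value: str) -> bool:
    """Return True if value matches the xs:language lexical pattern.

    Hypothetical helper: checks the lexical form only, not whether the
    tags are registered language or country codes.
    """
    return LANGUAGE_PATTERN.fullmatch(value) is not None


if __name__ == "__main__":
    print(is_valid_language("en-US"))  # True: two tags, letters only
    print(is_valid_language("en_US"))  # False: underscore is not a valid separator
```

Values such as 1en (first tag contains a digit), en- (empty trailing tag), and a first tag longer than eight characters are rejected by the same check.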