data.language
data.language defines the range of attribute values used to identify a particular combination of human language and writing system. See vi.1. Language identification. |
Module | tei — 1 The TEI Infrastructure |
Used by |
Class:
Element:
|
Declaration |
data.language = xsd:language |
Note |
The values for this attribute are language ‘tags’ as defined in
BCP 47. Currently BCP 47 comprises RFC 5646 (which obsoletes
RFC 4646) and RFC 4647; over time, other IETF documents may
succeed these as the best current practice.
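The declared datatype, xsd:language, checks only the lexical shape of a value. As a minimal sketch, assuming the lexical pattern given for xsd:language in XML Schema Part 2 (one to eight letters, followed by any number of hyphen-separated alphanumeric subtags of one to eight characters), that check might be written as follows. The names XSD_LANGUAGE and is_xsd_language are illustrative only.

```python
import re

# Lexical pattern of the XML Schema xsd:language datatype (assumed here):
# one to eight letters, then zero or more hyphen-separated subtags of
# one to eight letters or digits.
XSD_LANGUAGE = re.compile(r"[a-zA-Z]{1,8}(-[a-zA-Z0-9]{1,8})*")


def is_xsd_language(value: str) -> bool:
    """Return True if value matches the xsd:language lexical pattern."""
    return XSD_LANGUAGE.fullmatch(value) is not None


print(is_xsd_language("en-GB"))       # True
print(is_xsd_language("zh-Hant-HK"))  # True
print(is_xsd_language("en_GB"))       # False: underscore is not a separator
print(is_xsd_language("en--GB"))      # False: empty subtag
```

A value that passes this check is not necessarily a meaningful BCP 47 tag; the pattern constrains shape only, not the order or registration of subtags.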
A ‘language tag’, per BCP 47, is assembled
from a sequence of components or subtags separated by
the hyphen character (-, U+002D). The tag
is made of the following subtags, in the following order:
language, script, region, variant, extension, and private use.
Every subtag except the first is optional. If present, each occurs
only once, except the fourth and fifth components (variant and
extension), which are repeatable.
There are two exceptions to the above format. First, there are
language tags in the IANA
registry that do not match the above syntax, but are present
because they have been ‘grandfathered’ from
previous specifications.
Second, an entire language tag can consist of only a private use
subtag. These tags start with x-, and do not need to
follow any further rules established by the IETF and endorsed by
these Guidelines. As with all language tags that make use of private
use subtags, the language in question must be documented in a
corresponding language element in the TEI header.
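The documentation requirement for private use subtags can be checked mechanically. The following is a minimal sketch, assuming that language elements in the TEI header carry the tag in an ident attribute and that the document is in the TEI namespace; the sample header, the tags x-klingon and en-x-pirate, and the function names are all hypothetical and chosen purely for illustration.

```python
import xml.etree.ElementTree as ET

TEI_NS = "{http://www.tei-c.org/ns/1.0}"

# Hypothetical header documenting a single private use tag.
SAMPLE = """
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <profileDesc>
      <langUsage>
        <language ident="x-klingon">Klingon</language>
      </langUsage>
    </profileDesc>
  </teiHeader>
</TEI>
"""


def uses_private_use(tag: str) -> bool:
    """True if the tag is, or contains, a private use sequence (x-...)."""
    return "x" in tag.lower().split("-")


def documented_languages(tei_xml: str) -> set:
    """Collect the ident values of all language elements in the document."""
    root = ET.fromstring(tei_xml)
    return {
        el.get("ident").lower()
        for el in root.iter(TEI_NS + "language")
        if el.get("ident")
    }


def undocumented(tags, tei_xml: str) -> list:
    """Return the private use tags that have no matching language element."""
    known = documented_languages(tei_xml)
    return sorted(
        t for t in tags if uses_private_use(t) and t.lower() not in known
    )


print(undocumented(["en", "x-klingon", "en-x-pirate"], SAMPLE))
# ['en-x-pirate']
```

Tags are compared case-insensitively here, since BCP 47 treats case in language tags as not significant.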
Examples include
The W3C Internationalization Activity has published a useful
introduction to BCP 47, Language
tags in HTML and XML.
|