data.language
data.language defines the range of attribute values used to identify a particular combination of human language and writing system. vi.1. Language identification | |
Module | tei — 1 The TEI Infrastructure |
Used by |
Class:
Element:
|
Declaration |
data.language = xsd:language |
Note |
The values for this attribute are language ‘tags’ as defined in BCP 47. Currently BCP 47 comprises RFC 4646
and RFC 4647; over time, other IETF documents may succeed these as the best current practice.
A ‘language tag’, per BCP 47, is assembled from a sequence of components
or subtags separated by the hyphen character (-, U+002D).
The tag is made of the following subtags, in the following order. Every subtag except the
first is optional. If present, each occurs only once, except the fourth and fifth components
(variant and extension), which are repeatable.
There are two exceptions to the above format. First, there are language tags in the IANA registry that
do not match the above syntax, but are present because they have been
‘grandfathered’ from previous specifications.
Second, an entire language tag can consist of only a private use subtag. These tags start
with x-, and do not need to follow any further rules established by the IETF and
endorsed by these Guidelines. Like all language tags that make use of private use subtags, the
language in question must be documented in a corresponding language element in the
TEI header.
Examples include
The W3C Internationalization Activity has published a useful introduction to BCP 47, Language
tags in HTML and XML.
|