TEI Character Encoding Workgroup

The TEI Character Encoding Workgroup, chaired by Christian Wittern, began its work in 2003. The group completed its work in 2005.

Draft Documents for P5

Draft Papers

Meetings and Reports

Some use cases

Typographic Regularization in the WWP Textbase A proposal for ACH/ALLC 2001 by Jacqueline H. Russom and Sydney D. Bauman (Scholarly Technology Group, Brown University)

How to refer to characters/glyphs not in the document character set

The SVG Specification uses an element AltGlyph to refer to variant glyphs
MathML uses an element <mglyph> for "presentation glyphs".
Unicode has specific and generic Variation Selectors (U+FE00~U+FE0F), see (Unicode Consortium) Standardized Variants. The usage of these is also discussed in the document Unicode in XML and other Markup Languages mentioned above.

Character semantics

Unicode defines character semantics in the Unicode Character Database (UCD, available at UnicodeData.txt; here is an explanation of its contents: Unicode Data File Format, see also: (Unicode Consortium, UTR Draft) Unicode Technical Report #23 CHARACTER Properties
(Unicode Consortium, TUS Annex 21) Case Mappings
(Unicode Consortium, UTR Draft) Unicode Technical Report #30 Character Foldings
(Unicode Consortium, TUS Annex 15) Unicode Normalization Forms

Last recorded change to this page: 2007-09-16 • For corrections or updates, contact webmaster AT tei-c DOT org