3 Elements Available in All TEI Documents

This chapter describes elements which may appear in any kind of text and the tags used to mark them in all TEI documents. Most of these elements are freely floating phrases, which can appear at any point within the textual structure, although they should generally be contained by a higher-level element of some kind (such as a paragraph). A few of the elements described in this chapter (for example, bibliographic citations and lists) have a comparatively well-defined internal structure, but most of them have no consistent inner structure of their own. In the general case, they contain only a few words, and are often identifiable in a conventionally printed text by the use of typographic conventions such as shifts of font, use of quotation or other punctuation marks, or other changes in layout.

This chapter begins by describing the p tag used to mark paragraphs, the prototypical formal unit for running text in many TEI modules. This is followed, in section 3.2 Treatment of Punctuation, by a discussion of some specific problems associated with the interpretation of conventional punctuation, and the methods proposed by these Guidelines for resolving ambiguities therein.

The next section (section 3.3 Highlighting and Quotation) describes a number of phrase-level elements commonly marked by typographic features (and thus well-represented in conventional markup languages). These include features commonly marked by font shifts (section 3.3.2 Emphasis, Foreign Words, and Unusual Language) and features commonly marked by quotation marks (section 3.3.3 Quotation) as well as such features as terms, cited words, and glosses (section 3.4 Terms and Glosses, Ruby Annotations, and Equivalents and Descriptions).

Section 3.5 Simple Editorial Changes introduces some phrase-level elements which may be used to record simple editorial interventions, such as emendation or correction of the encoded text. The elements described here constitute a simple subset of the full mechanisms for encoding such information (described in full in chapter 11 Representation of Primary Sources), which should be adequate to most commonly encountered situations.

The next section (section 3.6 Names, Numbers, Dates, Abbreviations, and Addresses) describes several phrase-level and inter-level elements which, although often of interest for analysis or processing, are rarely explicitly identified in conventional printing. These include names (section 3.6.1 Referring Strings), numbers and measures (section 3.6.3 Numbers and Measures), dates and times (section 3.6.4 Dates and Times), abbreviations (section 3.6.5 Abbreviations and Their Expansions), and addresses (section 3.6.2 Addresses).

In the same way, the following section (section 3.7 Simple Links and Cross-References) presents only a subset of the facilities available for the encoding of cross-references or text-linkage. The full story may be found in chapter 16 Linking, Segmentation, and Alignment; the tags presented here are intended to be usable for a wide variety of simple applications.

Sections 3.8 Lists and 3.9 Notes, Annotation, and Indexing describe two kinds of quasi-structural elements: lists and notes. These may appear either within chunk-level elements such as paragraphs, or between them. Lists of several kinds, and of arbitrary complexity, are catered for. The section on notes discusses both notes found in the source and simple mechanisms for adding annotations of an interpretive nature during the encoding; again, only a subset of the facilities described in full elsewhere (specifically, in chapter 17 Simple Analytic Mechanisms) is discussed.

Section 3.10 Graphics and Other Non-textual Components introduces some simple ways of representing graphic or other non-textual content found in a text. A fuller discussion of the multimedia facilities supported by these Guidelines may be found in chapters 14 Tables, Formulæ, Graphics and Notated Music and 16 Linking, Segmentation, and Alignment.

Next, section 3.11 Reference Systems describes methods of encoding within a text the conventional system or systems used when making references to the text. Some reference systems have attained canonical authority and should be recorded to make the text usable in normal work; in other cases, a convenient reference system should be created by the creator or analyst of an electronic text.

Like lists and notes, the bibliographic citations discussed in section 3.12 Bibliographic Citations and References, may be regarded as structural elements in their own right. A range of possibilities is presented for the encoding of bibliographic citations or references, which may be treated as simple phrases within a running text, or as highly-structured components suitable for inclusion in a bibliographic database.

Additional elements for the encoding of passages of verse or drama (whether prose or verse) are discussed in section 3.13 Passages of Verse or Drama.

The chapter concludes with a technical overview of the structure and organization of the module described here. This should be read in conjunction with chapter 1 The TEI Infrastructure, describing the structure of the TEI document type definition.

3.1 Paragraphs

The paragraph is the fundamental organizational unit for all prose texts, being the smallest regular unit into which prose can be divided. Prose can appear in all TEI texts, even those that are primarily of another genre (e.g., verse); thus the paragraph is described here, as an element which can appear in any kind of text.

Paragraphs can contain any of the other elements described within this chapter, as well as some other elements which are specific to individual text types. We distinguish phrase-level elements, which must be entirely contained within a paragraph or similar structure and cannot appear except within one, from chunks, which can appear between, but not within, paragraphs, and from inter-level elements, which can appear either within a single paragraph or between paragraphs. The class of phrases includes emphasized or quoted phrases, names, dates, etc. The class of inter-level elements includes bibliographic citations, notes, lists, etc. The class of chunks includes the paragraph itself, and other elements which have similar structural properties, notably the ab (anonymous block) element (described in section 16.3 Blocks, Segments, and Anchors), which may be used as an alternative to the paragraph in some kinds of texts.
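The three classes distinguished above can be illustrated with a short sketch. The sentence content is purely illustrative, and the choice of name, date, and list to represent each class is only one possibility among the elements this chapter describes.

```xml
<div>
  <!-- Chunk: p appears between other chunks, never inside a paragraph -->
  <p>The survey ship <name>Beagle</name> sailed in <date>1831</date>.
    <!-- name and date are phrase-level: they must sit inside a paragraph or similar structure -->
    Its instruments included:
    <!-- Inter-level: this list appears within a paragraph -->
    <list>
      <item>chronometers</item>
      <item>a sextant</item>
    </list>
  </p>
  <!-- Inter-level: the same element type appears here between paragraphs -->
  <list>
    <item>first port of call</item>
    <item>second port of call</item>
  </list>
  <p>A second paragraph follows the list.</p>
</div>
```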

Because paragraphs may appear in different base or additional tag sets, their possible contents may differ in different kinds of documents. In particular, additional elements not listed in this chapter may appear in paragraphs in certain kinds of text. However, the elements described in this chapter are always by default available in all kinds of text.

The paragraph is marked using the p element:

p (paragraph) marks paragraphs in prose.

If a consistent internal subdivision of paragraphs is desired, the s or seg (‘segment’) elements may be used, as discussed in chapters 17 Simple Analytic Mechanisms and 16 Linking, Segmentation, and Alignment respectively. More usually, however, paragraphs have no firm internal structure, but contain prose encoded as a mix of characters, entity references, phrases marked as described in the rest of this chapter, and embedded elements like lists, figures, or tables.
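As a minimal sketch of such a subdivision, the following paragraph is divided exhaustively into sentences using s. The wording is invented for illustration, and the example assumes that the module defining s has been included in the schema, since s is not among the elements described in this chapter.

```xml
<p>
  <s>Paragraphs usually have no firm internal structure.</s>
  <s>A project may nevertheless divide each one into sentences.</s>
</p>
```

The same division could be expressed with the more general seg element, for example as seg with a type value such as "sentence"; that value is a hypothetical project convention rather than a prescribed one.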

Since paragraphs are usually explicitly marked in Western texts, typically by indentation, the application of the p tag usually presents few problems.

In some cases, the body of a text may comprise but a single paragraph:

<body>
 <p>I fully appreciate Gen. Pope's splendid achievements with their
 invaluable results; but you must know that Major Generalships in the
 Regular Army, are not as plenty as blackberries.</p>
</body>

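direct	indicates whether the quoted matter is regarded as direct or indirect speech.
aloud	indicates whether the quoted matter is regarded as having been vocalized or signed.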

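uri	(uniform resource identifier) expresses the significance of the parent element by means of an external identifier.
filter	references an external script which transforms this element into standard XML data.
name	expresses the significance of the parent element.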
predicate [att.predicate]	the condition under which the element bearing this attribute applies, given as an XPath predicate expression.

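cert	(certainty) indicates the degree of certainty associated with the intervention or interpretation.
resp	(responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber.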

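unit	names the unit used for the measurement. Suggested values include: 1] cm (centimetres); 2] mm (millimetres); 3] in (inches); 4] line; 5] char (characters)
quantity	specifies the size in the units specified.
extent	indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.
precision	characterizes the precision of the values specified by the other attributes.
scope	where the measurement covers more than one object, specifies the applicability of this measurement. Sample values include: 1] all; 2] most; 3] range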

type	characterizes the element by means of some classification.
subtype	(subtype) provides a sub-categorization of the element, if needed.

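key	provides an externally-defined means of identifying the entity being named, using a coded value of some kind.
ref	(reference) provides an explicit means of locating a full definition or identity for the entity being named by means of one or more URIs.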

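model.nameLike.agent	groups elements which contain names of individuals or corporate bodies.
model.offsetLike	groups elements which can appear as part of a place name.
model.persNamePart	groups elements which form part of a personal name.
model.placeStateLike	groups elements which describe changing states of a place.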

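idno	(identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, or an organization, in a standardized way.
lang	(language name) contains the name of a language mentioned in etymological or other linguistic discussion.
objectName	(name of an object) contains a proper noun or noun phrase used to refer to an object.
rs	(referencing string) contains a general purpose name or referring string.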

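addName	(additional name) contains an additional name component, such as a nickname, epithet, or alias used within a personal name.
forename	(forename) contains the part of a person's name that is the given name, that is, the part identifying the individual.
genName	(generational name component) contains a name component used to distinguish otherwise similar names on the basis of the relative ages or generations of the persons named.
nameLink	(name link) contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or of.
persPronouns	(personal pronouns) indicates the personal pronouns used, or assumed to be used, by the individual being described.
roleName	(role name) contains a name component which indicates that the referent has a particular role or position in society, such as an official title or rank.
surname	(surname) contains a family (inherited) name, as opposed to a given, baptismal, or nick name.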

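bloc	(bloc) contains the name of a geo-political unit spanning two or more countries or regions.
country	(country) contains the name of a geo-political unit corresponding to a single state, such as a nation, colony, or commonwealth, larger than an administrative region and smaller than a bloc.
district	(district) contains the name of any kind of subdivision of a settlement, such as a parish, ward, or other administrative or geographic unit.
geogName	(geographical name) identifies a name associated with some geographical feature such as Windrush Valley or Mount Sinai.
placeName	(place name) contains an absolute or relative place name.
region	(region) contains the name of an administrative unit such as a province or county, larger than a settlement but smaller than a country.
settlement	(settlement) contains the name of a settlement such as a city, town, or village identified as a single geo-political or administrative unit.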

type	indicates the type of numeric value. Suggested values include: 1] cardinal; 2] ordinal; 3] fraction; 4] percentage
value	supplies the value of the number in standard form.

atLeast	gives a minimum estimated value for the approximate measurement.
atMost	gives a maximum estimated value for the approximate measurement.

quantity	(quantity) specifies the number of units that make up the measurement.
unit	(unit) indicates the units used for the measurement, usually by means of the standard symbol. Suggested values include: 1] m (metre); 2] kg (kilogram); 3] s (second); 4] Hz (hertz); 5] Pa (pascal); 6] Ω (ohm); 7] L (litre); 8] t (tonne); 9] ha (hectare); 10] Å (ångström); 11] mL (millilitre); 12] cm (centimetre); 13] dB (decibel); 14] kbit (kilobit); 15] Kibit (kibibit); 16] kB (kilobyte); 17] KiB (kibibyte); 18] MB (megabyte); 19] MiB (mebibyte)
commodity	(commodity) indicates the substance that is being measured.
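The attributes listed above might be combined as in the following sketch. It assumes they are being used on the num and measure elements treated in section 3.6.3 Numbers and Measures; the sentence itself is invented for illustration.

```xml
<p>She bought <measure quantity="2" unit="kg" commodity="rice">two kilos of rice</measure>
  and gave away <num type="fraction" value="0.5">one half</num> of it.</p>
```

Recording the normalized quantity and unit as attributes leaves the transcribed wording untouched while still allowing the measurement to be processed.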

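target	specifies the destination of the reference by supplying one or more URIs.
evaluate	(evaluate) specifies the intended meaning when the target of a pointer is itself a pointer.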

width	Where the media are displayed, indicates the display width
height	Where the media are displayed, indicates the display height
scale	Where the media are displayed, indicates a scale factor to be applied when generating the desired display size

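bibl	(bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
biblFull	(fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file description are present.
biblStruct	(structured bibliographic citation) contains a structured bibliographic citation, in which bibliographic sub-elements appear in a specified order.
listBibl	(citation list) contains a list of bibliographic citations.
msDesc	(manuscript description) contains a description of a single identifiable manuscript.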

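biblScope	(scope of bibliographic reference) defines the scope of a bibliographic reference, for example as page numbers or a named subdivision of a larger work.
distributor	(distributor) supplies the name of a person or other agency responsible for the distribution of a text.
publisher	(publisher) provides the name of the organization responsible for the publication or distribution of a bibliographic item.
pubPlace	(publication place) contains the name of the place where a bibliographic item was published.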

date	(date) contains a date.
time	(time) contains a phrase expressing a time.

mainLang	(main language) supplies a code which identifies the chief language used in the manuscript.
otherLangs	(other languages) one or more codes identifying any other languages used in the manuscript.

P5: TEI Guidelines

3 Elements Available in All TEI Documents
3.1 Paragraphs
3.2 Treatment of Punctuation
3.2.1 Functions of Punctuation
3.2.2 Hyphenation
3.3 Highlighting and Quotation
3.3.1 What Is Highlighting?
3.3.2 Emphasis, Foreign Words, and Unusual Language
3.3.2.1 Foreign Words or Expressions
3.3.2.2 Emphatic Words and Phrases
3.3.2.3 Other Linguistically Distinct Material
3.3.3 Quotation
3.4 Terms and Glosses, Ruby Annotations, and Equivalents and Descriptions
3.4.1 Terms and Glosses
3.4.1.1 Some Further Examples
3.4.2 Ruby Annotations
3.4.3 Equivalents and Descriptions
3.5 Simple Editorial Changes
3.5.1 Apparent Errors
3.5.2 Regularization and Normalization
3.5.3 Additions, Deletions, and Omissions
3.6 Names, Numbers, Dates, Abbreviations, and Addresses
3.6.1 Referring Strings
3.6.2 Addresses
3.6.3 Numbers and Measures
3.6.4 Dates and Times
3.6.5 Abbreviations and Their Expansions
3.7 Simple Links and Cross-References
3.8 Lists
3.9 Notes, Annotation, and Indexing
3.9.1 Notes and Simple Annotation
3.9.1.1 Encoding Grouped Notes
3.9.2 Index Entries
3.9.2.1 Pre-existing Indexes
3.9.2.2 Auto-generated Indexes
3.10 Graphics and Other Non-textual Components
3.11 Reference Systems
3.11.1 Using the xml:id and n Attributes
3.11.2 Creating New Reference Systems
3.11.2.1 Referencing system derived from markup
3.11.2.2 Referencing systems based on project conventions
3.11.3 Milestone Elements
3.11.4 Declaring Reference Systems
3.12 Bibliographic Citations and References
3.12.1 Methods of Encoding Bibliographic References and Lists of References
3.12.2 Components of Bibliographic References
3.12.2.1 Analytic, Monographic, and Series Levels
3.12.2.2 Titles, Authors, and Editors
3.12.2.3 Document Identifiers
3.12.2.4 Imprint, Size of a Document, and Reprint Information
3.12.2.5 Scopes and Ranges in Bibliographic Citations
3.12.2.6 Series Information
3.12.2.7 Related Items
3.12.2.8 Notes and Statement of Language
3.12.2.9 Order of Components within References
3.12.3 Bibliographic Pointers
3.12.4 Relationship to Other Bibliographic Schemes
3.13 Passages of Verse or Drama
3.13.1 Core Tags for Verse
3.13.2 Core Tags for Drama
3.14 Overview of the Core Module