3 Elements Available in All TEI Documents

This chapter describes elements which may appear in any kind of text and the tags used to mark them in all TEI documents. Most of these elements are freely floating phrases, which can appear at any point within the textual structure, although they must generally be contained by a higher-level element of some kind (such as a paragraph). A few of the elements described in this chapter (for example, bibliographic citations and lists) have a comparatively well-defined internal structure, but most of them have no consistent inner structure of their own. In the general case, they contain only a few words, and are often identifiable in a conventionally printed text by the use of typographic conventions such as shifts of font, use of quotation or other punctuation marks, or other changes in layout.

This chapter begins by describing the p tag used to mark paragraphs, the prototypical formal unit for running text in many TEI modules. This is followed, in section 3.2 Treatment of Punctuation, by a discussion of some specific problems associated with the interpretation of conventional punctuation, and the methods proposed by the Guidelines for resolving ambiguities therein.

The next section (section 3.3 Highlighting and Quotation) describes a number of phrase-level elements commonly marked by typographic features (and thus well-represented in conventional markup languages). These include features commonly marked by font shifts (section 3.3.2 Emphasis, Foreign Words, and Unusual Language) and features commonly marked by quotation marks (section 3.3.3 Quotation) as well as such features as terms, cited words, and glosses (section 3.3.4 Terms, Glosses, Equivalents, and Descriptions).

Section 3.4 Simple Editorial Changes introduces some phrase-level elements which may be used to record simple editorial interventions, such as emendation or correction of the encoded text. The elements described here constitute a simple subset of the full mechanisms for encoding such information (described in full in chapter 11 Representation of Primary Sources), which should be adequate to most commonly encountered situations.

The next section (section 3.5 Names, Numbers, Dates, Abbreviations, and Addresses) describes several phrase-level and inter-level elements which, although often of interest for analysis or processing, are rarely explicitly identified in conventional printing. These include names (section 3.5.1 Referring Strings), numbers and measures (section 3.5.3 Numbers and Measures), dates and times (section 3.5.4 Dates and Times), abbreviations (section 3.5.5 Abbreviations and Their Expansions), and addresses (section 3.5.2 Addresses).

In the same way, the following section (section 3.6 Simple Links and Cross-References) presents only a subset of the facilities available for the encoding of cross-references or text-linkage. The full story may be found in chapter 16 Linking, Segmentation, and Alignment; the tags presented here are intended to be usable for a wide variety of simple applications.

Sections 3.7 Lists, and 3.8 Notes, Annotation, and Indexing, describe two kinds of quasi-structural elements: lists and notes. These may appear either within chunk-level elements such as paragraphs, or between them. Several kinds of lists are catered for, of an arbitrary complexity. The section on notes discusses both notes found in the source and simple mechanisms for adding annotations of an interpretive nature during the encoding; again, only a subset of the facilities described in full elsewhere (specifically, in chapter 17 Simple Analytic Mechanisms) is discussed.

Section 3.9 Graphics and Other Non-textual Components introduces some simple ways of representing graphic or other non-textual content found in a text. A fuller discussion of the multimedia facilities supported by these Guidelines may be found in chapters 14 Tables, Formulæ, Graphics and Notated Music and 16 Linking, Segmentation, and Alignment.

Next, section 3.10 Reference Systems, describes methods of encoding within a text the conventional system or systems used when making references to the text. Some reference systems have attained canonical authority and must be recorded to make the text useable in normal work; in other cases, a convenient reference system must be created by the creator or analyst of an electronic text.

Like lists and notes, the bibliographic citations discussed in section 3.11 Bibliographic Citations and References, may be regarded as structural elements in their own right. A range of possibilities is presented for the encoding of bibliographic citations or references, which may be treated as simple phrases within a running text, or as highly-structured components suitable for inclusion in a bibliographic database.

Additional elements for the encoding of passages of verse or drama (whether prose or verse) are discussed in section 3.12 Passages of Verse or Drama.

The chapter concludes with a technical overview of the structure and organization of the module described here. This should be read in conjunction with chapter 1 The TEI Infrastructure, describing the structure of the TEI document type definition.

TEI: Paragraphs¶3.1 Paragraphs

The paragraph is the fundamental organizational unit for all prose texts, being the smallest regular unit into which prose can be divided. Prose can appear in all TEI texts, even those that are primarily of another genre (e.g., verse); thus the paragraph is described here, as an element which can appear in any kind of text.

Paragraphs can contain any of the other elements described within this chapter, as well as some other elements which are specific to individual text types. We distinguish phrase-level elements, which must be entirely contained within a paragraph and cannot appear except within one, from chunks, which can appear between, but not within, paragraphs, and from inter-level elements, which can appear either within a single paragraph or between paragraphs. The class of phrases includes emphasized or quoted phrases, names, dates, etc. The class of inter-level elements includes bibliographic citations, notes, lists, etc. The class of chunks includes the paragraph itself, and other elements which have similar structural properties, notably the ab (anonymous block) element described in 16.3 Blocks, Segments, and Anchors) which may be used as an alternative to the paragraph in some kinds of texts.

Because paragraphs may appear in different base or additional tag sets, their possible contents may differ in different kinds of documents. In particular, additional elements not listed in this chapter may appear in paragraphs in certain kinds of text. However, the elements described in this chapter are always by default available in all kinds of text.

The paragraph is marked using the p element:

p (문단) 산문에서 문단을 표시한다.

If a consistent internal subdivision of paragraphs is desired, the s or seg (‘segment’) elements may be used, as discussed in chapters 16 Linking, Segmentation, and Alignment and 17 Simple Analytic Mechanisms respectively. More usually, however, paragraphs have no firm internal structure, but contain prose encoded as a mix of characters, entity references, phrases marked as described in the rest of this chapter, and embedded elements like lists, figures, or tables.

Since paragraphs are usually explicitly marked in Western texts, typically by indentation, the application of the p tag usually presents few problems.

In some cases, the body of a text may comprise but a single paragraph:

<body>
I fully appreciate Gen. Pope's splendid achievements with their
invaluable results; but you must know that Major Generalships in the
Regular Army, are not as plenty as blackberries.
</body>

direct	인용부가 직접 대화로 또는 간접 대화로 간주할 수 있는지를 표시할 때 사용할 수 있다.
aloud	인용부가 발화된 것으로 또는 신호된 것으로 볼 수 있는지를 표시할 때 사용할 수 있다.

uri	(표준 자원 확인소(URL)) 부모가 외부 확인소를 통해서 표상하는 기저 개념을 지시한다.
filter	이 요소의 실례를 표준 TEI로 변환하는 방법을 포함하는 외부 스크립트를 참조한다.
name	부모가 표상하는 기저 개념에 대한 이름을 부여한다.

cert	(확실성) 간섭 또는 해석과 연관된 확실성의 정도를 나타낸다.
resp	(책임 당사자) 편집자 또는 전사자와 같이 또는 해석에 대한 책임이 있는 대리인을 나타낸다.

unit	측정 단위의 이름을 기술한다. 제안값은 다음을 포함한다: 1] cm(centimetres) ; 2] mm(millimetres) ; 3] in(inches) ; 4] lines; 5] chars(characters)
quantity	명시된 단위의 길이를 명시한다.
extent	indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words.
precision	characterizes the precision of the values specified by the other attributes.
scope	측정의 적용가능성을 명시하며, 하나 이상의 대상이 측정된다. 샘플 값은 다음을 포함한다 Sample values include: 1] all; 2] most; 3] range

type	다양한 분류 스키마 또는 유형을 사용해서 요소의 특성을 기술한다.
subtype	필요하다면 요소의 하위범주를 제시한다.

key	provides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind.
ref	(reference) provides an explicit means of locating a full definition or identity for the entity being named by means of one or more URIs.

model.nameLike.agent	개인 또는 기업체의 이름을 포함하는 요소를 모아 놓는다.
model.offsetLike	장소명의 부분으로서만 나타날 수 있는 요소를 모아 놓는다.
model.persNamePart	사람 이름의 부분을 형성하는 요소를 모아놓는다.
model.placeStateLike	장소의 변화하는 상태를 기술하는 요소를 모아 놓는다.

idno	(식별 숫자) 서지 정보 항목을 식별하기 위해 사용되는 표준 또는 비표준 숫자를 제시한다.
lang	(언어명) 어원적 또는 기타 언어적 논의에서 언급된 언어의 이름
rs	(referencing string) contains a general purpose name or referring string.

addName	(부가명) 별명, 통명, 가명, 또는 개인 이름 내에서 사용되는 다른 기술적 구와 같이 부가적 이름 성분을 포함한다.
forename	이름 또는 세례명을 포함한다.
genName	(세대명 성분) 개인의 상대적 나이 또는 세대에 기반하여 유사 이름을 다른 방식으로 구분하는 이름 성분을 포함한다.
nameLink	(name link) van der 또는 of와 같이 이름의 부분으로 간주되지 않는 이름 내의 연결 구 또는 연결을 포함한다.
roleName	공식적 직함 또는 서열과 같이 사회에서 특별한 역할 또는 지위를 나타내는 이름 성분을 포함한다.
surname	이름, 세례명, 또는 별명에 반대되는 것으로 (물려받은) 성을 포함한다.

bloc	둘 이상의 민족국가 또는 국가로 구성된 지리-정치적 단위의 이름을 포함한다.
country	하나의 블록보다 큰 국가, 지역, 식민지, 또는 공화국, 또는 하나의 블록보다 작은 지역의 상급 행정기관과 같은, 지리-정치 단위명을 포함한다.
district	교구, 구 또는 다른 행정 지리적 단위와 같이 거주지의 하위 구분명을 포함한다.
geogName	(지리명) 윈드러시 계곡 또는 시나이 산과 같이 지리적 특성과 관련된 이름
placeName	절대적 또는 상대적 위치명을 포함한다.
region	도보다는 작고 정착지보다는 큰 주, 성, 도와 같은 행정단위명을 포함한다.
settlement	하나의 지리-정치 또는 행정 단위로 식별되는 시, 읍, 마을과 같이 거주지명을 포함한다.

type	수치의 유형을 나타낸다. 제안값은 다음을 포함한다: 1] cardinal; 2] ordinal; 3] fraction; 4] percentage
value	표준형의 숫자 값을 제시한다.

atLeast	gives a minimum estimated value for the approximate measurement.
atMost	gives a maximum estimated value for the approximate measurement.

quantity	측정을 구성하는 명시적 단위의 수를 명시한다.
unit	측정에 사용된 단위를 나타내며, 일반적으로 요구 단위에 대한 표준 기호를 사용한다. 제안값은 다음을 포함한다: 1] m(metre) ; 2] kg(kilogram) ; 3] s(second) ; 4] Hz(hertz) ; 5] Pa(pascal) ; 6] Ω(ohm) ; 7] L(litre) ; 8] t(tonne) ; 9] ha(hectare) ; 10] Å(ångström) ; 11] mL(millilitre) ; 12] cm(centimetre) ; 13] dB(decibel) ; 14] kbit(kilobit) ; 15] Kibit(kibibit) ; 16] kB(kilobyte) ; 17] KiB(kibibyte) ; 18] MB(megabyte) ; 19] MiB(mebibyte)
commodity	측정되고 있는 물질을 나타낸다.

target	하나 혹은 다수의 URI 참조를 제시하여 참조의 목적지를 명시한다.
evaluate	포인터의 대상이 포인터일 때 의도된 의미를 명시한다.

width	Where the media are displayed, indicates the display width
height	Where the media are displayed, indicates the display height
scale	Where the media are displayed, indicates a scale factor to be applied when generating the desired display size

bibl	(서지 인용) 하위 성분이 명시적으로 구분된 또는 그렇지 않은 덜 구조화된 서지 인용을 포함한다.
biblFull	(완전히 구조화된 서지 인용 정보) 완전히 구조화된 서지 정보를 포함하며, 그 안에 TEI 파일 기술의 모든 성분이 제시된다.
biblStruct	(구조화된 서지 인용) 서지의 하위 요소만이 나타나는, 명시적 순서로 구성되는 구조화된 서지 인용을 포함한다.
listBibl	(인용 목록) 여러 종류의 서지 인용 목록을 포함한다.
msDesc	(원고 기술) 하나의 식별가능한 원고에 대한 기술을 포함한다.

biblScope	(인용 범위) 예를 들어 페이지수의 목록 또는 작품의 이름 붙은 하위 성분으로, 문헌 참조의 범위를 정의한다.
distributor	텍스트 배포 권한을 갖는 개인 또는 기관의 이름을 제시한다.
publisher	서지 항목의 출판이나 배포에 책임이 있는 기구명을 제시한다.
pubPlace	(출판지) 서지 대상이 출판된 장소명을 포함한다.

date	다양한 형식의 날짜를 포함한다.
time	어떤 형식의, 하루의 시간을 정의하는 구를 포함한다.

mainLang	(주요 언어) 원고에 사용된 주요 언어를 식별하는 부호를 제공한다.
otherLangs	(다른 언어) 원고에 사용된 다른 언어를 식별하는 하나 이상의 부호

P5: 전자 텍스트 부호화 및 교환에 대한 지침

3 Elements Available in All TEI Documents

TEI: Paragraphs¶3.1 Paragraphs

TEI: Treatment of Punctuation¶3.2 Treatment of Punctuation

TEI: Functions of Punctuation¶3.2.1 Functions of Punctuation

TEI: Hyphenation¶3.2.2 Hyphenation

TEI: Highlighting and Quotation¶3.3 Highlighting and Quotation

TEI: What Is Highlighting?¶3.3.1 What Is Highlighting?

TEI: Emphasis, Foreign Words, and Unusual Language¶3.3.2 Emphasis, Foreign Words, and Unusual Language

TEI: Foreign Words or Expressions¶3.3.2.1 Foreign Words or Expressions

TEI: Emphatic Words and Phrases¶3.3.2.2 Emphatic Words and Phrases

TEI: Other Linguistically Distinct Material¶3.3.2.3 Other Linguistically Distinct Material

TEI: Quotation¶3.3.3 Quotation

TEI: Terms, Glosses, Equivalents, and Descriptions¶3.3.4 Terms, Glosses, Equivalents, and Descriptions

TEI: Some Further Examples¶3.3.5 Some Further Examples

TEI: Simple Editorial Changes¶3.4 Simple Editorial Changes

TEI: Apparent Errors¶3.4.1 Apparent Errors

TEI: Regularization and Normalization¶3.4.2 Regularization and Normalization

TEI: Additions, Deletions, and Omissions¶3.4.3 Additions, Deletions, and Omissions

TEI: Names, Numbers, Dates, Abbreviations, and Addresses¶3.5 Names, Numbers, Dates, Abbreviations, and Addresses

TEI: Referring Strings¶3.5.1 Referring Strings

TEI: Addresses¶3.5.2 Addresses

TEI: Numbers and Measures¶3.5.3 Numbers and Measures

TEI: Dates and Times¶3.5.4 Dates and Times

TEI: Abbreviations and Their Expansions¶3.5.5 Abbreviations and Their Expansions

TEI: Simple Links and Cross-References¶3.6 Simple Links and Cross-References

TEI: Lists¶3.7 Lists

TEI: Notes, Annotation, and Indexing¶3.8 Notes, Annotation, and Indexing

TEI: Notes and Simple Annotation¶3.8.1 Notes and Simple Annotation

TEI: Index Entries¶3.8.2 Index Entries

TEI: Pre-existing Indexes¶3.8.2.1 Pre-existing Indexes

TEI: Auto-generated Indexes¶3.8.2.2 Auto-generated Indexes

TEI: Graphics and Other Non-textual Components¶3.9 Graphics and Other Non-textual Components

TEI: Reference Systems¶3.10 Reference Systems

TEI: Using the xml:id and n Attributes¶3.10.1 Using the xml:id and n Attributes

TEI: Creating New Reference Systems¶3.10.2 Creating New Reference Systems

TEI: Referencing system derived from markup¶3.10.2.1 Referencing system derived from markup

TEI: Referencing systems based on project conventions¶3.10.2.2 Referencing systems based on project conventions

TEI: Milestone Elements¶3.10.3 Milestone Elements

TEI: Declaring Reference Systems¶3.10.4 Declaring Reference Systems

TEI: Bibliographic Citations and References¶3.11 Bibliographic Citations and References

TEI: Methods of Encoding Bibliographic References and Lists of References¶3.11.1 Methods of Encoding Bibliographic References and Lists of References

TEI: Components of Bibliographic References¶3.11.2 Components of Bibliographic References

TEI: Analytic, Monographic, and Series Levels¶3.11.2.1 Analytic, Monographic, and Series Levels

TEI: Titles, Authors, and Editors¶3.11.2.2 Titles, Authors, and Editors

TEI: Document Identifiers¶3.11.2.3 Document Identifiers

TEI: Imprint, Size of a Document, and Reprint Information¶3.11.2.4 Imprint, Size of a Document, and Reprint Information

TEI: Scopes and Ranges in Bibliographic Citations¶3.11.2.5 Scopes and Ranges in Bibliographic Citations

TEI: Series Information¶3.11.2.6 Series Information

TEI: Related Items¶3.11.2.7 Related Items

TEI: Notes and Statement of Language¶3.11.2.8 Notes and Statement of Language

TEI: Order of Components within References¶3.11.2.9 Order of Components within References

TEI: Bibliographic Pointers ¶3.11.3 Bibliographic Pointers

TEI: Relationship to Other Bibliographic Schemes¶3.11.4 Relationship to Other Bibliographic Schemes

TEI: Passages of Verse or Drama¶3.12 Passages of Verse or Drama

TEI: Core Tags for Verse¶3.12.1 Core Tags for Verse

TEI: Core Tags for Drama¶3.12.2 Core Tags for Drama

TEI: Overview of the Core Module ¶3.13 Overview of the Core Module