talk 2: default text structure; common elements - tei @...

56
Default Text Structure The ’Core’ elements Talk 2: Default Text Structure; Common Elements James Cummings 19 September 2013 @jamescummings 1/56

Upload: doanminh

Post on 28-Mar-2018

218 views

Category:

Documents


5 download

TRANSCRIPT

Page 1: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Talk 2: Default Text Structure; CommonElements

James Cummings

19 September 2013

@jamescummings 1/56

Page 2: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Default Text Structure

All TEI documents are structured in a particular manner. Thissection attempts to describe the different variations on this asbriefly as possible.

@jamescummings 2/56

Page 3: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Structure of a TEI Document

There are two basic structures of a TEI Document:<TEI> (に準拠する文書) contains a single TEI-conformant document,comprising a TEI header and a text, either in isolation or as part ofa teiCorpus element.<teiCorpus> (準拠のコーパス全体を示す. ヘダーが1つと,ひとつ以上の要素TEIから 成る.

各要素TEIには,テキストヘダーと要素textが1つある) contains the whole of a TEIencoded corpus, comprising a single corpus header and one or moreTEI elements, each containing a single text header and a text.

@jamescummings 3/56

Page 4: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

TEI basic structures (1)

.

......

<teiCorpus xmlns="http://www.tei-c.org/ns/1.0"><teiHeader><!-- required --></teiHeader><TEI><!-- required --></TEI><!-- More <TEI> elements --></teiCorpus>

@jamescummings 4/56

Page 5: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

TEI basic structures (2)

.

......

<TEI xmlns="http://www.tei-c.org/ns/1.0"><teiHeader><!-- required --></teiHeader><facsimile><!-- optional--></facsimile><sourceDoc><!-- optional --></sourceDoc><text><!-- required if no facsimile or sourceDoc--></text></TEI>

@jamescummings 5/56

Page 6: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

<text>

What is a text?A text may be unitary or composite

unitary: forming an organic wholecomposite: consisting of several components which are in someimportant sense independent of each other

a unitary text containsoptional front matter<body> (required)optional back matter

@jamescummings 6/56

Page 7: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Composite texts

A composite text containsoptional front matter<group> with <text> inside (required)optional back matter

A corpus is a collection of text and header pairs that also has itsown header.<group> tags may self-nest.

@jamescummings 7/56

Page 8: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

TEI text structure (1)

.

......

<text><front><!-- optional --></front><body><!-- required --></body><back><!-- optional --></back></text>

@jamescummings 8/56

Page 9: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

TEI text structure (2)

.

......

<text><front><!-- ... --></front><group><text><body><p>...</p></body></text></group><back><!-- ... --></back></text>

@jamescummings 9/56

Page 10: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Another Grouped Text Example.

......

<TEI xmlns="http://www.tei-c.org/ns/1.0"><teiHeader><!-- header information for the whole collection --></teiHeader><text><!-- optional front matter --><group><text>

<!-- optional front matter --><body>

<!-- First Body --></body></text><text>

<!-- optional front matter --><body>

<!-- Second Body--></body></text></group></text></TEI>

@jamescummings 10/56

Page 11: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Examples from WW1 Poetry Digital Archive

Many of our examples come from the First World War PoetryDigital Archive: http://www.oucs.ox.ac.uk/ww1lit/. and related’Great War Archive’. In specific we will be looking at thecorrespondence and poetry of Wilfred Owen and related materials.

@jamescummings 11/56

Page 12: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Postcard Front

@jamescummings 12/56

Page 13: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Postcard Front

@jamescummings 13/56

Page 14: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Letter 1917-01-10 (page 1)

@jamescummings 14/56

Page 15: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Letter 1917-01-10 (page 2)

@jamescummings 15/56

Page 16: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Letter 1917-01-10 (page 5)

@jamescummings 16/56

Page 17: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Preface MS

@jamescummings 17/56

Page 18: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Preface Edited

@jamescummings 18/56

Page 19: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Red Cross or Iron Cross?

@jamescummings 19/56

Page 20: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

The Kitchen is the Key to Victory

@jamescummings 20/56

Page 21: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Basic units of markup?

Identification information: titles, page numbers, sources,references”chunks” or divisions of text, which may contain a picture, apoem, some prose, or a combinationwithin the chunks, we can identify formal units such as

a picturesstanzas, linesparagraphs

graphical metadata: layout, zones, descriptions of figuresphrase level information such as names and dateand more...

@jamescummings 21/56

Page 22: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Why divisions rather than pages?It is often more powerful to mark the intellectual divisions ratherthan the physical ones. A division can start on one page and finishon another, or cross other physical boundariesWe use an empty element <pb/> to mark the boundary where apage begins, rather than enclosing each page in a <divtype="page">..

......

<pb n="5"/><div type="prose"><p>...</p></div><div type="verse"><head>Strange Meeting</head><lg> ...<pb n="6"/>...

</lg></div><div type="prose"><p>...</p></div>

@jamescummings 22/56

Page 23: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Divisions can contain divisions....

......

<div type="postcard"><div type="postmark"><div type="advert"><ab>BUY NATIONAL<lb/>WAR BONDS</ab></div><div type="dateStamp"><dateline><placeName>SCARBOROUGH</placeName><lb/><time>6.30 PM</time><lb/></dateline></div><div type="advert"><ab>BUY NATIONAL<lb/>WAR BONDS</ab></div></div><div type="address"><!-- <address> here --></div><div type="prose"><!-- text here --></div></div>

@jamescummings 23/56

Page 24: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

More about divisions

<div> (前付,本文,後付中のテキスト部分を示す)

generic, hierarchic subdivisions, each incomplete as a text as a wholethe @type attribute is used to label a particular level e.g. as ’part’ or’chapter’the @n attribute gives a particular division a name or numberthe @xml:id attribute gives a particular division a unique identifierDivisions must always tessellate: once ”down” a level, you cannotpop ”up” again within the same division.

@jamescummings 24/56

Page 25: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Divisions may have heads and trailers

.

......

<div><head>Preface</head><p><!-- content of the div --></p><trailer>...</trailer></div>

@jamescummings 25/56

Page 26: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Numbered and unnumbered divisions

The level can be made explicit by using ’numbered’ divs (div1,div2). Opinions vary:<div1> vs. <div n=”1”>

numbered: the number indicates the depth of this particulardivision within the hierarchy, the largest such division being‘div1’, any subdivision within it being ‘div2’, etc.unnumbered: nest recursively to indicate their hierarchicdepth. (And computers can count very well!)

The two styles must not be combined within a single <front>,<body>, or <back> element.N.B. Divisions always tessellate

@jamescummings 26/56

Page 27: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Whence <floatingText>?<div>s must tesselate over the entire text.

......

<div1><div2><!-- content --></div2><div2><!-- content --></div2></div1>

is valid, while.

......

<div1><!-- content --><div2><!-- content --></div2><!-- content --></div1>

is not valid.

@jamescummings 27/56

Page 28: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

<floatingText> Example.

......

<p>She was thus ruminating, when a Gentleman enter'd theRoom, the Door being a jar... calling for a Candle, shebeg'd a thousand Pardons, engaged him to sit down, and lether know, what had so long conceal'd him from herCorrespondence. </p><pb n="5"/><floatingText><body><head>The Story of <hi>Captain Manly</hi></head><p>

<!-- Captain Manly's store here --></p></body></floatingText><pb n="37"/><p>The Gentleman having finish'd his Story ...<!-- more --></p>

@jamescummings 28/56

Page 29: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Virtual divisionsWhere the whole of a division can be automatically generated, forexample because it is derived from another part of this or anotherdocument, an encoder may prefer not to represent it explicitly butinstead simply mark its location by means of a processinginstruction, or by using the special purpose <divGen> element:.

......

<front><divGen type="toc"/><div><head>Preface</head><p>...</p></div></front>

(intended primarily for use in document production ormanipulation, rather than in transcription of pre-existing material)(ソフトウェアで自動生成されたテキスト部分の場所を示す)

@jamescummings 29/56

Page 30: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Document order vs XML OrderThe order of XML encoding does not necessarily reflect the orderof the source document. Compare:.

......

<div type="postcard"><div type="address"><!-- <address> here --></div><div type="prose"><!-- text here --></div><div type="postmark"><div type="advert"><ab>BUY NATIONAL<lb/>WAR BONDS</ab></div><div type="dateStamp"><dateline><placeName>SCARBOROUGH</placeName><lb/><time>6.30 PM</time><lb/></dateline></div><div type="advert"><ab>BUY NATIONAL<lb/>WAR BONDS</ab></div></div></div>

with the version we saw earlier@jamescummings 30/56

Page 31: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

The ’Core’ elements

The so-called ’Core’ module groups together elements which oftenappear in almost any kind of text For example:

paragraphshighlighting, emphasis and quotationsimple editorial changesbasic names numbers, dates, addressessimple links and cross-referenceslists, notes, annotation, indexinggraphicsreference systems, bibliographic citationssimple verse and drama

@jamescummings 31/56

Page 32: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Paragraphs

<p> (散文の段落を示す) marks paragraphs in prose

Fundamental unit for prose texts<p> can contain all the phrase-level elements in the core<p> can appear directly inside <body> or inside <div> (divisions)

.

......

<p>Thanks for yours of this morning. I hope <lb/>you havehad my card posted last Monday. <lb/>On Mond. next I lecturethe <orgName ref="#Fieldclub">Field Club</orgName> - <lb/>aNat. Hist. Association, in the lines of our <lb/>old Society- Geological, (you + me) + Botanical <lb/>(New) Do youremember: you<supplied>r</supplied> old <lb/>Black Molt?</p>

@jamescummings 32/56

Page 33: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

HighlightingBy highlighting we mean the use of any combination oftypographic features (font, size, hue, etc.) in a printed or writtentext in order to distinguish some passage of a text from itssurroundings. For words and phrases which are:

distinct in some way (e.g. foreign, archaic, technical)emphatic or stressed when spokennot really part of the text (e.g. cross references, titles,headings)a distinct narrative stream (e.g. an internal monologue,commentary)attributed to some other agency inside or outside the text(e.g. direct speech, quotation)set apart in another way (e.g. proverbial phrases, wordsmentioned but not used)

@jamescummings 33/56

Page 34: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Highlighting Examples

<hi> (周囲のテキストとは見た目が異なっている語句を示す); <distinct>(言語上,異なる語句を示す. 例えば,古語,技術語,方言,忌諱語など.

また,特定グループでしか通用しない特殊言語など.).

......

<p>Last week I wrote (to order) a strong <lb/>bit of Blank:on <hi rend="ul">Antaeus v.

Heracles</hi>. <lb/>These are the best lines, methinks:<lb/>(N.B. Antaeus derivingstrength from his Mother Earth <lb/>nearly licked old<distinct>Herk</distinct>.) </p>

Other similar elements include: <emph>, <mentioned>,<soCalled>, <term> and <gloss>

@jamescummings 34/56

Page 35: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

QuotationQuotation marks can be used to set off text for many reasons, sothe TEI has the following elements:

<q> (周りのテキストとは(表面上)異なっているようにマークアップされている部分を示す)<said> (考えたり声に出されたりした一節を示す)<quote> (語り手や著者が,当該テキスト外にあるものに向けた,一節を示す)<cit> (書誌参照を伴い,他の文書からの引用を示す)

.

......

<cit><quote><l>... How Earth herself empowered him with her trick,</l><l>Gave him the grip and stringency of Winter,</l><l>And all the ardour of th' invincible Spring;</l></quote><bibl><author>Wilfred Owen</author><title>Letter to Leslie Gunston / The Wrestler</title><date when="1917-07"/></bibl></cit>

@jamescummings 35/56

Page 36: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Simple Editorial Changes: <choice> and Friends

<choice> (テキスト中の同じ場所で,異なる符号化記述をまとめる)Errors:

<sic> (明らかな間違い,不正確ではあるが,そのまま収録してあるテキスト)<corr> (元資料中の明らかな間違いを正したものを示す)

Regularization:<orig> (正規化または校正は施されていない,元の形のまま符号化されている読みを示す)<reg> (正規化された読みを示す)

Abbreviation:<abbr> (名称の省略)<expan> (省略形の元の表現を示す)

@jamescummings 36/56

Page 37: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Choice Example

.

......

<p>...any might,<unclear reason="scribbled">majesty</unclear>,<choice><abbr>domin<am/></abbr><expan>domin<ex>ion</ex></expan></choice> or power...</p>

@jamescummings 37/56

Page 38: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Additions, Deletions, and Omissions

<add> (著者,筆写者,注釈者,校正者による,文字,単語,句レベルでのテキスト挿入を示す)<del> (著者・筆写者・注釈者・校正者により,削除または削除として符号化または余分なものまたは間違いとして示されている,

文字,単語,句を示す)<gap> (ヘダーにある編集上の理由, または当該資料が判読できない・ 聞こえないことを理由に,

転記の際に省略された部分の場所を示す)<unclear> (元資料からは判読できないまたは聞こえないという理由で,確実に転記できない語句や一節を示す)

@jamescummings 38/56

Page 39: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Example of <add>, <del>, and <unclear>

.

......

<p><add place="left">My </add><del rend="stroked">It's </del><add place="above"><del rend="stroked">The </del></add> subject <del rend="stroked">of</del> is War, and the<unclear>pity </unclear>of <del rend="stroked">it</del> War.<lb/> The Poetry is in the pity.</p>

@jamescummings 39/56

Page 40: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Basic Names

<name> (固有名詞)(a name in the text, contains a proper noun ornoun phrase)<rs> (一般的な意味での名前や参照文字列) (a general-purpose name orreferencing string)

The @type attribute is useful for categorizing these, and they bothalso have @key, @ref, and @nymRef attributes..

......More interesting name-related elements can be had by includingthe namesdates module.

@jamescummings 40/56

Page 41: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Addresses

<email> (eメールを届けるeメールアドレスを示す)<address> (郵便配達情報を示す)<addrLine> (住所情報を記述する行を示す)<street> (住所情報としての,通りを表す完全情報を示す. 建物の名前や番号,通りの名前など)<postCode>(郵便の配達や区分けを簡単にするための,郵便の宛名情報の部分となる数値または文字を含む)<postBox> (郵便配達で識別子となる,通り名以外の,数値などを示す)<name> can also be usedand the ’namesdates’ module extends this with more geographicnames

@jamescummings 41/56

Page 42: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Basic Address Example

WW1 Poetry Archive Project:.......<email>[email protected]</email>

Shell-shock hospital ’Craiglockhart’ that Wilfred Owen stayed in:.

......

<address><street>14 Frederick Street</street><postCode>EH2 2HB</postCode><settlement>Edinburgh</settlement><country>United Kingdom</country></address>

@jamescummings 42/56

Page 43: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Basic Numbers and Measures

<num> (各種形式による数値を示す)<measure> (ある対象や商品の大きさを表す語句を示す.一般には,数値,単位,商品名を含む)<measureGrp> (大きさに関する規格を示す.例えば,手書き資料のページの高さや幅などを示すためのもの)While <num> has simple @type and @value attributes, <measure>has @type, @quantity, @unit and @commodity attributes

@jamescummings 43/56

Page 44: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Number and Measure examples

.

......<l>With a <num value="1000">thousand</num> pains thatvision's face was grained;</l>

.

......

... only<measure type="distance" unit="m" quantity="3218.69">twomiles</measure> from the front....

@jamescummings 44/56

Page 45: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Dates

<date> (日付を示す) (contains a date in any format and includes a@when attribute for a regularised form and a @calendar attribute tospecify what calendar system)<time> (時間を表す語句を示す) (contains a time in any format andincludes a @when attribute for a regularised form)

.

......<date when="1917-07">July 1917.<lb/> Wednesday</date>

@jamescummings 45/56

Page 46: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Simple Linking

<ptr> (他の場所を示すポインターを定義する)<ref> (他の場所への参照を定義する. 多くは,追加テキストまたはコメントを含む)Both elements have a @target attribute taking a URI referenceIf the linking text is able to be generated, <ptr> and <ref> mightbe used in the same place.

@jamescummings 46/56

Page 47: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Simple Linking Example

.

......See <ref target="#Section12">section 12 on page 34</ref>.

.

......See <ptr target="#Section12"/>.

@jamescummings 47/56

Page 48: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Lists

<list> (リストのような,項目列を示す)<item> (リストの一項目を示す)<label> (リスト中の項目に関連するラベルを示す. 用語集においては,定義される用語を示す)<headLabel>(リストラベルの表題,リストなどにおけるラベルや,用語集などにおける語彙を示す)<headItem> (用語集などのリスト構造における各項目の見出しを示す)

@jamescummings 48/56

Page 49: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Simple List Example

The previous slide contained only:.

......

<div><head>Lists</head><list><item><gi>list</gi> (リストのような, 項目列を示す)</item><item><gi>item</gi> (リストの一項目を示す)</item><item><gi>label</gi> (リスト中の項目に関連するラベルを示す.

用語集においては,定義される用語を示す)</item><item><gi>headLabel</gi>

(リストラベルの表題,リストなどにおけるラベルや,用語集などにおける語彙を示す)</item><item><gi>headItem</gi> (用語集などのリスト構造における各項目の見出しを示す)</item>

</list></div>

@jamescummings 49/56

Page 50: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Notes

<note> (注釈・コメント)Notes can be those existing in the text, or provided by the editor ofthe electronic textA @place attribute can be used to indicate the physical location ofthe noteNotes should usually be encoded where its identifier/mark firstappears; notes can also be kept separately and point back to theirlocation with a @target attribute

.

......<note>Painted by <persName>John Singer Sargent</persName>,1918</note>

@jamescummings 50/56

Page 51: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Indexing

If converting an existing index, use nested lists. Forauto-generated indexes:<index> (索引項目化されたものの場所を示す) with optional @indexNameattributeThe <term> element (技術用語とされる単一語,複数語,記号表示を示す)is used tomark a term inside an <index> elementThe <index> element can self-nest for hierarchical index entries

@jamescummings 51/56

Page 52: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Indexing Example

.

......

<p>Last week I wrote (to order) a strong <lb/>bit ofBlank<index><term>Verse</term><index><term>Blank Verse</term></index></index>:</p>

@jamescummings 52/56

Page 53: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Graphics

<graphic> (テキスト列中にある図,絵,図表の場所を示す)<binaryObject>(行中の画像やその他のオブジェクトを示す,符号化されたバイナリデータを示す)The figure module provides <figure> and <figDesc> for morecomplex graphics

.

......

<figure><graphic url="images/postcard-front.jpg"/><figDesc>A postcard image of two men relaxing at a table,smoking pipes and drinking. A dog and potted fruit tree arenearby with a house over the wall in the distance.</figDesc></figure>

@jamescummings 53/56

Page 54: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Simple Verse

.

......

<lg type="stanza"><l>It seemed that out of battle I escaped</l><l>Down some profound dull tunnel, long since scooped</l><l>Through granites which titanic wars had groined.</l></lg>

@jamescummings 54/56

Page 55: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Simple Drama

.

......

<sp><speaker>The reverend Doctor Opimiam</speaker><p>I do not think I have named a single unpresentablefish.</p></sp><sp><speaker>Mr Gryll</speaker><p>Bream, Doctor: there is not much to be said forbream.</p></sp>

@jamescummings 55/56

Page 56: Talk 2: Default Text Structure; Common Elements - TEI @ …tei.oucs.ox.ac.uk/Talks/2013-09-japan/talk2.pdf · correspondence and poetry of Wilfred Owen and related materials

Default Text Structure The ’Core’ elements

Next

We’re going to do the second exercise (and then have a lunchbreak).

@jamescummings 56/56