Talk:PubMan Func Spec Submission/Generic TEI 2 PubItem Mapping

Mapping of TEI Fields

 * Author/Editor:
 * Add author.email to address (?)
 * meeting.date
 * Map to: Event.StartDate, Event.EndDate, Here we should provide an example to know if we can distinguish between start and enddate

StartPage/EndPage
381-417  12 3-46


 * First check if attribute 'from', 'to' is provided
 * If two numbers are separated by '-' the first number is mapped to StartPage and the second to EndPage, else the whole String is mapped to StartPage

Extend
78 p. -mapped 19 pp. -mapped 3200 sentences -not mapped between 10 and 20 Mb -not mapped

Author
Paine, Thomas (1737-1809) La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)
 * All info within round brackets in an author element should be parsed out. This should apply to all person names (editor, contributor etc.)

If element persName does exist, apply the following mapping: Element author is an example for all roles which might appear in this mapping (editor, contributor etc.)

Address
Oxford University Computing Services, 13 Banbury Road, Oxford OX2 6RB, UK

110 Southmoor Road Oxford OX2 6RB</postCode> United Kingdom

Mapping of address: All sub elements are concatenated with comma and blank and are mapped to the corresponding pubItem address field.

Dates
If dates of the same type are modelled twice, the most recent date will be mapped.

Note: This value can be found in the date type attribute or in the value of the date or change element <change when="2009-01-16">Revised <change when="2009-01-28">Accepted <date type="publication" when="2008-11-18"/> <date when="2008-11-08">Online <date type="Accepted" when="2009-02-17"/>

General
(http://www.tei-c.org/release/doc/tei-p5-doc/en/html/HD.html#HD7) <teiHeader> <fileDesc> <titleStmt> Common sense, a machine-readable transcript Paine, Thomas (1737-1809) <respStmt> compiled by    Jon K Adams </respStmt> </titleStmt> <editionStmt> 1986  </editionStmt> <publicationStmt> Oxford Text Archive. Oxford University Computing Services,</addrLine> 13 Banbury Road,</addrLine> Oxford OX2 6RB,</addrLine> UK</addrLine> </publicationStmt> <notesStmt> Brief notes on the text are in a      supplementary file. </notesStmt> <sourceDesc> <biblStruct> Foner, Philip S.     The collected writings of Thomas Paine <pubPlace>New York</pubPlace> Citadel Press 1945   </biblStruct> </sourceDesc> </fileDesc> <encodingDesc> <samplingDecl> Editorial notes in the Foner edition have not been reproduced. Blank lines and multiple blank spaces, including paragraph indents, have not been preserved. </samplingDecl> <editorialDecl> <correction status="high" method="silent"> The following errors in the Foner edition have been corrected: p. 13 l. 7 cotemporaries contemporaries p. 28 l. 26 [comma] [period] p. 84 l. 4 kin kind p. 95 l. 1 stuggle struggle p. 101 l. 4 certainy certainty p. 167 l. 6 than that p. 209 l. 24 publshed published No normalization beyond that performed by Foner, if any. <quotation marks="all" form="std"> All double quotation marks rendered with ", all single quotation marks with        apostrophe.     Hyphenated words that appear at the         end of the line in the Foner edition have been reformed.    <stdVals>    The values of when-iso on the <gi>time</gi>         element always end in the format HH:MM or    HH ; i.e., seconds, fractions thereof, and time         zone designators are not present.    </stdVals>    Compound proper names are marked.     Dates are marked.     Italics are recorded without interpretation.   </editorialDecl>  <classDecl>   <taxonomy xml:id="lcsh">    Library of Congress Subject Headings    <taxonomy xml:id="lc">    Library of Congress Classification   </classDecl> </encodingDesc> <profileDesc>   1774   <langUsage>   <language ident="en" usage="100">English.   </langUsage>  <textClass>   <keywords scheme="#lcsh">     Political science      United States -- Politics and government — Revolution, 1775-1783 <classCode scheme="#lc">JC 177</classCode> </textClass> </profileDesc> <revisionDesc> <change when="1996-01-22"> CMSMcQ finished proofreading <change when="1995-10-30"> L.B. finished proofreading <change when="1995-07-20"> R.G. finished proofreading <change when="1995-07-04"> R.G. finished data entry <change when="1995-01-15"> R.G. began data entry </revisionDesc> </teiHeader>

Wikipedia (http://de.wikipedia.org/wiki/Text_Encoding_Initiative#Praxisbeispiel) <?xml version="1.0" encoding="UTF-8"?> <TEI xmlns="http://www.tei-c.org/ns/1.0"> <teiHeader> <fileDesc> <titleStmt> Auf dem Brocken Heinrich Heine (1797–1856) <respStmt> Wiki Autor Umwandlung in TEI-konformes XML </respStmt> </titleStmt> <publicationStmt> aus Wikisource, der freien Quellensammlung (<ptr target="http://de.wikisource.org/wiki/Auf_dem_Brocken"/>) </publicationStmt> <sourceDesc> <biblFull> <titleStmt> Auf dem Brocken Buch der Lieder <title level="m" type="sub">Aus der Harzreise Heine, Heinrich </titleStmt> <publicationStmt> Hoffmann und Campe <pubPlace>Hamburg</pubPlace> 1827                            Gemeinfrei, keine Nutzungsbeschränkungen </publicationStmt> </biblFull> </sourceDesc> </fileDesc> </teiHeader> Auf dem Brocken. <l>Heller wird es schon im Osten</l> <l>Durch der Sonne kleines Glimmen,</l> <l>Weit und breit die Bergesgipfel,</l> <l>In dem Nebelmeere schwimmen.</l> </lg> Hätt’ ich Siebenmeilenstiefel,</l> <l>Lief ich, mit der Hast des Windes,</l> <l>Ueber jene Bergesgipfel,</l> <l>Nach dem Haus des lieben Kindes.</l> </lg> <l>Von dem Bettchen, wo sie schlummert,</l> Zög’ ich leise die Gardinen,</l> <l>Leise küßt’ ich ihre Stirne,</l> <l>Leise ihres Munds Rubinen.</l> </lg> <l>Und noch leiser wollt’ ich flüstern</l> <l>In die kleinen Lilien-Ohren:</l> Denk’ im Traum, daß wir uns lieben,</l> <l>Und daß wir uns nie verloren.</l> </lg> </TEI>