Talk:PubMan Func Spec Submission/Generic TEI 2 PubItem Mapping


Mapping of TEI Genres

  • We need a default genre here if no level attribute is provided. I would propose 'Article'.

Mapping of TEI Fields

  • Author/Editor:
    • '.name or .orgName are mapped to .Name' - what happens if both are provided?
    • What happens with sub-elements such as forename, persName, nameLink, etc.?
    • Add author.email to address (?)
  • publicationStmt.*.address
    • Can this be added to the field PublishingInfo.Place or .Publisher?
  • teiHeader.fileDesc.publicationStmt.authority
    • dc:rights? ("(release authority) supplies the name of a person or other agency responsible for making an electronic file available, other than a publisher or distributor.")
  • date
    • What about the different date types, such as:
<monogr>
  <imprint>
     <date type="Accepted" when="2009-02-17"/>
  </imprint>
</monogr>
  • ...meeting.title
    • "Genre to "Conference Paper" (however, the Genres could also include "Talk at Event" or "Conference Report" etc.)" - We should only map if the mapping is unambiguous.
  • biblStruct.monogr.imprint.biblScope
    • "This field may contain subfields like pubPlace, extent etc." - I am afraid we have to list the relevant elements again in order to write the transformation (I have to check with Julia).
  • teiHeader.fileDesc.sourceDesc
    • Unfortunately, an XSLT cannot distinguish between a phrase and a title, so a clear mapping is needed. "extract everything that can be automatically recognized and mapping it to other fields, and putting the rest into Title" - It is not clear what can be in there. Please provide an example.
  • teiHeader.fileDesc.seriesStmt
    • The genre type mapping is not clear to me.
  • ...biblStruct.monogr.imprint.biblScope@type="chap"
    • I would not map this to Source.SequenceNumber.
  • *.address
    • Needs a more specific mapping; see the Address example.
  • publicationStmt.date
    • Map to dcterms:dateCopyrighted (?)
  • meeting.date
    • Map to: Event.StartDate, Event.EndDate. We should provide an example here to find out whether we can distinguish between start and end date.
  • *.classCode.list.item
    • I would not map this, as the usage is unclear (to me). Or do you have an example?
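
Regarding the date-type question above: before deciding on target fields, it may help to see which types actually occur. The following is only a rough sketch in Python (not the XSLT we would finally write). It collects the dates of an imprint keyed by their type attribute. The function name dates_by_type is hypothetical, and the sketch assumes the input carries no XML namespace and that the value sits in the when attribute or, failing that, in the element text.

```python
import xml.etree.ElementTree as ET


def dates_by_type(fragment_xml):
    """Collect <date> values keyed by their lower-cased type attribute.

    Hypothetical helper: assumes no XML namespace on the input.
    Dates without a type attribute are stored under the empty string.
    """
    root = ET.fromstring(fragment_xml)
    result = {}
    for date in root.iter("date"):
        date_type = (date.get("type") or "").lower()
        # Prefer the 'when' attribute, fall back to the element text.
        value = date.get("when") or (date.text or "").strip()
        if value:
            result[date_type] = value
    return result
```

Which PubMan date field each type should go to is still open and deliberately not decided in the sketch.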

General

  • "Organization editors are not supported in this mapping. " - Why not? Is there any restriction on the TEI side? (The same goes for creator.)
  • *.idno: We have to specify what happens to identifier types that are not provided in PubMan.

Examples

StartPage/EndPage

<biblScope type="pp">381-417</biblScope>
<biblScope type="pp" from="12" to="34"/>
<biblScope type="pp">12</biblScope> 
<biblScope type="pages">3-46</biblScope>
First check whether the attributes 'from' and 'to' are provided.
Otherwise, if two numbers are separated by '-', the first number is mapped to StartPage and the second to EndPage; else the whole string is mapped to StartPage.
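
The rule above can be sketched as follows. This is an illustration in Python rather than the final XSLT; the function name map_pages is hypothetical, and the sketch assumes a single biblScope element without an XML namespace and a plain hyphen as separator.

```python
import re
import xml.etree.ElementTree as ET


def map_pages(bibl_scope_xml):
    """Return (StartPage, EndPage) for one <biblScope> element.

    Hypothetical helper following the rule on this page:
    1. use the 'from'/'to' attributes if present,
    2. else split 'number-number' text into start and end,
    3. else map the whole string to StartPage.
    """
    elem = ET.fromstring(bibl_scope_xml)
    start, end = elem.get("from"), elem.get("to")
    if start is not None or end is not None:
        return start, end
    text = (elem.text or "").strip()
    match = re.fullmatch(r"(\d+)\s*-\s*(\d+)", text)
    if match:
        return match.group(1), match.group(2)
    # No recognizable range: the whole string goes to StartPage.
    return (text or None), None
```

Applied to the four examples above, this gives 381/417, 12/34, 12 with no end page, and 3/46.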

Extent

<extent>78 p.</extent> -mapped
<extent>19 pp.</extent> -mapped
<extent>3200 sentences</extent> -not mapped
<extent>between 10 and 20 Mb</extent> -not mapped
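
One way to separate the mapped from the unmapped cases above is to accept only values of the form "number p." or "number pp.". A small Python sketch of that idea follows. The name map_extent is hypothetical, and returning just the page count (rather than the full string) is an assumption; the target field is left open.

```python
import re


def map_extent(text):
    """Return the page count as a string if the extent is given in pages.

    Hypothetical helper: accepts 'N p.' or 'N pp.' (the trailing dot is
    optional) and returns None for anything else, i.e. 'not mapped'.
    """
    match = re.fullmatch(r"\s*(\d+)\s*(?:p|pp)\.?\s*", text)
    return match.group(1) if match else None
```

With the four examples above, the first two yield 78 and 19, and the last two yield None.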

Author

<author>Paine, Thomas (1737-1809)</author>
<author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author>
All information within round brackets in an author element should be parsed out (?)
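
If we decide to parse out the bracketed part, it could look roughly like this Python sketch. The name split_author is hypothetical, and the sketch assumes at most one bracketed group at the end of the string; what to do with the extracted part is not decided here.

```python
import re


def split_author(text):
    """Split an author string into (name, bracketed_info).

    Hypothetical helper: only a single trailing '(...)' group is
    recognized; if there is none, bracketed_info is None.
    """
    match = re.fullmatch(r"\s*(.*?)\s*\(([^()]*)\)\s*", text)
    if match:
        return match.group(1), match.group(2)
    return text.strip(), None
```

For the first example this separates "Paine, Thomas" from "1737-1809".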

Address

<address>
   <addrLine>Oxford University Computing Services,</addrLine>
   <addrLine>13 Banbury Road,</addrLine>
   <addrLine>Oxford OX2 6RB,</addrLine>
   <addrLine>UK</addrLine>
</address>
<address>
   <street>110 Southmoor Road</street>
   <name type="city">Oxford</name>
   <postCode>OX2 6RB</postCode>
   <name type="country">United Kingdom</name>
</address>
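
Both address variants above could be reduced to a single string before being mapped to whichever field we agree on. The Python sketch below shows one possible approach. The name flatten_address is hypothetical, and joining the parts with ", " is an assumption rather than an agreed rule; the input is assumed to carry no XML namespace.

```python
import xml.etree.ElementTree as ET


def flatten_address(address_xml):
    """Join the child elements of <address> into one comma-separated string.

    Hypothetical helper: works for both <addrLine> children and
    structured children such as <street> or <postCode>. Trailing commas
    inside the source lines are dropped to avoid doubled separators.
    """
    elem = ET.fromstring(address_xml)
    parts = []
    for child in elem:
        text = "".join(child.itertext()).strip().rstrip(",").strip()
        if text:
            parts.append(text)
    return ", ".join(parts)
```

For the second example above this gives "110 Southmoor Road, Oxford, OX2 6RB, United Kingdom".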