Talk:PubMan Func Spec Submission/Generic TEI 2 PubItem Mapping

From MPDLMediaWiki
Jump to navigation Jump to search

Mapping of TEI Genres[edit]

  • Here we need a default, if no level attribute is provided. Would propose 'Article'.

Mapping of TEI Fields[edit]

  • Author/Editor:
    • '.name or .orgName are mapped to .Name' - what happens if both are provided?
    • What happens with sub elements like forename, persname, namelink etc.
    • Add author.email to address (?)
  • publicationStmt.*.address
    • Can this be added to the field PublishingInfo.Place or .Publisher ?
  • teiHeader.fileDesc.publicationStmt.authority
    • dc:rights? ("(release authority) supplies the name of a person or other agency responsible for making an electronic file available, other than a publisher or distributor.")
  • date
    • What with the different date types? like:
<monogr>
  <imprint>
     <date type="Accepted" when="2009-02-17"/>
  </imprint>
</monogr>
  • ...meeting.title
    • "Genre to "Conference Paper" (however, the Genres could also include "Talk at Event" or "Conference Report" etc.)" - We should only map if the mapping is distinct
  • biblStruct.monogr.imprint.biblScope
    • "This field may contain subfields like pubPlace, extent etc." - I am afraid we have to list relevant elements again, to be able to write the transformation (have to check with Julia)
  • teiHeader.fileDesc.sourceDesc
    • Unfortunatly a xslt can not distinguish between a phrase and a title, clear mapping needed. "extract everything that can be automatically recognized and mapping it to other fields, and putting the rest into Title" - Not clear, what can be in there? Please provide example.
  • teiHeader.fileDesc.seriesStmt
    • The genre type mapping is not clear to me.
  • ...biblStruct.monogr.imprint.biblScope@type="chap"
    • would not map to Source.SequenceNumber
  • *.adress
    • Needs more specific mapping, check example
  • publicationStmt.date
    • Map to dcterms:dateCopyrighted (?)
  • meeting.date
    • Map to: Event.StartDate, Event.EndDate, Here we should provide an example to know if we can distinguish between start and enddate
  • *.classCode.list.item
    • Would not map this, as the usage is unclear (for me), or do you have example?

General[edit]

  • "Organization editors are not supported in this mapping. " - Why not, is there any restriction from tei side? (same for creator)
  • *.idno: We have to specify what happens to identifier types which are not provided in PubMan

Examples[edit]

StartPage/EndPage[edit]

<biblScope type="pp">381-417</biblScope>
<biblScope type="pp" from="12" to="34"/>
<biblScope type="pp">12</biblScope> 
<biblScope type="pages">3-46</biblScope>
First check if attribute 'from', 'to' is provided
If two numbers are separated by '-' the first number is mapped to StartPage and the second to EndPage, else the whole String is mapped to StartPage

Extend[edit]

<extent>78 p.</extent> -mapped
<extent>19 pp.</extent> -mapped
<extent>3200 sentences</extent> -not mapped
<extent>between 10 and 20 Mb</extent> -not mapped

Author[edit]

<author>Paine, Thomas (1737-1809)</author>
<author>La Fayette, Marie Madeleine Pioche de la Vergne, comtesse de (1634–1693)</author>
All info within round brackets in an author element should be parsed out. This should apply to all person names (editor, contributor etc.)

If element persName does exist, apply the following mapping:

TEI Element PubMan Metadata Set Description
TEIHeader.fileDesc.sourceDesc.biblStruct.analytic.author.persName Publication.Creator.Person.FamilyName
Note: Only if ...author.persName.surname is empty
TEIHeader.fileDesc.sourceDesc.biblStruct.analytic.author.persName.forename Publication.Creator.Person.GivenName Add middle name to first name.
TEIHeader.fileDesc.sourceDesc.biblStruct.analytic.author.persName.surname Publication.Creator.Person.FamilyName --
TEIHeader.fileDesc.sourceDesc.biblStruct.analytic.author.persName.roleName not mapped contains a name component which indicates that the referent has a particular role or position in society, such as an official title or rank. Dr, Miss, M.Tech (degree)
TEIHeader.fileDesc.sourceDesc.biblStruct.analytic.author.persName.nameLink add to Publication.Creator.Person.FamilyName
Seperated from the family name by space
contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or of
TEIHeader.fileDesc.sourceDesc.biblStruct.analytic.author.persName.genName add to Publication.Creator.Person.GivenName
seperated from the given name by comma
contains a name component like JR.

Address[edit]

<address>
   <addrLine>Oxford University Computing Services,</addrLine>
   <addrLine>13 Banbury Road,</addrLine>
   <addrLine>Oxford OX2 6RB,</addrLine>
   <addrLine>UK</addrLine>
</address>
<address>
   <street>110 Southmoor Road</street>
   <name type="city">Oxford</name>
   <postCode>OX2 6RB</postCode>
   <name type="country">United Kingdom</name>
</address>