Talk:PubMan Func Spec eSciDoc To eDoc Mapping

From MPDLMediaWiki
Revision as of 11:17, 23 February 2010 by Makarenko (talk | contribs) (→‎Other)
Jump to navigation Jump to search

Open Issues[edit]

Other[edit]

  • could you search if we filled in a source2 on Pubman with publication year 2009? If so, we want to check the mapping.


Resolved Issues[edit]

Article[edit]

Book[edit]

InBook[edit]

Conference paper[edit]

  • Free keywords are mapped to thesaurus instead of free keywords (might be a general error not only this genre - but I can't find the other example at the moment)
    • dcterms:subject element of pubman item is the list of the keywords, thus it can be directly mapped to edoc keywords field
    • dc:subject elements are list of disciplines taken from controlled vocabulary. Can be mapped to the eDoc element discipline --Makarenko 18:35, 9 February 2010 (UTC)
      • agreed --Karin 14:34, 10 February 2010 (UTC)
  • Date of publication not mapped
    • I need more precise mapping dates in whole. Please, revise. --Makarenko 18:35, 9 February 2010 (UTC)
      • I think the 'problem' lies with the online dates. I checked again with the new upload. There are dates (which are mandatory on eDoc) if a date published in print (dcterms:issued xsi:type="dcterms:W3CDTF">2009</dcterms:issued>) x5.snippet is available. However if no date in print is available, no date was mapped to edoc. Please, then take the date published online, like <eterms:published-online xsi:type="dcterms:W3CDTF">2009</eterms:published-online>. These proceedings will only be published online. Is this enough specs?--Karin 15:30, 10 February 2010 (UTC)
      • --Makarenko 17:29, 10 February 2010 (UTC): Done.
  • Separator of editors have to be '; '
    • Done --Makarenko 18:35, 9 February 2010 (UTC)
  • URI not mapped (see: http://edoc4.gwdg.de/display.epl?mode=doc&id=408193 http://pubman.mpdl.mpg.de/pubman/item/escidoc:101988:16 )
    • @Karin, do you mean the missing URI for the source? This info is actually not needed for the Yearbook, therefore we ignored it. --Ulla 14:56, 10 February 2010 (UTC)
      • Yes, I meant that URI. It's ok, since its not needed for the yearbook.
  • Place of publication and publisher not mapped
    • Done. --Makarenko 18:43, 9 February 2010 (UTC)
      • no, its'not yet done. Is this the same issue as with the inbook publishing info data? --Karin 11:00, 11 February 2010 (UTC)
      • Shall we handle this case as we handle InBook now? (map publishing info of source to publishing info of publication).--Friederike 12:35, 15 February 2010 (UTC)
      • As discussed with Karin, please handle Conference Paper like InBook for this issue.--Friederike 13:41, 15 February 2010 (UTC)
        • --Makarenko 14:56, 15 February 2010 (UTC): Fixed.

Issue[edit]

Thesis[edit]

  • thesis has to be mapped to GENRE PHD-Thesis (this is what we used before on eDoc)
    • --Makarenko 19:02, 9 February 2010 (UTC): Done.
  • date published in print should be mapped to date accepted on eDoc
    • --Makarenko 19:02, 9 February 2010 (UTC): See my comment above.
  • title of source is not mapped
    • Not relevant for the genre. (see: http://edoc4.gwdg.de/display.epl?mode=doc&id=408446 http://pubman.mpdl.mpg.de/pubman/item/escidoc:107934 ) --Makarenko 19:02, 9 February 2010 (UTC)
      • That maybe true, but is is our one and only series ....We can live it and enter it by hand...--Karin 15:32, 10 February 2010 (UTC)
        • --Makarenko 16:15, 10 February 2010 (UTC): Not needed do it manually, just define the mapping :)
        • Is this not anyway mapped? Please check: Mapping: "If source@type = series: zim_transfer.record.publication.source.inseries.titleofseries " this should be the case here I think. The source.title mapping only seperates for different source genres, not for different publication genre. --Friederike 16:20, 10 February 2010 (UTC)
        • Clarified with Vlad: A thesis can not have a series (source in general) in edoc. therefore he will not map it.--Friederike 13:27, 12 February 2010 (UTC)
        • Ok, that's true we had a workaround on eDoc. --Karin 12:27, 15 February 2010 (UTC)

Mapping of Genres[edit]

  • Thesis
    • PubMan genre type thesis has to be mapped to GENRE PHD-Thesis (this is what we used before on eDoc) or accordingly to degree type on PubMan. In the 2009 data for the yearbook we only used PhdThesis.--Karin 17:59, 9 February 2010 (UTC)
      • --Makarenko 11:33, 10 February 2010 (UTC): Done.
        • The pubman 'date published in print' should be mapped to 'date of approval'. this has to be done. At the moment there is no date with the PhD thesis on the edoc test upload.--Karin 15:35, 16 February 2010 (UTC)
          • --Makarenko 16:54, 16 February 2010 (UTC): Done.

Other[edit]

General remarks for Mapping MPIPL--Karin 17:44, 9 February 2010 (UTC)

  • these first mapping specs are based on YEARbook data for 2009. Not all genres and metadata elements which have been used on PubMan are therefore specified yet.
    • --Ulla 14:59, 10 February 2010 (UTC) Agreed. We focused on the genre types and elements needed for the Yearbook (Pflichtfeldertabelle)
  • external affiliations; we do not want them on eDoc. It looks very chaotic sometimes all these external affliations which can't be assigned to one author. It could make sense if just the external affiliations of the MPI authors of a publication are shown, but not those of the other authors. But that's not a priority.
    • --Makarenko 19:13, 9 February 2010 (UTC): Removed.
  • the dates appear as they are on Pubman either as yyyy, yyyy-mm, or yyyy-mm-dd. Could we just have the yyyy on eDoc.
    • --Makarenko 19:13, 9 February 2010 (UTC): Fixed.
      • I forgot to specify one exception. The start and end dates of an event should be migrated to edoc as they appear on pubman. --Karin 10:21, 23 February 2010 (UTC)
        • --Makarenko 11:17, 23 February 2010 (UTC): Done.
  • Free keywords seem to have been mapped to the field thesaurus
    • --Makarenko 19:13, 9 February 2010 (UTC): Se my comment above.


  • With some genres not all links (URLs etc) have been migrated
    • See my comment above. --Makarenko 19:13, 9 February 2010 (UTC)
    • In addition to the PubMan item Id, which will be imported as "localid" to eDoc, the fulltext (components and locators) will be also available on edoc, as URL.
      Karin, please decide:
      • a)We import all components (fulltexts&locators) to eDoc, independent from visibility level on PubMan. If a component (fulltext or locator) has visibility level other than public, the user cannot access without log-in to PubMan (or the external system).
      • b)We import only components (fulltexts&locators) to eDoc which have visibility level "public" on PubMan.
      • If you want, we can provide this imported components always as "clickable" links
      • If you want, we can set default comment for imported components (e.g. "Fulltext available via PubMan http://pubman.mpdl.mpg.de)

--Ulla 15:05, 10 February 2010 (UTC)

  • After talk to Karin, it is not necessary to link all fulltexts, as we already have a link to the whole publication in PubMan. The comment for this link on edoc should be: 'More information or fulltext available via PubMan: URL'. --Friederike 14:08, 15 February 2010 (UTC)
    • --Makarenko 14:32, 15 February 2010 (UTC): Unfortunately, the file comment cannot be passed during the upload, that can be only done with an edoc SQL-query.
      • Decided: After final transformation Vlad will set the description value. (not possible during transformation).--Friederike 14:51, 15 February 2010 (UTC)