Difference between revisions of "User:Martin de la Iglesia/Sandbox"

From MPDLMediaWiki
Jump to navigation Jump to search
(mapping ctd.)
 
(20 intermediate revisions by the same user not shown)
Line 1: Line 1:
Publication type tags from http://images.isiknowledge.com/WOK45/help/WOK/hft_wos.html - Web of Science Field Tags (Articles and Conference Proceedings) / WoK help:
==Mapping of Genres==
 
DCMI Type Vocabulary: http://dublincore.org/documents/dcmi-type-vocabulary/
{| border="1"
{| border="1"
|+
|+
! WoS !! eSciDoc !! Comment/Condition(s)
! eSciDoc !! DC !! Comment/Condition(s)
|-
| Article || Text || -
|-
|-
| Book || Text || -
|-
|-
| Book Item || Text || -
|-
|-
| Proceedings || Text || -
|-
|-
| Conference Paper || Text || -
|-
|-
| Talk at Event || - || Unfortunately, it is impossible to determine whether Text, (Moving)Image, or Sound should be used.
|-
|-
| Conference Report || Text || -
|-
|-
| Poster || Image || Alternatively, use the narrower term StillImage.
|-
|-
| Courseware/Lecture || - || Impossible to determine which Type should be used.
|-
|-
|-
| conference (C) || Conference Paper || -
| Thesis || Text || -
|-
|-
|-
|-
| book || - || not used in WoS records
| Paper || Text || -
|-
|-
|-
|-
| journal (J) || Article || set Source.Genre to Journal
| Report || Text || -
|-
|-
|-
|-
| book in series || - || not used in WoS records
| Journal || Text || -
|-
|-
|-
|-
| patent || - || not used in WoS records
| Issue || Text || -
|-|}
|-
 
|-
| Series || Collection || Text cannot be used because the Series could be a series of non-Text items (e.g. series of recorded talks).
|-
|-
| Manuscript || Text || -
|-
|-
| Other || - || -
|-
|}




{| border="1"
{| border="1"
|+
|+
! WoS !! eSciDoc !! Comment/Condition(s)
! eSciDoc !! DC !! Comment/Condition(s)
|-
| Genre || dc:type || Additionally, Genres are mapped to DCMI Type Vocabulary terms (see table above).
|-
|-
| Creator.CreatorType || - || -
|-
|-
| Creator.CreatorRole || - || determines whether the Creator is mapped to dc:contributor or to dc:creator (see below).
|-
|-
| Creator.Person.CompleteName || dc:creator || if Creator.CreatorRole = Author. Otherwise, if Creator.CreatorRole is Advisor, Contributor, Transcriber, Translator, or Honoree, map Creator.Person.CompleteName to dc:contributor. For other Creator.CreatorRoles, map Creator.Person.CompleteName to dc:creator if there is no Creator.CreatorRole = Author, or map to dc:contributor if there is already a Creator.CreatorRole = Author.
|-
|-
| Creator.Person.GivenName || dc:creator || if there's no CompleteName. See above
|-
|-
| FN (File Name) || - || -
|-
|-
| Creator.Person.FamilyName || dc:creator || if there's no CompleteName. See above
|-
|-
| VR (Version Number) || - || -
|-
|-
| Creator.Person.AlternativeName || - || -
|-
|-
| PT (Publication Type) || - || See [[#Mapping of Genres]]. Alternatively, types from DT (Document Type, see below) could be used.
|-
|-
| Creator.Person.Title || - || -
|-
|-
| AU (Authors) || Creator.Person.CompleteName || only if there is no AF (Author Full Name, see below). Set Creator.CreatorRole to Author. Organization authors are only supported in the group authors field in this mapping (see below).
|-
|-
| Creator.Person.Organization.Name || dc:contributor || -
|-
|-
| AF (Author Full Name) || Creator.Person.CompleteName || Set Creator.CreatorRole to Author. Organization authors are only supported in the group authors field in this mapping (see below).
|-
|-
| Creator.Person.Organization.Address || - || -
|-
|-
| CA (Group Authors) || - || seems to have been changed to GP (see below).
|-
|-
| Creator.Person.Organization.Identifier || - || -
|-
|-
| GP [=Group Author] || Creator.Organization.Name || Set Creator.CreatorRole to Author.
|-
|-
| Creator.Person.Identifier || - || -
|-
|-
| TI (Title) || Title || -
|-
|-
| Creator.Organization.Name || dc:creator || if Creator.CreatorRole = Author. Otherwise, if Creator.CreatorRole is Advisor, Contributor, Transcriber, Translator, or Honoree, map Creator.Organization.Name to dc:contributor. For other Creator.CreatorRoles, map Creator.Organization.Name to dc:creator if there is no Creator.CreatorRole = Author, or map to dc:contributor if there is already a Creator.CreatorRole = Author.
|-
|-
| ED (Editors) || Source.Creator.Person.CompleteName || Field contains all editors in a single string that needs to be split into several Creator fields. Set Source.Creator.CreatorRole to Editor.
|-
|-
| Creator.Organization.Address || - || -
|-
|-
| SO (Publication Name) || Source.Title || -
|-
|-
| Creator.Organization.Identifier || - || -
|-
|-
| SE (Book Series Title) || 2nd Source: Source.Title || Set Genre of 2nd Source to Series.
|-
|-
|-
|-
| BS (Book Series Subtitle) || 2nd Source: Source.AlternativeTitle || -
| Title || dc:title || -
|-
|-
|-
|-
| LA (Language) || Language || Needs to be mapped to ISO code.
| Language || dc:language || ISO 639 is also proposed by DCMI, so no transformation is necessary.
|-
|-
|-
|-
| DT (Document Type) || - || Basically, all DTs in WoS are either Articles or Conference Papers, so mapping using the PT field is sufficient (unless DTs "Meeting Abstract" and/or "Meeting Summary" should be mapped to Conference Report).
| AlternativeTitle || dc:title || It should be made clear by the order of the dc:title fields which is the main title and which is the alternative title (e.g. 1st dc:title = main title, 2nd dc:title = alternative title).
|-
|-
|-
|-
| CT (Conference Title) || Event.Title || -
| Identifier.Id || dc:identifier || -
|-
|-
|-
|-
| CY (Conference Date) || Event.StartDate, Event.EndDate || Needs to be split into the two Event.*Date fields.
| Identifier.IdType || - || Unfortunately, some ID types are hard to recognize without their explicit Id.Type information (e.g. PMID), but qualifiers are only supported in Qualified Dublin Core.
|-
|-
|-
|-
| HO (Conference Host) || - || Not supported. Alternatively, concatenate with CL (see below) and map to Event.Place.
| PublishingInfo.Publisher || dc:publisher || -
|-
|-
|-
|-
| CL (Conference Location) || Event.Place || -
| PublishingInfo.Place || - || Could be used in a citation - maybe as OpenURL? - in dc:identifier (similar to Source identifiers, see below, see also http://dublincore.org/documents/dc-citation-guidelines/).
|-
|-
|-
|-
| SP (Conference Sponsors) || - || -
| PublishingInfo.Edition || dc:relation || Since the qualifier hasVersion cannot be used in Simple Dublin Core, it might be useful to add the prefix "Edition: ".
|-
|-
|-
|-
| DE (Author Keywords) || Subject || together with ID and SC (see below).
| Date.Date || dc:date || ISO 8601 is also proposed by DCMI, so no transformation is necessary. Since no qualifiers are used to express the Date.DateType, only the first Date.Date according to [[PubMan_Func_Spec_OpenURL_Mapping]] is mapped to dc:date and all others are ignored.
|-
|-
|-
|-
| ID (Keywords Plus) || Subject || together with DE (see above) and SC (see below).
| Date.DateType || - || is used to determine which Date.Date is mapped to dc:date (see above).
|-
|-
|-
|-
| AB (Abstract) || Abstract || -
| ReviewMethod || - || -
|-
|-
|-
|-
| C1 (Author Address) || Creator.Organization.Name, Creator.Organization.Address || Match address prefix in square brackets with author name (AU/AF), then split into name and address part and assign to matching authors' Creator.Organization.
| Source.Genre || - || -
|-
|-
|-
|-
| RP (Reprint Address) || - || RP is usually the first author's address, and thus identical with one of the C1 addresses. Use only for Creator.Organization.Name + .Address if there is no C1.
| Source.Title || dc:identifier || together with other Source information (see below). DCMI suggests dcterms:bibliographicCitation for this kind of data, which is a Qualified Dublin Core refinement of dc:identifier. Additionally, Source information may be stored in dc:identifier in OpenURL format. See http://dublincore.org/documents/dc-citation-guidelines/
|-
|-
|-
|-
| EM (E-mail Address) || - || -
| Source.AlternativeTitle || - || -
|-
|-
|-
|-
| FU (Funding Agency and Grant Number) || - || -
| Source.Creator.CreatorType || - || -
|-
|-
|-
|-
| FX (Funding Text) || - || -
| Source.Creator.CreatorRole || - || -
|-
|-
|-
|-
| CR (Cited References) || - || -
| Source.Creator.Person.CompleteName || dc:identifier || together with other Source data (see above). Map only if Source.Genre = Book, Proceedings, Issue, or Other.
|-
|-
|-
|-
| NR (Cited Reference Count) || - || -
| Source.Creator.Person.GivenName || dc:identifier || if there is no Source.Creator.Person.CompleteName. See above.
|-
|-
|-
|-
| TC (Times Cited) || - || -
| Source.Creator.Person.FamilyName || dc:identifier || if there is no Source.Creator.Person.CompleteName. See above.
|-
|-
|-
|-
| PU (Publisher) || Source.PublishingInfo.Publisher || -
| Source.Creator.Person.AlternativeName || - || -
|-
|-
|-
|-
| PI (Publisher City) || - || Source.PublishingInfo.Place is obtained from PA instead (see below).
| Source.Creator.Person.Title || - || -
|-
|-
|-
|-
| PA (Publisher Address) || Source.PublishingInfo.Place || More complete than PI (also contains country).
| Source.Creator.Person.Organization || - || -
|-
|-
|-
|-
| SC (Subject Category) || Subject || together with DE and ID (see above).
| Source.Creator.Person.Identifier || - || -
|-
|-
|-
|-
| SN (ISSN) || Source.Identifier.Id || Set Source.Genre to Journal. If there is also an ISBN (BN field, see below), map to 2nd Source.Identifier.Id instead, set Source.Genre to Proceedings and 2nd Source.Genre to Series. IdType = ISSN.
| Source.Creator.Organization.Name || dc:identifier || together with other Source data (see above). Map only if Source.Genre = Book, Proceedings, Issue, or Other.
|-
|-
|-
|-
| BN (ISBN) || Source.Identifier.Id || Set Source.Genre to Proceedings. IdType = ISBN.
| Source.Creator.Organization.Address || - || -
|-
|-
|-
|-
| J9 (29-Character Source Abbreviation) || Source.AlternativeTitle || If there is a Series Title (SE), map to 2nd Source.AlternativeTitle instead.
| Source.Creator.Organization.Identifier || - || -
|-
|-
|-
|-
| JI (ISO Source Abbreviation) || Source.AlternativeTitle || If there is a Series Title (SE), map to 2nd Source.AlternativeTitle instead.
| Source.Volume || dc:identifier || together with other Source data (see above).
|-
|-
|-
|-
| PD (Publication Date) || Date.Date || together with PY (see below). DateType = created.
| Source.Issue || dc:identifier || together with other Source data (see above).
|-
|-
|-
|-
| PY (Year Published) || Date.Date || together with PY (see above) if there is a PD. DateType = created.
| Source.StartPage || dc:identifier || together with other Source data (see above).
|-
|-
|-
|-
| VL (Volume) || Source.Volume || If there is a Series Title (SE), map to 2nd Source.Volume instead.
| Source.EndPage || dc:identifier || together with other Source data (see above).
|-
|-
|-
|-
| IS (Issue) || Source.Issue || If there is a Series Title (SE), map to 2nd Source.Issue instead.
| Source.SequenceNumber || - || -
|-
|-
|-
|-
| PN (Part Number) || - || Probably not used in WoS anyway.
| Source.PublishingInfo.Publisher || dc:identifier || together with other Source data (see above). If Source.Genre = Journal or Series, ignore Source.PublishingInfo.Publisher instead.
|-
|-
|-
|-
| SU (Supplement) || - || Probably not used in WoS anyway.
| Source.PublishingInfo.Place || dc:identifier || together with other Source data (see above). If Source.Genre = Journal or Series, ignore Source.PublishingInfo.Place instead.
|-
|-
|-
|-
| SI (Special Issue) || - || Special issues usually have a regular issue number as well.
| Source.PublishingInfo.Edition || dc:identifier || together with other Source data (see above). If Source.Genre = Journal or Series, ignore Source.PublishingInfo.Edition instead.
|-
|-
|-
|-
| BP (Beginning Page) || Source.StartPage || -
| Source.Identifier.Id || - || -
|-
|-
|-
|-
| EP (Ending Page) || Source.EndPage || -
| Source.Identifier.IdType || - || -
|-
|-
|-
|-
| AR (Article Number) || - || -
| 2nd Source || - || -
|-
|-
|-
|-
| PG (Page Count) || TotalNumberOfPages || If there is no PG, use EP minus BP for TotalNumberOfPages instead.
| Event || dc:relation || export all Event.* fields to a single dc:relation field and separate the values with a comma. (In Qualified Dublin Core, the refinement would be dcterms:isPartOf.)
|-
|-
|-
|-
| DI (Digital Object Identifier (DOI)) || Identifier.Id || Set IdType to DOI.
| TotalNumberOfPages || dc:format || add suffix " pages".
|-
|-
|-
|-
| GA (Document Delivery Number) || - || -
| Degree || - || -
|-
|-
|-
|-
| UT (Unique Article Identifier) || Identifier || Set IdType to ISI.
| Abstract || dc:description || -
|-
|-
|-
|-
| ER (End of Record) || - || -
| Subject || dc:subject || Map each keyword to a dc:subject field of its own. Dublin Core also allows using a single dc:subject field and delimiters to separate the keywords, but it will probably be easier to use multiple fields once the changes to the Subject field ([[Talk:PubMan_Metadata_Sets#Subject_once]]) are implemented.
|-
|-
|-
|-
| EF (End of File) || - || -
| TableOfContents || dc:description || -
|-
|-
|}
|}

Latest revision as of 09:26, 14 April 2009

Mapping of Genres[edit]

DCMI Type Vocabulary: http://dublincore.org/documents/dcmi-type-vocabulary/

eSciDoc DC Comment/Condition(s)
Article Text -
Book Text -
Book Item Text -
Proceedings Text -
Conference Paper Text -
Talk at Event - Unfortunately, it is impossible to determine whether Text, (Moving)Image, or Sound should be used.
Conference Report Text -
Poster Image Alternatively, use the narrower term StillImage.
Courseware/Lecture - Impossible to determine which Type should be used.
Thesis Text -
Paper Text -
Report Text -
Journal Text -
Issue Text -
Series Collection Text cannot be used because the Series could be a series of non-Text items (e.g. series of recorded talks).
Manuscript Text -
Other - -


eSciDoc DC Comment/Condition(s)
Genre dc:type Additionally, Genres are mapped to DCMI Type Vocabulary terms (see table above).
Creator.CreatorType - -
Creator.CreatorRole - determines whether the Creator is mapped to dc:contributor or to dc:creator (see below).
Creator.Person.CompleteName dc:creator if Creator.CreatorRole = Author. Otherwise, if Creator.CreatorRole is Advisor, Contributor, Transcriber, Translator, or Honoree, map Creator.Person.CompleteName to dc:contributor. For other Creator.CreatorRoles, map Creator.Person.CompleteName to dc:creator if there is no Creator.CreatorRole = Author, or map to dc:contributor if there is already a Creator.CreatorRole = Author.
Creator.Person.GivenName dc:creator if there's no CompleteName. See above
Creator.Person.FamilyName dc:creator if there's no CompleteName. See above
Creator.Person.AlternativeName - -
Creator.Person.Title - -
Creator.Person.Organization.Name dc:contributor -
Creator.Person.Organization.Address - -
Creator.Person.Organization.Identifier - -
Creator.Person.Identifier - -
Creator.Organization.Name dc:creator if Creator.CreatorRole = Author. Otherwise, if Creator.CreatorRole is Advisor, Contributor, Transcriber, Translator, or Honoree, map Creator.Organization.Name to dc:contributor. For other Creator.CreatorRoles, map Creator.Organization.Name to dc:creator if there is no Creator.CreatorRole = Author, or map to dc:contributor if there is already a Creator.CreatorRole = Author.
Creator.Organization.Address - -
Creator.Organization.Identifier - -
Title dc:title -
Language dc:language ISO 639 is also proposed by DCMI, so no transformation is necessary.
AlternativeTitle dc:title It should be made clear by the order of the dc:title fields which is the main title and which is the alternative title (e.g. 1st dc:title = main title, 2nd dc:title = alternative title).
Identifier.Id dc:identifier -
Identifier.IdType - Unfortunately, some ID types are hard to recognize without their explicit Id.Type information (e.g. PMID), but qualifiers are only supported in Qualified Dublin Core.
PublishingInfo.Publisher dc:publisher -
PublishingInfo.Place - Could be used in a citation - maybe as OpenURL? - in dc:identifier (similar to Source identifiers, see below, see also http://dublincore.org/documents/dc-citation-guidelines/).
PublishingInfo.Edition dc:relation Since the qualifier hasVersion cannot be used in Simple Dublin Core, it might be useful to add the prefix "Edition: ".
Date.Date dc:date ISO 8601 is also proposed by DCMI, so no transformation is necessary. Since no qualifiers are used to express the Date.DateType, only the first Date.Date according to PubMan_Func_Spec_OpenURL_Mapping is mapped to dc:date and all others are ignored.
Date.DateType - is used to determine which Date.Date is mapped to dc:date (see above).
ReviewMethod - -
Source.Genre - -
Source.Title dc:identifier together with other Source information (see below). DCMI suggests dcterms:bibliographicCitation for this kind of data, which is a Qualified Dublin Core refinement of dc:identifier. Additionally, Source information may be stored in dc:identifier in OpenURL format. See http://dublincore.org/documents/dc-citation-guidelines/
Source.AlternativeTitle - -
Source.Creator.CreatorType - -
Source.Creator.CreatorRole - -
Source.Creator.Person.CompleteName dc:identifier together with other Source data (see above). Map only if Source.Genre = Book, Proceedings, Issue, or Other.
Source.Creator.Person.GivenName dc:identifier if there is no Source.Creator.Person.CompleteName. See above.
Source.Creator.Person.FamilyName dc:identifier if there is no Source.Creator.Person.CompleteName. See above.
Source.Creator.Person.AlternativeName - -
Source.Creator.Person.Title - -
Source.Creator.Person.Organization - -
Source.Creator.Person.Identifier - -
Source.Creator.Organization.Name dc:identifier together with other Source data (see above). Map only if Source.Genre = Book, Proceedings, Issue, or Other.
Source.Creator.Organization.Address - -
Source.Creator.Organization.Identifier - -
Source.Volume dc:identifier together with other Source data (see above).
Source.Issue dc:identifier together with other Source data (see above).
Source.StartPage dc:identifier together with other Source data (see above).
Source.EndPage dc:identifier together with other Source data (see above).
Source.SequenceNumber - -
Source.PublishingInfo.Publisher dc:identifier together with other Source data (see above). If Source.Genre = Journal or Series, ignore Source.PublishingInfo.Publisher instead.
Source.PublishingInfo.Place dc:identifier together with other Source data (see above). If Source.Genre = Journal or Series, ignore Source.PublishingInfo.Place instead.
Source.PublishingInfo.Edition dc:identifier together with other Source data (see above). If Source.Genre = Journal or Series, ignore Source.PublishingInfo.Edition instead.
Source.Identifier.Id - -
Source.Identifier.IdType - -
2nd Source - -
Event dc:relation export all Event.* fields to a single dc:relation field and separate the values with a comma. (In Qualified Dublin Core, the refinement would be dcterms:isPartOf.)
TotalNumberOfPages dc:format add suffix " pages".
Degree - -
Abstract dc:description -
Subject dc:subject Map each keyword to a dc:subject field of its own. Dublin Core also allows using a single dc:subject field and delimiters to separate the keywords, but it will probably be easier to use multiple fields once the changes to the Subject field (Talk:PubMan_Metadata_Sets#Subject_once) are implemented.
TableOfContents dc:description -