Managing CoNE entities - Persons

From MPDLMediaWiki
Revision as of 19:05, 24 August 2009 by Uat (talk | contribs) (→‎Scenarios)

Scenarios, use cases and data structure for extending the eSciDoc CONE service with data on persons, especially in the context of eSciDoc.PubMan.

Data pools

eSciDoc.PubMan will maintain two pools of person data:

Un-authorized person data

  • Consists of names, name variants and their affiliations as provided by users during submission or import.
  • The pool will be created from the current name variants of the publication items in PubMan and will grow continuously as new entries arrive through submission or import.
  • A user who submits data manually will be supported by an autosuggest list containing all currently available un-authorized data, in order to reduce duplicate entries (see Scenarios below).
  • The person information stored with a publication item will not be altered. It can be either un-authorized data or already authorized data. If it is un-authorized, the un-authorized nameID can later be related to an authorized PersonID in CONE.
  • The publication item will therefore contain exactly one naming variant, with the affiliations valid at a given point in time: the one provided during submission or import.
  • The entries in the un-authorized pool, which include all naming variants of the same person, will have an internal ID that can be linked to CONE IDs.
    • To be checked: whether, during linking, the un-authorized name variants will be added to the authorized CONE person as alternative names. --Natasa 16:45, 13 January 2009 (UTC)

Authorized person data

  • Contains the main name entry, controlled alternative name variants and controlled affiliations (see data structure).
  • Is controlled by selected users who edit the CONE service data.
  • Only selected users can access the CONE service to edit and maintain the controlled person IDs.
  • Selected users can relate un-authorized person name IDs to permanent, controlled, authorized person IDs in CONE.
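
The two pools can be pictured as two record types joined by an optional link. The following is only a minimal sketch: the class names, field names and example values are assumptions made for illustration and are not taken from the eSciDoc or CONE data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class UnauthorizedName:
    """One name variant exactly as submitted or imported (hypothetical model)."""
    name_id: str                           # internal ID of the un-authorized entry
    name: str                              # the variant as entered, never altered
    affiliations: List[str] = field(default_factory=list)
    cone_person_id: Optional[str] = None   # filled in once a selected user links it


@dataclass
class AuthorizedPerson:
    """Controlled person record in CONE (hypothetical model)."""
    person_id: str
    main_name: str
    alternative_names: List[str] = field(default_factory=list)
    affiliations: List[str] = field(default_factory=list)


def relate(name: UnauthorizedName, person: AuthorizedPerson) -> None:
    """Relate an un-authorized nameID to an authorized PersonID.

    Only the link is stored. Whether the variant is also copied into
    person.alternative_names is an open point, so it is not done here.
    """
    name.cone_person_id = person.person_id


submitted = UnauthorizedName("name-1", "Smith, J.", ["Example Institute"])
person = AuthorizedPerson("person-1", "Smith, Jane", ["Smith, J. A."])
relate(submitted, person)
print(submitted.name, "->", submitted.cone_person_id)
```

Keeping the link on the un-authorized entry leaves the submitted name untouched, which matches the rule that person information stored with a publication item is not altered.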


An example of how to distinguish these two lists in the presentation can be found here.

Scenarios

These scenarios, and any related use cases, have to be cross-checked against the preliminary functional specification. --Ulla 19:05, 24 August 2009 (UTC)

Submission

Status: in specification

Schedule: R4.1

During submission (either easy or full submission), the user can enter any name variant. S/he can follow the "Autopsie Prinzip" (autopsy principle) and copy the name variant exactly as it is printed on the original. Alternatively, to increase data quality, s/he can choose a name variant, including an affiliation, from the autosuggest list for persons. The suggested values can be un-authorized or authorized person data.

  • The user can select a value from the autosuggest list and store the item. While selecting, s/he is shown which of the suggested values are "un-authorized" and which are "authorized" person data (example: WorldCat). After a value has been selected, the publication item contains the selected ID and its value.
  • The user can select an un-authorized name from the autosuggest list and then overwrite the provided value. A new un-authorized nameID is created, without any relation to the previously selected nameID.
  • The user can select an authorized PersonID. S/he is not allowed to overwrite the value, but s/he can create a "potential candidate" for a new naming variant of the authorized PersonID.
  • The user can ignore the autosuggest list and enter any value for the person. A new un-authorized nameID is created.
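
The four cases above can be sketched as a single decision function. This is an illustration under assumptions: `Suggestion`, `Outcome`, `resolve` and the ID format are hypothetical names, and comparing the entered text with the suggested text is just one possible way to detect an overwrite.

```python
from dataclasses import dataclass
from itertools import count
from typing import Optional

_next_id = count(1)  # hypothetical generator for new un-authorized nameIDs


@dataclass
class Suggestion:
    entry_id: str      # nameID or PersonID offered by the autosuggest list
    name: str
    authorized: bool


@dataclass
class Outcome:
    stored_id: str                        # ID stored with the publication item
    stored_name: str                      # value stored with the publication item
    candidate_name: Optional[str] = None  # "potential candidate" variant, if any


def new_name_id() -> str:
    return f"name-{next(_next_id)}"


def resolve(entered: str, selected: Optional[Suggestion]) -> Outcome:
    """Decide what is stored for one person entry of a submission."""
    if selected is None:
        # Autosuggest ignored: a new un-authorized nameID is created.
        return Outcome(new_name_id(), entered)
    if entered == selected.name:
        # Suggestion accepted unchanged: the item keeps its ID and value.
        return Outcome(selected.entry_id, selected.name)
    if selected.authorized:
        # Authorized values cannot be overwritten; the differing text
        # is kept only as a potential candidate for a new variant.
        return Outcome(selected.entry_id, selected.name, candidate_name=entered)
    # Un-authorized value overwritten: new nameID, unrelated to the old one.
    return Outcome(new_name_id(), entered)


authorized = Suggestion("person-7", "Smith, Jane", True)
unauthorized = Suggestion("name-0", "Smith, J.", False)
accepted = resolve("Smith, Jane", authorized)
candidate = resolve("Smith, Jayne", authorized)
overwritten = resolve("Smith, J. A.", unauthorized)
free_text = resolve("Doe, John", None)
```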

Search

Status: in specification

Schedule: R4.1

Any search triggered via Quick search, Advanced search or the Search&Export service will search both un-authorized nameIDs and authorized CONE IDs.

On the PubMan GUI for search results, the user should be informed that other possible naming variants exist.

Comment Nicole: I think it would be good if the user got the following information: number of records found with a CONE ID, number of records found with an un-authorized person ID, and variants. The user should then be able to specify whether s/he wants to see only a specific set of records or all of them. --Nicole 08:30, 18 December 2008 (UTC)
Question Nicole: How and where shall we search if the user wants to perform an exact search? --Nicole 08:30, 18 December 2008 (UTC)
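
A search over both pools that also reports the counts suggested in the comment above could look roughly like this. The `Record` type and the case-insensitive substring match are assumptions made for the sketch; the actual PubMan search behaviour is not specified here.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Record:
    item_id: str
    creator_name: str
    cone_person_id: Optional[str] = None  # None: only an un-authorized nameID


def search(records: List[Record], query: str) -> Dict[str, list]:
    """Search creator names in both pools and group the hits by pool."""
    needle = query.casefold()
    hits = [r for r in records if needle in r.creator_name.casefold()]
    return {
        "authorized": [r for r in hits if r.cone_person_id is not None],
        "unauthorized": [r for r in hits if r.cone_person_id is None],
        # Distinct naming variants, so the GUI can hint that others exist.
        "variants": sorted({r.creator_name for r in hits}),
    }


records = [
    Record("item-1", "Smith, Jane", "person-1"),
    Record("item-2", "Smith, J."),
    Record("item-3", "Doe, John"),
]
result = search(records, "smith")
print(len(result["authorized"]), len(result["unauthorized"]), result["variants"])
```

With the grouped result, the user interface could let the user restrict the view to one set of records or show all of them.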

View researcher portfolio/profile

Status: in design

Schedule: R4.1

The search triggered by "View researcher profile" searches exclusively on CONE PersonIDs, because the "view researcher portfolio" service is bound to an authorized CONE PersonID. See the related use case for details: http://colab.mpdl.mpg.de/mediawiki/Researcher_Portfolio#UC_View_researcher_portfolio.2Fprofile

Edit CONE

Status: in specification

Schedule: R4.1/R4.2

Only selected users can define which of the un-authorized names or naming variants relate to one specific person. They should be able to

  • create and release a PersonID in CONE (start content might be ingested?)
  • edit the personal data related to a PersonID
  • relate one or many un-authorized nameIDs to one PersonID
  • add new naming variants to a PersonID (e.g. "potential candidates", cf. Submission)
  • look up persons in external authority files (Researcher ID, WorldCat, Kaken, PND)
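
The editing operations listed above, restricted to selected users, might be sketched as follows. `ConeEditor`, its method names and the in-memory storage are hypothetical; the sketch only shows the permission check and the one-to-many relation, not the CONE service interface.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Person:
    person_id: str
    main_name: str
    alternative_names: List[str] = field(default_factory=list)
    linked_name_ids: Set[str] = field(default_factory=set)


class ConeEditor:
    """In-memory stand-in for the editing side of CONE (assumption)."""

    def __init__(self, selected_users: Set[str]) -> None:
        self.selected_users = selected_users
        self.persons: Dict[str, Person] = {}

    def _require_selected(self, user: str) -> None:
        if user not in self.selected_users:
            raise PermissionError(f"{user} may not edit CONE person data")

    def create_person(self, user: str, person_id: str, main_name: str) -> Person:
        self._require_selected(user)
        person = Person(person_id, main_name)
        self.persons[person_id] = person
        return person

    def relate_names(self, user: str, person_id: str, name_ids: List[str]) -> None:
        """Relate one or many un-authorized nameIDs to one PersonID."""
        self._require_selected(user)
        self.persons[person_id].linked_name_ids.update(name_ids)

    def add_variant(self, user: str, person_id: str, variant: str) -> None:
        """Add a naming variant, e.g. an accepted potential candidate."""
        self._require_selected(user)
        person = self.persons[person_id]
        if variant != person.main_name and variant not in person.alternative_names:
            person.alternative_names.append(variant)


editor = ConeEditor(selected_users={"curator"})
editor.create_person("curator", "person-1", "Smith, Jane")
editor.relate_names("curator", "person-1", ["name-1", "name-2"])
editor.add_variant("curator", "person-1", "Smith, J.")
```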

Open questions

  • When references are imported (BibTeX, EndNote, fetch metadata), can we combine the import with an alternative to "autosuggest", i.e. to avoid duplicate entries?
    • One could check the given name for matches in both pools and then show the user a message about possible controlled names, which s/he can alter in the edit item mask. --Kleinfercher 08:50, 19 December 2008 (UTC)
  • Does it make sense to provide an additional extension on the view item page to search for "all publications of this author"? This is actually the same scenario as search, but in addition to starting from quick search, the user would have the option to trigger the search directly from the view item details (i.e. from the name of the person).
Comment Nicole: I think it makes sense to offer a link "all publications of this author", as we also had this in eDoc and I think it was used quite often. The only question is whether this will result in an exact search or not. --Nicole 08:32, 18 December 2008 (UTC)

Possible solution: Linked person names in view item trigger an exact search for this person name and return PubMan results (similar to eDoc). In addition, icons for researcher portfolios are provided for those persons who have a CONE ID. --Ulla 14:36, 18 December 2008 (UTC)
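
For the first open question (import without autosuggest), the proposed check of imported names against both pools could be sketched like this. The normalization rule and the function names are assumptions; a real implementation might prefer fuzzy matching over the exact match on normalized names used here.

```python
import re
from typing import Dict, List, Optional, Tuple


def normalize(name: str) -> str:
    """Lower-case the name, drop periods and collapse whitespace."""
    return re.sub(r"\s+", " ", name.replace(".", " ")).strip().casefold()


def check_import(
    imported: List[str],
    authorized: Dict[str, List[str]],   # PersonID -> all controlled names
    unauthorized: Dict[str, str],       # nameID -> name variant
) -> Dict[str, Tuple[str, Optional[str]]]:
    """Report, per imported name, whether a matching entry already exists."""
    auth_index = {
        normalize(n): pid for pid, names in authorized.items() for n in names
    }
    unauth_index = {normalize(n): nid for nid, n in unauthorized.items()}
    report: Dict[str, Tuple[str, Optional[str]]] = {}
    for name in imported:
        key = normalize(name)
        if key in auth_index:            # controlled names take precedence
            report[name] = ("authorized", auth_index[key])
        elif key in unauth_index:
            report[name] = ("un-authorized", unauth_index[key])
        else:
            report[name] = ("new", None)
    return report


report = check_import(
    ["SMITH, J", "Doe,  John", "Roe, Richard"],
    authorized={"person-1": ["Smith, Jane", "Smith, J."]},
    unauthorized={"name-9": "Doe, John"},
)
```

The report could then drive the message shown to the user, who can still alter the names in the edit item mask.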

Data for CoNE Person

The current namespaces and terms used to describe a CoNE person can be found here.



Comments

  • The researcher portfolio will be presented in English. An extra page in the CONE service will be needed to provide a translation of the researcher portfolio, e.g. in Japanese script.
  • NIMS wants the researcher profile to show who last modified it and when.

External Resources

The following sources can be considered as potential external resources for persons, i.e. for the full names of persons (authors, editors, referees, etc.):

  • Library of Congress Name Authority Service
    • Scope: To be evaluated in detail (likely not to cover too many MPG authors)
    • Info: Introduction; WSDL http://authorities.loc.gov
    • Formats supported: MARCXML
    • Interfaces: SOAP, WSDL
    • Costs: Records are free of charge[1]
    • Access: via web site
  • Personennormdatei (PND)
    • Scope: ca. 2.6 million names (1 million with individualized records). To be evaluated in detail (likely not to cover too many MPG authors)
    • Info: Introduction
    • Formats supported: MAB2, USMARC, SUTRS
    • Interfaces: Z39.50
    • Costs: PND, GKD and SWD are only available in combination (costs)
    • Access: CD-ROM (2 CDs) as cumulative new editions, published twice a year in January and July. PND is licensed in MPS; the database is available via the Aleph server[2]
  • Virtual International Authority File (VIAF)
    • Scope: First prototype covers LC and DNB personal name authority and related bibliographic records
    • Info: project web site
    • Formats supported: MARC21 (?)
    • Access: Prototype system available at http://viaf.org
  • Computer Science Bibliography (DBLP)
    • Scope: Computer Science
    • Info: http://dblp.uni-trier.de
    • Formats supported: HTML, XML
  • Wikipedia Persondata
    • Info: info, data dump
    • Formats supported: HTML

Related links

Good overview of standards in use (international standards, library-derived systems, commercial systems)

  1. Information from web site: "users do not have to register or request permission to search, save, print, or email the LC authority records. The only limitation is that authority records may only be saved, printed or emailed one at a time."
  2. https://dev.livingreviews.org/projects/vlib/wiki/AuthFiles