Service for Control of Named Entities

From MPDLMediaWiki
Jump to: navigation, search
eSciDoc SOA
SOAP and REST style interfaces
Service layers
Core services
Context Handler · Item Handler
Container Handler
Organizational Unit Handler
User Account Handler
Authentication
Content Model Handler
Semantic Store Handler
Intermediate services
Validation Service
Statistics Manager
Technical Metadata extraction
PIDManager
Basket Handler
Duplication detection
ImageHandler(Digilib)

Application services
Depositing
Searching
Search&Export
Control of Named Entities
Citation style Manager
RightsChecking
DataAcquisition
Transformation
Fledged Data
PID Cache
OAI-PMH

SOA Introduction

edit



The purpose of this service is to provide methods to deal with controlled lists of named entities to assure data quality and facilitate data access and data entry. Read more on the background here.

Introduction

The Control of named entities (Cone) service at present supports the named entity types listed below. It is integrated with PubMan in the publication metadata editing forms via "auto suggest" component for respective metadata edit fields (journal name, language of the publication).

Operations

The CoNEControl of Named Entities service supports the following operations:

  • searching (based on the metadata of named entity i.e. in case of journals: journal name, publisher name, place of publication
  • retrieving details of selected named entity e.g. retrieving the metadata of a selected journal based on the journal identifier
Operation Status Input Output Description
query implemented (opt)q:String - the query (see below)
(opt)[predicate]: String - a value for a certain field (see below)
(opt)lang: String - the language as ISOInternational Organization for Standardization 636-1 code
(opt)n: int - The maximum number of results that should be returned
String Scope:Public
Gives back a list of resources/ids in the given language (default 'en') matching the given query. The number of maximum hits given back is configurable (default 50).
details implemented id: String
(opt)lang: String
String Scope:Public
Returns all available information on the resource in the given language (default 'en') identified by the given id.
all implemented (opt)lang: String String Scope:Public
Gives back a list of all resources/ids in the given language (default 'en').

The query parameter q may contain a single term (e.g. q=nature) or multiple terms (e.g. q=psychology therapy). CoNEControl of Named Entities will return those entries that contain all terms in one or more of the searchable fields. Putting the query into quotes e.g. q="John Doe" will cause CoNEControl of Named Entities to search for exact matches, wildcards (*) are allowed. As an alternative to search all fields, it is also possible to search certain fields (predicates) by specifying the predicate name and a query similar to the syntax described above (e.g. foaf:family_name=Miller or http://purl.org/escidoc/metadata/terms/0.1/suffix="*x"). The namespace of the predicate can be given fully or as prefix, prefixes are predefined here.

Interfaces

The CoNEControl of Named Entities service comes with a flexible interface definition that defines the Input/Output formats. The following formats are available

Interface Status Input Output Description
jquery implemented HTTPHyperText Transfer Protocol Get-Request jquery proprietary list, JSONJavaScript Object Notation HTTPServlet used by jquery Javascript components for autosuggest features
options implemented HTTPHyperText Transfer Protocol Get-Request Format that can easily be read by Java to generate JSFJavaServer Faces options HTTPServlet usable by Java a.o.
json implemented HTTPHyperText Transfer Protocol Get-Request pure JSONJavaScript Object Notation HTTPServlet usable by Javascript/AJAX components
html implemented HTTPHyperText Transfer Protocol Get-Request (X)HTMLHypertext Markup Language HTTPServlet that provides a human readable overview on the data (planned to be used for author pages)
rdf implemented HTTPHyperText Transfer Protocol Get-Request RDFResource Description Framework/XMLExtensible Markup Language HTTPServlet that provides a RDFResource Description Framework view on the data
unapi planned HTTPHyperText Transfer Protocol Get-Request varying HTTPServlet that conforms to the unAPISimple Interface for API interface specification
vcard planned HTTPHyperText Transfer Protocol Get-Request ASCIIAmerican Standard Code for Information Interchange(UTFUnicode Transformation Format-8 for single values possible using BASE64 encoding?) HTTPServlet that conforms to the vCard specification (RFC2425)

Supported Named Entity Types

Currently, we support following vocabularies:

Journals

Languages

  • url to the language service interface (to be done)
  • metadata description (to be done)
  • example urls to the service (to be done)
  • Functional Specification to handle the entity (to be done)

Persons

DDCDisplay Data Channel

  • Metadata description (to be done)
  • Functional Specification for handling entity (to be done)
  • Currently implemented: Dewey Decimal Classification

Mimetypes

A list of the IANAInternet Assigned Numbers Authority mimetype list plus additional mimetypes needed for PubManPublication Management (currently used for validation).

  • Metadata description (to be done)
  • Functional description for handling the entity (to be done)

eSciDocEnhanced Scientific Documentation mimetypes

A subset of the mimetypes list above. Planned to be used for PubManPublication Management in the future.

  • Metadata description (to be done)
  • Functional description for handling the entity (to be done)

Candidates for Named Entity Types

Potential further candidates for normalized metadata entries:

MPGMax-Planck-Gesellschaft Units

Potential metadata elements

Complete name In Englisch and/or German?

Alternative name

Place

Address

Homepage

Resources

Name of service Scope Info Formats supported Interfaces Costs Access
Körperschaftsnormdatei (GKDGemeinsame Körperschaftsdatei) More than 1 mio records (german&foreign corporate bodies and conferences) Introduction MAB2 Z39.50 see PNDPersonen Normdatei see PNDPersonen Normdatei

Conferences/events

Potential metadata elements

Title The name of the event (e.g. Symposium on Theory of Computing)

Alternative title Any alternative name of the event

Abbreviation Abbreviated name of the event (e.g. STOC)

Start date Start date of the event

End date End date of the event

Place Place where the event took place

Invitation status The information if the creator was explicitly invited

Remark Sabine: Should this information be stored in controlled metadata record?
Remark Ulla: No, not to my understanding
To my understanding, the invitation status can only be specified for each talk individually and is therefore no generic metadata for the conference --Inga 12:19, 28 December 2007 (CETCentral European Time)