ESciDoc Developer Workshop 2009-07-21

ESciDoc

Date: 21.07.2009 Start time: 14:30

Location: Karlsruhe, München (Video conference or TelCo)

Participants MPDL: Natasa Bulatovic, Wilhelm Frank

Participants FIZ: Harald Kappus

=workshop=
 * Previous workshop
 * ESciDoc_Developer_Workshop_2009-06-30


 * Next workshops
 * ESciDoc_Developer_Workshop_2009-07-28 will be skipped
 * ESciDoc_Developer_Workshop_2009-08-04 will be skipped
 * ESciDoc_Developer_Workshop_2009-08-11
 * ESciDoc_Developer_Workshop_2009-08-18

=Agenda=

to discuss

 * externalize xslt-s to DC metadata
 * MPDL needs new DC transformation in place
 * filters: latest release + latest versions
 * talk about basket issue
 * user group enhancements 1.2 clear?
 * 1.1.1 release available?

outcome

 * xslt-s for mapping to oai_dc are already "externalized" . A datastream in respective Fedora content model object needs to be added.
 * filters are not resolved, the issue on latest release + latest versions has to be reconsidered
 * on Baskets: MPDL needs to provide functional specification before talking on implementation
 * not clear if baskets should only be set of references or should contain (partial) copies of the object's metadata
 * what info should be shared for items referenced via baskets?
 * FIZ proposes to have basket as set of simple references
 * functional requirements have to reveal if there is a need for extension of filter methods with returning results or there is a need to have separate Basket Handler
 * MPDL considered containers as these mostly contain desired functionality
 * FIZ thinks containers are not needed as these are mostly resources and baskets in general are not resources
 * MPDL has to re-discuss the purpose of baskets - when they need to be persisted (not session level baskets)
 * user group enhancements clarified
 * FIZ points that the AA becomes much more complex
 * 1.1.1 release is not available, two serious ingestion bugs need to be fixed before releasing it
 * see INFR-734
 * second "bug" is a performance issue with large-scale ingests where we observe continuously increasing ingest times

V1.2

 * 1) surrogate items
 * 2) released items only
 * 3) further developments
 * 4) list of enhancements here
 * 5) status
 * Solr interfaces (FIZ, MPDL)?

Outcome

 * a list of ehnancements may be better addressed in the upcoming workshop
 * FIZ provided some input on critical for 1.2
 * Ontology manager (MPDL input: it is no big extension if Relations are implemented, it is only up to enabling the Ontologies to be published, rather then keeping them in the source code only)
 * versioning of large containers
 * Policy editor - scheduled, but not necessarily needed (MPDL agrees)
 * MPDL will check again and provide feedback


 * Surrogate items
 * FIZ informed that these are almost implemented
 * during discussion it seems there are different understanding of the concept of how to handle and what is a surrogate item at FIZ internally but also between FIZ/MPDL
 * FIZ will put the concept into Colab and MPDL will comment relating it to functional requirements

eSciDoc Developer Workshop

 * Date: München, 29-30.07.2009

Here is a list of potential topics for the meeting (compiled by Malte and Matthias): Topics:

1 Discussions

 * What have we achieved?


 * What makes eSciDoc special, compared to other solutions (e.g., Fedora, Islandora, Jackrabbit)?


 * What important features are still missing?


 * What do we want to achieve over the next 12/24 months?


 * short-term 6 months (by end of the year) MPDL / FIZ to discuss on the roadmap in more details --Natasa 10:14, 21 July 2009 (UTC)
 * How do we want to evolve from a funded project to a community project?


 * How do we see the collaboration of MPDL and FIZ in the future?

Timing: 1.30 hours

2 Fundamental changes

 * Fundamental changes
 * Ideas from FIZ Team:
 * dropping SOAP?
 * Replace atomistic model for Items/Components with copound model and RELS-INT
 * Replace DB-Cache with asynchonous Lucene Index and/or Object Database
 * Remove mapping of "escidoc" MD-record to DC record in Components (set title directly)
 * Get rid of content-model-specific properties (my personal "Ceterum censeo Carthaginem esse delendam" ;-)
 * Drop latest-release and latest-version from object representation
 * issues/problems in the current core architecture (input from MPDL team, FIZ Team)
 * issues/problems in the current solutions architecture (input from FIZ team, MPDL Team)

3 hours

Day 2
Specifics
 * Search and administrative search (additionally date indexes)
 * Admin Tools development
 * Large sets of data ingest - how to avoid downtime to recache and reindexing

2 hours

Other


 * Alignment of tools and processes (e.g., Maven)
 * Possible joint code-base, deployment configurations such as e.g. PubMan+core only, Faces+core Only, core only etc. ? --Natasa 10:11, 21 July 2009 (UTC)
 * Improved and harmonized external communication of eSciDoc
 * service names and classification
 * documentation of services
 * clean-up of unused and untested methods from docu--Natasa 10:19, 21 July 2009 (UTC)
 * installation guides
 * which information goes to Colab, which to escidoc.org?
 * eSciDoc Blog? --Natasa 10:12, 21 July 2009 (UTC)
 * Improved and harmonized internal communication of eSciDoc --Natasa 10:12, 21 July 2009 (UTC)
 * Service developments life-cycle
 * service-architecture board (again :), but also important for bringing to a community project)
 * Exchange of staff members for specific developments
 * select mixed smaller team for specific developments--Natasa 10:18, 21 July 2009 (UTC)
 * Joint PubMan training sessions für external organizations

Brainstorming:

Outcome

 * Teams to discuss the agenda again
 * Meeting will start 29.July around 13.00 and finish on 30.July around 16.00 - should be sufficient
 * Teams to discuss if there is need to have separate threads or not
 * MPDL suggests no group separations