Difference between revisions of "Talk:ViRR Scope"

From MPDLMediaWiki
Jump to navigation Jump to search
Line 21: Line 21:


'''ToDo (Discussion):'''
'''ToDo (Discussion):'''
* ingestion of bibliographic data: either mapping MAB to eSciDoc XML, to Dublin Core, to MODS, or enter data manually (evaluation by inga?)
* Ingestion of bibliographic data: either mapping MAB to eSciDoc XML, to Dublin Core, to MODS, or enter data manually (evaluation by inga?)
* relation to eSciDoc METS profile (first draft based on Ingas mapping ebinds<=>METS)?
* Relation to eSciDoc METS profile (first draft based on Ingas mapping ebinds <=> METS)?
 
:''Already some input from Inga:
:Generic Mapping MAB => eSciDoc is not necessarily of great help: Each MAB user uses his own adapted MAB and using MAB means running into severe character set problems. If a mapping is needed, mapping to DC or MODS might be sufficient. In addition, instead of mapping, manual data entry should be considered, especially when dealing only with 2 books. In any case, a decision about the "eSciDoc VIRR profile" might be needed, as genre types and current PubMan Metadata won't cover the ViRR material. ''
 
* Improve quality of image files (based on TIFFs) => for improved thumbnails + additional resolution for web presentation => check concrete requirements with Institute ("schwarze Ränder" on TIFFs would have to be done by institute). Check requirements for resolution needed by digilib
* Functional prototype for Display and Browsing
* Start collecting requirements for the viewing environment [[Digilib#Requirements_for_Solutions|DigiLib]] and set up meeting with user group (Contact@FIZ: Frank Schwichtenberg) (Kristina, Tobias)


''Already some input from Inga: Generic Mapping MAB=> eSciDoc is not necessarily of great help: Each MAB user uses his own adapted MAB and using MAB means running into severe character set problems. If a mapping is needed, mapping to DC or MODS might be sufficient. In addition, instead of mapping, manual data entry should be considered, especially when dealing only with 2 books. In any case, a decision about the "eSciDoc VIRR profile" might be needed, as genre types and current PubMan Metadata won't cover the VIrr material. ''
* improve quality of image files (based on TIFFs) => for improved thumbnails + additional resolution for web presentation => check concrete requs with Institute ("schwarze Ränder" on TIFFs would have to be done by institute). Check requirements for resolution needed by digilib
* functional prototype for Display and browsing
* start collecting requirements for the viewing environment [[Digilib#Requirements_for_Solutions|DigiLib]] and set up meeting with user group (Contact@FIZ: Frank Schwichtenberg) (Kristina, Tobias)


=== Release two ===
=== Release two ===
Line 43: Line 46:
*# Shame
*# Shame
* prepare requirements for FIZ for the METS integration
* prepare requirements for FIZ for the METS integration


=== Release three ===
=== Release three ===
Line 57: Line 61:
#* simple search (one search field "any field")
#* simple search (one search field "any field")
#* advanced search (several special search fields, e.g. one for title, one for author)
#* advanced search (several special search fields, e.g. one for title, one for author)


=== Release four ===
=== Release four ===

Revision as of 11:08, 7 February 2008

ToDo's and comments for each Release:

FIRST PHASE - Publication of the digital collection[edit]

General ToDos:

  • Evaluation of the results from JHOVE on the images.

Release one[edit]

  1. Ingestion
    • scans --> derive from file structure a first skeleton of toc (name of book, sequence of pages)
    • bibliographic metadata (currently in MAB) - either entered manually or ingested
    • derive basic keywords from title of book (to be checked with institute)
  2. Browsing (basic)
    • browsing tree (books and pages), sorted alphabetically by title
  3. Display (basic)
    • thumbnail lists
    • basic bibliographic metadata (name of book, page)
    • scans


ToDo (Discussion):

  • Ingestion of bibliographic data: either mapping MAB to eSciDoc XML, to Dublin Core, to MODS, or enter data manually (evaluation by inga?)
  • Relation to eSciDoc METS profile (first draft based on Ingas mapping ebinds <=> METS)?
Already some input from Inga:
Generic Mapping MAB => eSciDoc is not necessarily of great help: Each MAB user uses his own adapted MAB and using MAB means running into severe character set problems. If a mapping is needed, mapping to DC or MODS might be sufficient. In addition, instead of mapping, manual data entry should be considered, especially when dealing only with 2 books. In any case, a decision about the "eSciDoc VIRR profile" might be needed, as genre types and current PubMan Metadata won't cover the ViRR material.
  • Improve quality of image files (based on TIFFs) => for improved thumbnails + additional resolution for web presentation => check concrete requirements with Institute ("schwarze Ränder" on TIFFs would have to be done by institute). Check requirements for resolution needed by digilib
  • Functional prototype for Display and Browsing
  • Start collecting requirements for the viewing environment DigiLib and set up meeting with user group (Contact@FIZ: Frank Schwichtenberg) (Kristina, Tobias)


Release two[edit]

  1. Editing
    • enrich toc sceleton with information on chapters (i.e. bundles)(e.g. page 1-5 = chapter 1)
    • add metadata about the chapters, e.g. keywords

ToDo (Discussion):

  • editing via simple edit mask or already with METS editor (selection of Editor depends on eSciDoc METS profile)
  • prepare first draft eSciDoc METS profile (based on bibliographic data needed, descriptive data needed)
  • decide on recommended METS online xml editor for the acquisition of structural data (which can also be used offline if possible)
    1. GOOBI for METS
    2. Docworks: Meta-e cooperation with css (is not interesting if it works with an automatic recognition)
    3. Shame
  • prepare requirements for FIZ for the METS integration


Release three[edit]

  1. Browsing (detailed)
    • extension of the alphabetical browsing tree (chapters)
    • chronological navigation on book and/or chapter level?(depends on descriptive Metadata! )
    • paginator (for lists)
    • paging for images (i.e. "im Buch blaettern")
  2. Display (detailed)
    • integration of digilib functionalities (minimum: zoom in, zoom out)
    • dynamic generation and integration of "identification stamp" ("Herkunftsnachweis") on the images (whole image, selected part of image) --> new Digilib requirement
  3. Search
    • simple search (one search field "any field")
    • advanced search (several special search fields, e.g. one for title, one for author)


Release four[edit]

  1. Functional definition of eSciDoc METS profile
    • needed for import / export
  2. Export
    • image selection
    • downloading of selected images(in separate jpgs)
    • downloading of selected images(in one pdf with a cover page)
    • downloading selected part of an image
    • downloading of METS-xml
  3. Display keywords as list (cf. Index in a book)
  4. Persistent Identifier (PID)

SECOND PHASE - Virtual research environment[edit]

Following is a list of requirements to be met...detailed release planning at a later stage.

  • Workflow for edition process of collection, incl. metadata, images, annotations, external sources (upload, editing, annotating, scientific review etc.)
  • User Management to support workflow
  • Fulltext transcription online (offline client at later stage) - in METS
  • Ingestion/Upload of additional books (digital images + bibliographic metadata) - local resources, BBAW-DTA
  • Adding and editing of bibliographic and descriptive metadata
  • Adding annotations / comments
  • Adding relations
  • Integration of external resources (Deutsches Rechtswoerterbuch/Heidelberg)
  • Creation and maintenance of synonyms
  • Offering metadata to the ZVDD(zentrales Verzeichnis digitalisierter Drucke) and other virtual libraries - OAI interface for the exchange of metadata
  • Sitemap protocol for crawlers
  • Integration of research literature for download (bibliographic lists? articles?)
  • Linking to other digital archives / OPACs /research projects
  • Delivery of one complete dataset for the DNB for long term archiving

ToDo (Discussion):

  • Structural analyzes of the data of the Deutsche Rechtswörterbuch
  • Analyzes of the requirements of the ZVDD
  • Text editor for the creation of transcriptions is needed

THIRD PHASE - Productive environment[edit]

  • Preparation of productive environment (hardware, support, policies)
  • Offline tool for image processing to improve image quality
  • Fulltext transcription in TEI?
  • Additional functionality for historisch-kritische Editionsarbeit?
  • Concept ViRR for other local/MPG projects (e.g. Policey-Ordnung)