Difference between revisions of "Talk:ViRR Scope"

From MPDLMediaWiki
Jump to navigation Jump to search
Line 5: Line 5:
== Release one ==
== Release one ==


# Ingestion (no user interface)
=== Short ToDos Development ===
#* scans --> derive from file structure a basic skeleton of toc
#* bibliographic metadata: [[Talk:ViRR_Metadata#MAB_to_MODS_mapping| MAB mapping to MODS]]
#* structural metadata: eSciDoc container
#* derive basic keywords from bibliographic metadata
# [[ViRR_Browse_and_Display|Browsing and Display]] (basic)
#* alphabetical sorted browsing tree (multi-volume works, parts and pages)
#* display of basic bibliographic metadata (name of book, page)
#* display of scans in detailed view
 
=== Discussion ===
 
:* Ingestion of scans (incl. derivation of a basic skeleton of the books)
:** Naming convention of the files
:**: '''e:''' einband (always e1-e4, two for the front cover and two for the back cover)
:**: '''v:''' vorgebundene, nichtbedruckte Seiten (always after the front cover)
:**: '''d:''' derivate (content pages)
:**: '''n:''' nachgebundenen Seiten (always after the content pages)
:** Basic skeleton already includes three levels:
:**# multi-volume work, e.g. "Vollständiges corpus gravaminum evangelicorum"
:**# part (volume) [https://zim01.gwdg.de/repos/smc/trunk/03_Functional_Description/02_Scenarios_Concepts/Concepts/Virtueller%20Raum%20Reichsrecht/Beispieldateien/dt9bg20q%5b7-8%5d%20-%20gekuerzt/dt9bg20q%5b7-8%5d/dt9bg20q%5b7-8%5d_druck1=d0001.jpg Vollständiges corpus gravaminum evangelicorum. Band 7]
:**# pages
:* Ingestion of bibliographic data: mapping of [[Talk:ViRR_Metadata#MAB_to_MODS_mapping|MAB to MODS]]
:* Rework / clarification of the container format in eSciDoc (dev. Team) as basis for the [[Metadata Encoding and Transmission Standard|METS]] profile
:** Relation between the eSciDoc container format and the METS profile for ViRR (first draft based on Ingas mapping ebinds <=> METS?)
:* Start collecting requirements for the viewing environment [[Digilib#Requirements_for_Solutions|DigiLib]] and set up meeting with user group (Contact@FIZ: Frank Schwichtenberg) (Kristina, Tobias)
 
=== Functional Prototype ===
 
The functional Prototype for R1 should enable the following user goals:
# User wants to get an overview on the digital content
# User wants to browse within a multivolume work
# User wants to browse one volume in detail
# User wants to go to certain page within one volume (i.e. logical page)
# User wants to view the back cover of a volume
# User is not satisfied with selected volume and wants to switch to another volume
# User is not satisfied with selected multivolume and wants to switch to another multivolume
 
=== Short ToDo Developement ===


*ingest two multi-volumes
*ingest two multi-volumes
Line 72: Line 34:


== Release two ==
== Release two ==
# [[ViRR_Editing|Editing]]
#* enrich toc skeleton with information on chapters (i.e. bundles)(e.g. page 1-5 = chapter 1)
#* add metadata about the chapters, e.g. keywords
=== Discussion ===
:* Editing via simple edit mask or already with [[Metadata Encoding and Transmission Standard|METS]] editor (selection of Editor depends on eSciDoc METS profile)?
:* Prepare a first draft of the eSciDoc METS profile (based on bibliographic data needed, descriptive data needed), based on the basic METS metadata required by the DFG viewer
:* Decide on recommended METS online xml editor for the acquisition of structural data (which can also be used offline if possible)
:*# [[Goobi|GOOBI]] for METS (expectations of the institute: creation of METS conform XML files)
:*# [http://www.ccs-gmbh.de/de/digitization.htm Docworks]: Meta-e cooperation with css (is not interesting if it works with an automatic recognition)
:* Prepare requirements for FIZ for the METS integration
=== Functional Prototype ===
The functional Prototype for R2 should enable the following user goals:
# (Multi)Volumes
#* The user wants to enhance the metadata of a multivolume or a volume
# Chapters
#* The user wants to define chapters as a sum of separate pages (mark which pages belong together in form of a chapter)
#* The user wants to number serially the defined chapters
#* The user wants to define metadata for each chapter
# Pages
#* The user wants to assign an order label to each page
#* The user wants to assign a page label to some pages




== Release three ==
== Release three ==
# [[ViRR_Browse_and_Display|Browsing and Display]] (detailed)
#* extension of the alphabetical browsing tree (chapters)
#* systematical browsing tree as alternative entrance to the collection
#* paginator (for lists)
#* paging for images (i.e. "im Buch blaettern")
#* integration of [[Digilib]] functionalities (minimum: zoom in, zoom out)
#* dynamic generation and integration of "identification stamp" ("Herkunftsnachweis") on the images (whole image, selected part of image) --> new [[Digilib]] requirement
# [[ViRR_Searching|Search]]
#* simple search (one search field "any field")
#* advanced search (several special search fields, e.g. one for title, one for author)




== Release four ==
== Release four ==
# Functional definition of eSciDoc METS profile
#* needed for import / export
# [[ViRR_Export|Export]]
#* image selection
#* downloading of selected images(in separate jpgs)
#* downloading of selected images(in one pdf with a cover page)
#* downloading selected part of an image
#* downloading of METS-xml
# Display [[ViRR_Keywords_and_Synonyms|keywords]] as list (cf. Index in a book)
# [[ViRR_Persistent_Identification|Persistent Identifier]] (PID)
== Release ??? ==
'''required for DFG'''
# Collection description
# URN handling (in the context of an assignment of parts to a multi-volume work)
'''ToDo (Discussion):'''
:* Evaluation of the results from JHOVE on the images.




= SECOND PHASE - Virtual research environment =
= SECOND PHASE - Virtual research environment =
Following is a list of requirements ...detailed release planning will come at a later stage.
* Ingestion of all 15 available books
* Workflow for edition process of collection, incl. metadata, images, annotations, external sources (upload, editing, annotating, scientific review etc.)
* [[ViRR_User_Management|User Management]] to support workflow
* Fulltext [[ViRR_Transcriptions|transcription]] online (offline client at later stage) - in METS
* [[ViRR_Ingestion|Ingestion/Upload]] of additional books (digital images + bibliographic metadata) - local resources, BBAW-DTA
* Adding and editing of bibliographic and descriptive metadata
* Adding annotations / comments
* Adding [[ViRR_Relations|relations]]
* Integration of external resources (Deutsches Rechtswoerterbuch/Heidelberg)
* Creation and maintenance of synonyms
* Offering metadata to the [http://www.zvdd.de/ ZVDD](zentrales Verzeichnis digitalisierter Drucke) and other virtual libraries - OAI interface for the exchange of metadata
* Sitemap protocol for crawlers
* Integration of research literature for download (bibliographic lists? articles?)
* Linking to other digital archives / OPACs /research projects
* Delivery of one complete dataset for the DNB for long term archiving
'''ToDo (Discussion):'''
:* Structural analyzes of the data of the Deutsche Rechtswörterbuch
:* Analyzes of the requirements of the ZVDD
:* Text editor for the creation of transcriptions is needed




= Productive environment =
= Productive environment =
Each phase can be go productive after it is finished!
* Preparation of productive environment (hardware, support, policies)
* Offline tool for image processing to improve image quality
* Fulltext transcription in TEI?
* Additional functionality for historisch-kritische Editionsarbeit?
* Concept ViRR for other local/MPG projects (e.g. Policey-Ordnung)

Revision as of 14:02, 5 May 2008

ToDo's and comments for each Release:

FIRST PHASE - Publication of the digital collection[edit]

Release one[edit]

Short ToDos Development[edit]

  • ingest two multi-volumes
  • ingest two books into multi-vol1
  • ingest one book into multi-vol2
    • create Context VIRR
    • create User VIRR_user who has Depositor, MD-Editor, Moderator role for context VIRR
    • create ContentModel "MultiVolume"
    • create 2 containers with content model "MultiVolume"
      • each container has 2 metadata records (MAB format, MODS format)
    • create ContentModel "ScannedBook" (or find better name)
    • create 3 containers with content model "ScannedBook" (or find better name)
      • each container has 2 metadata records (MAB format, MODS format)
    • add "ScannedBook" containers to "MultiVolume" containers
    • create ContentModel "ScannedBookPage" (or find better name)
    • create Items with "ScannedBookPage"
      • each created item has a metadata record in MODS format+extra metadata (to check respective MODS metadata to put the Page sequence)
      • each created item should have 2 components
        • component: content-category="original", visibility="public"
        • component: content-category="thumbnail", visibility="public"
    • submit, release the 5 containers and all Items
  • for all users (no AA included) implement "browsing" GUI (simple tree for books) where top-level nodes are containers and second-level nodes are Items
    • workaround needed: for sorting by page number on the browsing interface already (not certain if we can simply take the order as it is stated in the container's struct map for start).
      • In respect to a possible requirement to see one page after another one may use content-relations to link pages. E.g. <container> <firstPage> <item1> and <item1> <nextPage> <item2> and <item2> <nextPage> <item3> etc. Because if one can get the order form a list (or TOC) of the container that information is not (directly) available when displaying a single page-item but there should certainly be a "next"-button. Frank 15:10, 12 March 2008 (CET)
  • for all users (no AA included) implement "view-container/item" GUIs (for containers it displays the container metadata, for items it displays the item metadata and the image thumbnail as image on GUI) (link to component "original" should be provided for download).
  • (optionally, to decide when most of these things are done)-> to display the original image (or a working version of the original image via Digilib server?)


Release two[edit]

Release three[edit]

Release four[edit]

SECOND PHASE - Virtual research environment[edit]

Productive environment[edit]