Difference between revisions of "PubMan 7 9"

Latest revision as of 14:07, 24 March 2015

This page shall contain every change made during a QA release of the version mentioned above. If it's not here, it never happened!

PubMan 7.9 Release

Affected Servers

Prepare read only system

Fedora

Coreservice Apache

Coreservice JBoss

  • install new escidoc-core-1.3.10-SNAPSHOT-build202.ear, fedoragsearch.war, srw.war from /home/siedersleben/escidoc-core-1.3.10-SNAPSHOT-build202.ear; remove version specifier ..
  • install new pdf-extraction to JBOSS_HOME/server/default/conf from /home/siedersleben/pdf-extraction/
  • test the new pdf extraction by calling it from the command line:
 /usr/bin/java -classpath /usr/share/jboss/server/default/conf/pdf-extraction/classes:/usr/share/jboss/server/default/conf/pdf-extraction/lib/itextpdf-5.5.1.jar de.mpg.escidoc.services.extraction.ExtractionChain <inputfile> <outputfile>

Check with 'file -b' that the output file is in UTF-8 format.
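
The encoding check could be scripted roughly as follows. This is a minimal sketch and not part of the release procedure: the function names `is_utf8_description` and `check_extraction_output` are hypothetical, and accepting plain ASCII output (ASCII is a subset of UTF-8) is an assumption.

```shell
#!/bin/sh
# Hypothetical helper: decide whether a description printed by `file -b`
# names a UTF-8 compatible encoding. Accepting ASCII is an assumption,
# justified by ASCII being a subset of UTF-8.
is_utf8_description() {
    case "$1" in
        *extended-ASCII*) return 1 ;;   # e.g. "Non-ISO extended-ASCII text"
        *UTF-8*|*ASCII*)  return 0 ;;
        *)                return 1 ;;
    esac
}

# Hypothetical wrapper: run `file -b` on an extraction output file and
# report the result. Returns 0 if the file looks like UTF-8, 1 otherwise.
check_extraction_output() {
    description=$(file -b "$1") || return 1
    if is_utf8_description "$description"; then
        echo "OK: $1 ($description)"
    else
        echo "NOT UTF-8: $1 ($description)" >&2
        return 1
    fi
}
```

After running the ExtractionChain command, `check_extraction_output <outputfile>` would print the description reported by `file -b` and fail for anything that is not UTF-8 or ASCII.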

Core Infrastructure

  • upgrade pdftotext to version 3.02

Core Properties

Core Index Properties

Core Lucene Index

  • Use new indexing stylesheets (contain fix for indexing the visibility of files/locators):

https://subversion.mpdl.mpg.de/repos/common/trunk/common_services/framework_access/src/main/resources/itemContainerAdminXmlToLucene.xslt

https://subversion.mpdl.mpg.de/repos/common/trunk/common_services/framework_access/src/main/resources/mpdlEscidocXmlToLucene.xslt

  • Also use new indexing stylesheet for OAI-PMH index (contains new index fields for OpenAire Queries)

https://subversion.mpdl.mpg.de/repos/common/trunk/common_services/framework_access/src/main/resources/escidocOaipmhXmlToLucene.xslt

  • Configuration for indexing:

In JBOSS_HOME/conf/search/config/index/escidoc_all and ../item_container_admin, activate the following in index.properties:

 #Use this property for bulk index operations: the index is held in memory until ramBufferSize is reached.
 #Make sure this property does not conflict with fgsindex.maxBufferedDocs.
 fgsindex.ramBufferSizeMb = 256

In JBOSS_HOME/conf/search/config/index/item_container_admin/index.object-types.properties, set the index mode to asynchronous:

 Resource.Item.indexAsynchronous=true

Revert these modifications once the reindex has finished.
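
Because the indexing settings are temporary, it may help to script both the change and the revert. The following is a sketch under assumptions: `set_property`, `backup_properties` and `restore_properties` are hypothetical helpers, the `.pre-reindex` backup suffix is made up for illustration, and the usage lines assume that JBOSS_HOME is set as an environment variable.

```shell
#!/bin/sh
# Hypothetical helper: set KEY=VALUE in a Java-style properties file.
# A line that already holds the key (commented out with '#' or not) is
# replaced; if no such line exists, the property is appended.
set_property() {
    file=$1
    awk -v key="$2" -v value="$3" '
        {
            line = $0
            sub(/^[ \t]*#?[ \t]*/, "", line)    # drop an optional comment marker
            n = split(line, parts, /[ \t]*=/)   # parts[1] is the candidate key
            if (n > 1 && parts[1] == key) { print key "=" value; found = 1 }
            else                          { print $0 }
        }
        END { if (!found) print key "=" value }
    ' "$file" > "$file.new" && mv "$file.new" "$file"
}

# Keep a copy so the change can be reverted after the reindex.
backup_properties()  { cp -p "$1" "$1.pre-reindex"; }
restore_properties() { mv "$1.pre-reindex" "$1"; }

# Possible usage (paths taken from the notes above):
#   idx="$JBOSS_HOME/conf/search/config/index"
#   backup_properties "$idx/escidoc_all/index.properties"
#   set_property "$idx/escidoc_all/index.properties" fgsindex.ramBufferSizeMb 256
#   ... reindex ...
#   restore_properties "$idx/escidoc_all/index.properties"
```

Restoring from a copy, rather than editing the values back by hand, keeps the revert independent of whatever the files contained before the release.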

PubMan EAR

New EAR:

wget http://rd.mpdl.mpg.de/nexus/service/local/repositories/releases/content/de/mpg/escidoc/pubman/wildfly/pubman_ear/7.9.1/pubman_ear-7.9.1-pure.ear

PubMan Properties

  • escidoc.pubman_presentation.overview_page.authors_ou
    • Max Planck Society for MPG-Sites
    • Root-Organization for other use cases

PubMan Apache

PubMan Wildfly

PubMan PidCache

eSciDoc-OAI-Provider

  • update content-model oai-stylesheet
    • replace internal DC-Mapping URL with external one
  • run FOXML transformation script ( java -jar foxml_transformation_service-0.0.1-SNAPSHOT.jar FoxmlXslTransformation /opt/fedora/data/objects/ /home/walter/dc_stylesheet_foxmls.xsl &> foxml_transformation.log )
    • LIVE SETTINGS!
    • fedora service must be reachable
  • drop and recreate fedora database
psql -U postgres

\list

DROP DATABASE "fedora3";
DROP DATABASE "riTriples";

CREATE DATABASE "fedora3" WITH ENCODING='UTF8' OWNER="fedoraAdmin";
CREATE DATABASE "riTriples" WITH ENCODING='SQL_ASCII' OWNER="fedoraAdmin" TEMPLATE=template0;

\quit
  • rebuild fedora database
    • /opt/fedora/server/bin/fedora-rebuild.sh
      • rebuild database (2 -> 1) fedora3 database
      • rebuild index (1 -> 1) ritriples database
  • reindex (escidocoaipmh_all)
  • add OpenAire set-definition to eSciDoc
    • PUT /oai/set-definition
<?xml version="1.0" encoding="UTF-8"?>
<set-definition xmlns="http://www.escidoc.de/schemas/setdefinition/0.1">
    <properties xmlns:prop="http://escidoc.de/core/01/properties/">
        <prop:name>OpenAire_1</prop:name>
        <prop:description>Set definition for OpenAire</prop:description>
    </properties>
    <specification>OpenAire_1</specification>
    <query>((escidoc.component.internal-managed.content-category=&quot;http://purl.org/escidoc/metadata/ves/content\-categories/pre\-print&quot; OR escidoc.component.internal-managed.content-category=&quot;http://purl.org/escidoc/metadata/ves/content\-categories/post\-print&quot; OR escidoc.component.internal-managed.content-category=&quot;http://purl.org/escidoc/metadata/ves/content\-categories/publisher\-version&quot; OR escidoc.component.internal-managed.content-category=&quot;http://purl.org/escidoc/metadata/ves/content\-categories/any\-fulltext&quot;) AND escidoc.component.internal-managed.visibility=&quot;public&quot;) OR escidoc.publication.project-info.identifier&gt;&apos;&apos;</query>
</set-definition>
  • clear proai cache files
  • clear proai database
  • replace escidoc_oaiprovider with the version from QA (runnable in Wildfly)
    • adjust properties for LIVE

AA

Validation Database

  • new validation rules were added; delete the validation DB so that all changes are picked up

Migration database

CoNE

  • New models funding-organizations and funding-programs
  • Fill with relevant data for OpenAire autosuggest (FP7, H2020, Wellcome Trust)

eSciDoc Admin

Data Migration

PubMan Software Homepage

Miscellaneous