Difference between revisions of "PubMan 7 9"
Jump to navigation
Jump to search
Haarlaender (talk | contribs) |
Siedersleben (talk | contribs) |
||
(21 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
This page shall contain every change that is made during a qa release of the version mentioned above. If it's not here, it never happened! | This page shall contain every change that is made during a qa release of the version mentioned above. If it's not here, it never happened! | ||
= PubMan 7. | = PubMan 7.9 Release = | ||
== Affected Servers == | == Affected Servers == | ||
Line 12: | Line 12: | ||
== Coreservice JBoss == | == Coreservice JBoss == | ||
* install new escidoc-core-1.3.10-SNAPSHOT-build202.ear, fedoragsearch.war, srw.war from /home/siedersleben/escidoc-core-1.3.10-SNAPSHOT-build202.ear; remove version specifier .. | |||
* install new pdf-extraction to JBOSS_HOME/server/default/conf from /home/siedersleben/pdf-extraction/ | |||
* test new pdf extraction calling from command line | |||
/usr/bin/java -classpath /usr/share/jboss/server/default/conf/pdf-extraction/classes:/usr/share/jboss/server/default/conf/pdf-extraction/lib/itextpdf-5.5.1.jar de.mpg.escidoc.services.extraction.ExtractionChain <inputfile> <outputfile> | |||
check output file by 'file -b' for UTF-8 format | |||
== Core Infrastructure == | == Core Infrastructure == | ||
* upgrade pdftotext | * upgrade pdftotext to version 3.02 | ||
== Core Properties == | == Core Properties == | ||
Line 30: | Line 37: | ||
https://subversion.mpdl.mpg.de/repos/common/trunk/common_services/framework_access/src/main/resources/escidocOaipmhXmlToLucene.xslt | https://subversion.mpdl.mpg.de/repos/common/trunk/common_services/framework_access/src/main/resources/escidocOaipmhXmlToLucene.xslt | ||
* Configuration for indexing: | |||
in JBOSS_HOME/conf/search/config/index/escidoc_all und ../item_container_admin activate in index.properties | |||
#Use this property for bulk index operations: the index is hold in memory until ramBufferSize is reached. | |||
#Make sure this property does not conflicts with fgsindex.maxBufferedDocs. | |||
fgsindex.ramBufferSizeMb = 256 | |||
in JBOSS_HOME/conf/search/config/index/item_container_admin/index.object-types.properties set index mode to asynchron: | |||
Resource.Item.indexAsynchronous=true | |||
Revert the modifications when reindex has finished. | |||
== PubMan EAR == | == PubMan EAR == | ||
neues ear: | |||
wget http://rd.mpdl.mpg.de/nexus/service/local/repositories/releases/content/de/mpg/escidoc/pubman/wildfly/pubman_ear/7.9.1/pubman_ear-7.9.1-pure.ear | |||
== PubMan Properties == | == PubMan Properties == | ||
Line 41: | Line 65: | ||
== PubMan Wildfly== | == PubMan Wildfly== | ||
* Update to WildFly 8.2 | |||
** http://wildfly.org/downloads/ | |||
== PubMan PidCache == | == PubMan PidCache == | ||
== eSciDoc-OAI-Provider == | == eSciDoc-OAI-Provider == | ||
* update content-model oai-stylesheet | |||
** replace internal DC-Mapping URL with external once | |||
* run FOXML transformation script ( <code>java -jar foxml_transformation_service-0.0.1-SNAPSHOT.jar FoxmlXslTransformation /opt/fedora/data/objects/ /home/walter/dc_stylesheet_foxmls.xsl &> foxml_transformation.log</code> ) | |||
** LIVE SETTINGS! | |||
** fedora service must be reachable | |||
* drop and recreate fedora database | |||
<syntaxhighlight lang="postgresql"> | |||
psql -U postgres | |||
\list | |||
DROP DATABASE "fedora3"; | |||
DROP DATABASE "riTriples"; | |||
CREATE DATABASE "fedora3" WITH ENCODING='UTF8' OWNER="fedoraAdmin"; | |||
CREATE DATABASE "riTriples" WITH ENCODING='SQL_ASCII' OWNER="fedoraAdmin" TEMPLATE=template0; | |||
\quit | |||
</syntaxhighlight> | |||
* rebuild fedora database | |||
** /opt/fedora/server/bin/fedora-rebuild.sh | |||
*** rebuild database (2 -> 1) fedora3 database | |||
*** rebuild index (1 -> 1) ritriples database | |||
* reindex (escidocoaipmh_all) | |||
* add OpenAire set-definition to eSciDoc | |||
** PUT /oai/set-definition | |||
<syntaxhighlight lang="xml"> | |||
<?xml version="1.0" encoding="UTF-8"?> | |||
<set-definition xmlns="http://www.escidoc.de/schemas/setdefinition/0.1"> | |||
<properties xmlns:prop="http://escidoc.de/core/01/properties/"> | |||
<prop:name>OpenAire_1</prop:name> | |||
<prop:description>Set definition for OpenAire</prop:description> | |||
</properties> | |||
<specification>OpenAire_1</specification> | |||
<query>((escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/pre\-print" OR escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/post\-print" OR escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/publisher\-version" OR escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/any\-fulltext") AND escidoc.component.internal-managed.visibility="public") OR escidoc.publication.project-info.identifier>''</query> | |||
</set-definition> | |||
</syntaxhighlight> | |||
* clear proai cache files | |||
* clear proai database | |||
* replace escidoc_oaiprovider with version from QA(runnable in Wildfly) | |||
** adjust properties for LIVE | |||
== AA == | == AA == | ||
== Validation Database == | == Validation Database == | ||
* new validation rules added --> delete validation DB to adapt all changes | |||
== Migration database == | == Migration database == | ||
Line 54: | Line 122: | ||
== CoNE == | == CoNE == | ||
* New models funding-organizations and funding-programs | * New models funding-organizations and funding-programs | ||
* Fill with | * Fill with relevant data for Open Aire Autosuggest (FP7, H2020, Wellcome Trust) | ||
==eSciDoc Admin== | ==eSciDoc Admin== |
Latest revision as of 13:07, 24 March 2015
This page shall contain every change that is made during a qa release of the version mentioned above. If it's not here, it never happened!
PubMan 7.9 Release[edit]
Affected Servers[edit]
Prepare read only system[edit]
Fedora[edit]
Coreservice Apache[edit]
Coreservice JBoss[edit]
- install new escidoc-core-1.3.10-SNAPSHOT-build202.ear, fedoragsearch.war, srw.war from /home/siedersleben/escidoc-core-1.3.10-SNAPSHOT-build202.ear; remove version specifier ..
- install new pdf-extraction to JBOSS_HOME/server/default/conf from /home/siedersleben/pdf-extraction/
- test new pdf extraction calling from command line
/usr/bin/java -classpath /usr/share/jboss/server/default/conf/pdf-extraction/classes:/usr/share/jboss/server/default/conf/pdf-extraction/lib/itextpdf-5.5.1.jar de.mpg.escidoc.services.extraction.ExtractionChain <inputfile> <outputfile>
check output file by 'file -b' for UTF-8 format
Core Infrastructure[edit]
- upgrade pdftotext to version 3.02
Core Properties[edit]
Core Index Properties[edit]
Core Lucene Index[edit]
- Use new indexing stylesheets (contain fix for indexing the visibility of files/locators):
- Also use new indexing stylesheet for OAI-PMH index (contains new index fields for OpenAire Queries)
- Configuration for indexing:
in JBOSS_HOME/conf/search/config/index/escidoc_all und ../item_container_admin activate in index.properties
#Use this property for bulk index operations: the index is hold in memory until ramBufferSize is reached. #Make sure this property does not conflicts with fgsindex.maxBufferedDocs. fgsindex.ramBufferSizeMb = 256
in JBOSS_HOME/conf/search/config/index/item_container_admin/index.object-types.properties set index mode to asynchron:
Resource.Item.indexAsynchronous=true
Revert the modifications when reindex has finished.
PubMan EAR[edit]
neues ear:
PubMan Properties[edit]
- escidoc.pubman_presentation.overview_page.authors_ou
- Max Planck Society for MPG-Sites
- Root-Organization for other use cases
PubMan Apache[edit]
PubMan Wildfly[edit]
- Update to WildFly 8.2
PubMan PidCache[edit]
eSciDoc-OAI-Provider[edit]
- update content-model oai-stylesheet
- replace internal DC-Mapping URL with external once
- run FOXML transformation script (
java -jar foxml_transformation_service-0.0.1-SNAPSHOT.jar FoxmlXslTransformation /opt/fedora/data/objects/ /home/walter/dc_stylesheet_foxmls.xsl &> foxml_transformation.log
)- LIVE SETTINGS!
- fedora service must be reachable
- drop and recreate fedora database
psql -U postgres
\list
DROP DATABASE "fedora3";
DROP DATABASE "riTriples";
CREATE DATABASE "fedora3" WITH ENCODING='UTF8' OWNER="fedoraAdmin";
CREATE DATABASE "riTriples" WITH ENCODING='SQL_ASCII' OWNER="fedoraAdmin" TEMPLATE=template0;
\quit
- rebuild fedora database
- /opt/fedora/server/bin/fedora-rebuild.sh
- rebuild database (2 -> 1) fedora3 database
- rebuild index (1 -> 1) ritriples database
- /opt/fedora/server/bin/fedora-rebuild.sh
- reindex (escidocoaipmh_all)
- add OpenAire set-definition to eSciDoc
- PUT /oai/set-definition
<?xml version="1.0" encoding="UTF-8"?>
<set-definition xmlns="http://www.escidoc.de/schemas/setdefinition/0.1">
<properties xmlns:prop="http://escidoc.de/core/01/properties/">
<prop:name>OpenAire_1</prop:name>
<prop:description>Set definition for OpenAire</prop:description>
</properties>
<specification>OpenAire_1</specification>
<query>((escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/pre\-print" OR escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/post\-print" OR escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/publisher\-version" OR escidoc.component.internal-managed.content-category="http://purl.org/escidoc/metadata/ves/content\-categories/any\-fulltext") AND escidoc.component.internal-managed.visibility="public") OR escidoc.publication.project-info.identifier>''</query>
</set-definition>
- clear proai cache files
- clear proai database
- replace escidoc_oaiprovider with version from QA(runnable in Wildfly)
- adjust properties for LIVE
AA[edit]
Validation Database[edit]
- new validation rules added --> delete validation DB to adapt all changes
Migration database[edit]
CoNE[edit]
- New models funding-organizations and funding-programs
- Fill with relevant data for Open Aire Autosuggest (FP7, H2020, Wellcome Trust)