This page shall contain every change that is made during a qa release of the version mentioned above. If it's not here, it never happened!

PubMan 7.6 Release[edit]

Affected Servers[edit]

Prepare read only system[edit]

Fedora[edit]

Coreservice Apache[edit]

Coreservice JBoss[edit]

adapt log4j.xml according qa-coreservice

Core Infrastructure[edit]

Core Properties[edit]

escidoc-core.properties: remove the pdf extractor properties from escidoc-core.properties and put the corresponding properties to fedoragsearch.properties

# true|false Defines what happenes if an Exception occurs while extracting the text from an pdf for indexing
# if set to true, Exception is ignored and object is indexed without the fulltext.
# if set to false, Exception is thrown and object is not indexed at all.
gsearch.ignoreTextExtractionErrors = true
# Location of the indexingStylesheet that generates the indexInformation-Document for gsearch-indexing.
# has to be an URL
# currently the eSciDoc-Core-Infrastructure provides 2 index-databases: escidoc_all and escidocou_all
# stylesheet-path-property for index escidoc_all is gsearch.escidoc.indexingStylesheet
# stylesheet-path-property for index escidoc_all is gsearch.escidocou.indexingStylesheet
#gsearch.escidoc.indexingStylesheet = http://escidoc1.escidoc.mpg.de/resources/searchIndexDefinition/mpdlEscidocXmlToLucene_1.2.xslt
gsearch.escidoc.indexingStylesheet = http://coreservice.mpdl.mpg.de/mpdlEscidocXmlToLucene.xslt
gsearch.escidocou.indexingStylesheet = 
# if pdfBox (internally used by gsearch to extract text from pdfs) is not working well for your pdfs,
# define command-line-command to custom pdf-text-extractor (has to get installed seperately)
# define command with full path, define inputfile with <inputfile> and outputfile with <outputfile>
#example: C:/Programme/xpdf-3.02pl2-win32/pdftotext -cfg C:/Programme/xpdf-3.02pl2-win32/xpdfrc <inputfile> <outputfile>
# gsearch.pdfTextExtractorCommand = /usr/bin/pdftotext -cfg /etc/xpdfrc <inputfile> <outputfile>
gsearch.pdfTextExtractorCommand = /usr/bin/java -classpath /usr/share/jboss/server/default/conf/pdf-extraction/classes:/usr/share/jboss/server/default/conf/pdf-extraction/lib/iText-5.0.6.jar de.mpg.escidoc.services.extraction.ExtractionChain <inputfile> <outputfile>
# Analyzer to use for indexing and search
lucene.analyzer = de.escidoc.sb.common.lucene.analyzer.EscidocAnalyzer

add the following properties to JBOSS_HOME/conf/search/config/fedoragsearch.properties:

# if pdfBox (internally used by gsearch to extract text from pdfs) is not working well for your pdfs,
# use a command-line tool.
# If you want to use a command-line tool,
# define command-line-command to custom pdf-text-extractor (has to get installed seperately)
# define command with full path, define inputfile with <inputfile> and outputfile with <outputfile>
#example: C:/Programme/xpdf-3.02pl2-win32/pdftotext -cfg C:/Programme/xpdf-3.02pl2-win32/xpdfrc <inputfile> <outputfile>
fedoragsearch.pdfTextExtractorCommand=/usr/bin/java -classpath /usr/bin/java -classpath /usr/share/jboss/server/default/conf/pdf-extraction/classes:/usr/share/jboss/server/default/conf/pdf-extraction/lib/itextpdf-5.5.1.jar de.mpg.escidoc.services.extraction.ExtractionChain <inputfile> <outputfile>

# true|false Defines what happens if an Exception occurs while extracting the text from an pdf for indexing
# if set to true, Exception is ignored and object is indexed without the fulltext.
# if set to false, Exception is thrown and object is not indexed at all.
fedoragsearch.ignoreTextExtractionErrors=true

copy directory pdf-extraction and pdfbox-app-1.8.6.jar (e.g. from dev-pubman) to JBOSS_HOME/conf

add new property for skipping reindex to escidoc-core.properties (best to line ~ 60 where other indexing properties are configured)

# Comma separated List of method names, where automatic indexing is skipped
escidoc-core.skip.notify.indexer.methods = assignObjectPid, assignVersionPid

Core Index Properties[edit]

Core Lucene Index[edit]

PubMan EAR[edit]

PubMan Properties[edit]

escidoc.transformation.wos.stylesheet.filename=/usr/share/jboss/server/default/conf/transformation/transformations/otherFormats/xslt/wosxml2escidoc.xslt

& check if transformation exists

escidoc.framework_access.framework.url=http://localhost:8080 (instead of coreservice)

PubMan Apache[edit]

Add ProxyPassReverse for /cone, /sword-app, /dataacquisition in Apache 2 config, if not done yet

PubMan Wildfly[edit]

Add pubman module, which should contain all properties and configuration files for PubMan:
- Create directory WILDFLY_HOME/modules/pubman/main
- Add a file called module.xml to this directory, containing the following xml

 <?xml version="1.0" encoding="UTF-8"?>
 <module xmlns="urn:jboss:module:1.1" name="pubman">
     <resources>
                 <resource-root path="."/>
     </resources>
 </module>

- Add all necessary property files to this directory (pubman.properties, solution.properties, auth.properties, cone.properties, conf.xml, apache-fop-config.xml)
- Make this module global by adding the following xml snippet to standalone.xml, subsystem urn:jboss:domain:ee

 <global-modules>
   <module name="pubman" slot="main"/>
 </global-modules>

Wildfly has a default maximum POST size of 10mb, which is not convenient for large file uploads
- Increase max-post size in standalone.xml, subsystem urn:jboss:domain:undertow by changing http-listener to (for e.g. 1024 mb)

 <http-listener name="default" socket-binding="http" max-post-size="1024000000"/>

PubMan PidCache[edit]

AA[edit]

Validation Database[edit]

Migration database[edit]

CoNE[edit]

eSciDoc Admin[edit]

Data Migration[edit]

PubMan Software Homepage[edit]

Miscellaneous[edit]

PubMan 7 7

Contents