Difference between revisions of "Talk:Linguistic Literature Single Pages"
Jump to navigation
Jump to search
(5 intermediate revisions by 3 users not shown) | |||
Line 36: | Line 36: | ||
** have the single file PDF search as a separate service for each file in the fulltext search result | ** have the single file PDF search as a separate service for each file in the fulltext search result | ||
* basic use case on full text search and links to pages maybe already supported via OpenParameters of adobe? | * basic use case on full text search and links to pages maybe already supported via OpenParameters of adobe? | ||
** I have nothing personally against using openparameters of adobe, but if we do, then we could/should also use it for the url of a single page.--[[User:Bastien|Bastien]] 08:10, 8 April 2010 (UTC) | |||
: See one example here: http://pubman.mpdl.mpg.de/pubman/item/escidoc:383417:3/component/escidoc:383415/amerindian_gomez2008-1_s.pdf#search=language (Firefox, IE8 play nice..) | : See one example here: http://pubman.mpdl.mpg.de/pubman/item/escidoc:383417:3/component/escidoc:383415/amerindian_gomez2008-1_s.pdf#search=language (Firefox, IE8 play nice..) | ||
:: | ::what plays nice is the adobe reader plugin, i guess. do other pdf readers also work well with these URL parameters? considering that adobe reader is rife with vulnerabilities offering functionality which does require it would seem inconsiderate.--[[User:Robert|Robert]] 08:27, 8 April 2010 (UTC) | ||
:::please point to the other pdf readers that have to be tested and checked for API OpenParameters support. --[[User:Natasab|Natasa]] 08:56, 27 April 2010 (UTC) | |||
::::[http://www.foxitsoftware.com/pdf/reader/ foxit] and [http://www.foolabs.com/xpdf/ xpdf] come to mind. | |||
==Java PDF== | ==Java PDF== | ||
* http://java-source.net/open-source/pdf-libraries/jpedal | * http://java-source.net/open-source/pdf-libraries/jpedal | ||
* http://pdfbox.apache.org/ | * http://pdfbox.apache.org/ |
Latest revision as of 09:46, 5 May 2010
Links[edit]
- http://view.samurajdata.se/
- NY Times Document Viewer promoted as open source to be released soon after Oct2009 - but can not find it
- very nice, though not open source - but also not expensive http://www.thedocumentviewer.com/
- http://www.ajaxdocumentviewer.com/
- http://multivalent.sourceforge.net/ - not certain how recent this is
More on PDF[edit]
- http://www.cogniview.com/convert-pdf-to-excel/post/pdf-editing-creation-50-open-sourcefree-alternatives-to-adobe-acrobat/
- http://freshmeat.net/projects/pdf2html
Interesting to annotate also images, pdfs etc.[edit]
- http://a.nnotate.com (not OPEN SOURCE)
general info regarding the UC[edit]
URL of a single page
- view by URL/download of single page (calculated on the fly?)
- maybe check again the requirement stated as
- "When referencing one page within an PDF via an URL, this URL shall include the logical page number. The logical page numbers are not always in one sequence. For example it could happen that one document first starts with 5 unnumbered pages, than with page I - V, and than with page 3-66. That means that the no information concerning the logical page numbers can be derived from the total number of pages. That also means that the unnumbered pages can not be referenced."
- what happens if there is a logical page number that does not form a valid URL?
- would mean we have to validate it... would be too much
- proposal: stick with physical page numbers--Natasa 12:14, 10 March 2010 (UTC)
- as far as i can see, the idea to have single pages accessible via URL came also about because there should be a way to link citations to fulltexts in a way like it is possible with google books. this scenario is only possible with logical page numbers, because those are typically available in citation data. (when put into a URL - most probably as query parameter - page numbers should be URL-encoded, so there is no such thing as "invalid page numbers".)--Robert 12:38, 10 March 2010 (UTC)
- proposal: stick with physical page numbers--Natasa 12:14, 10 March 2010 (UTC)
- would mean we have to validate it... would be too much
- or if logical page number is smth like e.g. 4/99 we might have problems (to be checked with devs)
- check Digilib Page as well
Search within a document
- initial idea developed by Malte, Natasa
- constraints of core service:
- core service does not know on which page the term is found
- it would be huge modification to Fedora search to actually enable this.
- have the single file PDF search as a separate service for each file in the fulltext search result
- constraints of core service:
- basic use case on full text search and links to pages maybe already supported via OpenParameters of adobe?
- I have nothing personally against using openparameters of adobe, but if we do, then we could/should also use it for the url of a single page.--Bastien 08:10, 8 April 2010 (UTC)
- See one example here: http://pubman.mpdl.mpg.de/pubman/item/escidoc:383417:3/component/escidoc:383415/amerindian_gomez2008-1_s.pdf#search=language (Firefox, IE8 play nice..)
- what plays nice is the adobe reader plugin, i guess. do other pdf readers also work well with these URL parameters? considering that adobe reader is rife with vulnerabilities offering functionality which does require it would seem inconsiderate.--Robert 08:27, 8 April 2010 (UTC)