Talk:Linguistic Literature Single Pages
Links[edit]
- http://view.samurajdata.se/
- NY Times Document Viewer promoted as open source to be released soon after Oct2009 - but can not find it
- very nice, though not open source - but also not expensive http://www.thedocumentviewer.com/
- http://www.ajaxdocumentviewer.com/
- http://multivalent.sourceforge.net/ - not certain how recent this is
More on PDF[edit]
- http://www.cogniview.com/convert-pdf-to-excel/post/pdf-editing-creation-50-open-sourcefree-alternatives-to-adobe-acrobat/
- http://freshmeat.net/projects/pdf2html
Interesting to annotate also images, pdfs etc.[edit]
- http://a.nnotate.com (not OPEN SOURCE)
general info regarding the UC[edit]
URL of a single page
- view by URL/download of single page (calculated on the fly?)
- maybe check again the requirement stated as
- "When referencing one page within an PDF via an URL, this URL shall include the logical page number. The logical page numbers are not always in one sequence. For example it could happen that one document first starts with 5 unnumbered pages, than with page I - V, and than with page 3-66. That means that the no information concerning the logical page numbers can be derived from the total number of pages. That also means that the unnumbered pages can not be referenced."
- what happens if there is a logical page number that does not form a valid URL?
- would mean we have to validate it... would be too much
- proposal: stick with physical page numbers--Natasa 12:14, 10 March 2010 (UTC)
- as far as i can see, the idea to have single pages accessible via URL came also about because there should be a way to link citations to fulltexts in a way like it is possible with google books. this scenario is only possible with logical page numbers, because those are typically available in citation data. (when put into a URL - most probably as query parameter - page numbers should be URL-encoded, so there is no such thing as "invalid page numbers".)--Robert 12:38, 10 March 2010 (UTC)
- proposal: stick with physical page numbers--Natasa 12:14, 10 March 2010 (UTC)
- would mean we have to validate it... would be too much
- or if logical page number is smth like e.g. 4/99 we might have problems (to be checked with devs)
- check Digilib Page as well
Search within a document
- initial idea developed by Malte, Natasa
- constraints of core service:
- core service does not know on which page the term is found
- it would be huge modification to Fedora search to actually enable this.
- have the single file PDF search as a separate service for each file in the fulltext search result
- constraints of core service:
- basic use case on full text search and links to pages maybe already supported via OpenParameters of adobe?
- I have nothing personally against using openparameters of adobe, but if we do, then we could/should also use it for the url of a single page.--Bastien 08:10, 8 April 2010 (UTC)
- See one example here: http://pubman.mpdl.mpg.de/pubman/item/escidoc:383417:3/component/escidoc:383415/amerindian_gomez2008-1_s.pdf#search=language (Firefox, IE8 play nice..)
- what plays nice is the adobe reader plugin, i guess. do other pdf readers also work well with these URL parameters? considering that adobe reader is rife with vulnerabilities offering functionality which does require it would seem inconsiderate.--Robert 08:27, 8 April 2010 (UTC)
- please point to the other pdf readers that have to be tested and checked for API OpenParameters support. --Natasa 08:56, 27 April 2010 (UTC)
- @Kirstin: Please give feedback if it would be fine that the search within PubMan only finds the documents in which a search term exists. The user then can select one of these documents and creates there a specific search only for the whole document (see example above). --Kristina 12:21, 7 April 2010 (UTC)
- what plays nice is the adobe reader plugin, i guess. do other pdf readers also work well with these URL parameters? considering that adobe reader is rife with vulnerabilities offering functionality which does require it would seem inconsiderate.--Robert 08:27, 8 April 2010 (UTC)