Hi all
I have a bit of a convoluted workflow when it comes to OCR PDF.
I need to join a number of single page PDF's (with OCR information) into a single multipage document, whilst preseving the OCR data!
I have tried a number of software but they tend to lose the OCR info when the pages are joined.
I am using a very old version of ADOBE Capture with a custom workflow setup to take a folder of singles and output a singel file named after the folder..
Ideally I need something scriptable from the command line etc?
Any help?
Software to join OCR single page PDF to multipage PDF
Moderator: peterZ
-
- Posts: 22
- Joined: 21 Oct 2011, 09:51
- Number of books owned: 0
Re: Software to join OCR single page PDF to multipage PDF
OS is some sort of Windows presumably going by the Adobe ref., under which if so I've used "GUI for PDFTK" http://www.paehl.de once in a blue moon for joining a couple of pages, can't recall if it dropped the hidden text layer of an image pdf though. Small program easily installed for you to try out if not yet done. If works then might explore the cmd line for scripting possibilities, also at http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/
If that doesn't work then offhand the only one I can think of trying (sure to be more though) is http://www.pdfill.com/pdf_tools_free.html
- both free at least for non-commercial, latter has some paying component but I don't think that affects the merge function.
If that doesn't work then offhand the only one I can think of trying (sure to be more though) is http://www.pdfill.com/pdf_tools_free.html
- both free at least for non-commercial, latter has some paying component but I don't think that affects the merge function.
-
- Posts: 22
- Joined: 21 Oct 2011, 09:51
- Number of books owned: 0
Re: Software to join OCR single page PDF to multipage PDF
Hi
I think I tried PDFTK in the past, I'll try it again.
As for OS I have access to Linux / Windows so anything is OK, probably even a pref for a linux solution --- does anyone have any experience with WatchOCR?
Could I set up say 6 PCs and have a fast OCR setup?
I think I tried PDFTK in the past, I'll try it again.
As for OS I have access to Linux / Windows so anything is OK, probably even a pref for a linux solution --- does anyone have any experience with WatchOCR?
Could I set up say 6 PCs and have a fast OCR setup?