Software to join OCR single page PDF to multipage PDF

Don't know where to start, or stuck on a certain problem? Drop by and tell us about it. Feel like helping others? Start here.

Moderator: peterZ

Post Reply
jaffamuffin
Posts: 22
Joined: 21 Oct 2011, 09:51
Number of books owned: 0

Software to join OCR single page PDF to multipage PDF

Post by jaffamuffin »

Hi all

I have a bit of a convoluted workflow when it comes to OCR PDF.

I need to join a number of single page PDF's (with OCR information) into a single multipage document, whilst preseving the OCR data!

I have tried a number of software but they tend to lose the OCR info when the pages are joined.

I am using a very old version of ADOBE Capture with a custom workflow setup to take a folder of singles and output a singel file named after the folder..


Ideally I need something scriptable from the command line etc?


Any help?
b0bcat
Posts: 49
Joined: 30 Nov 2012, 21:37
Number of books owned: 0
Country: UK

Re: Software to join OCR single page PDF to multipage PDF

Post by b0bcat »

OS is some sort of Windows presumably going by the Adobe ref., under which if so I've used "GUI for PDFTK" http://www.paehl.de once in a blue moon for joining a couple of pages, can't recall if it dropped the hidden text layer of an image pdf though. Small program easily installed for you to try out if not yet done. If works then might explore the cmd line for scripting possibilities, also at http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/

If that doesn't work then offhand the only one I can think of trying (sure to be more though) is http://www.pdfill.com/pdf_tools_free.html

- both free at least for non-commercial, latter has some paying component but I don't think that affects the merge function.
jaffamuffin
Posts: 22
Joined: 21 Oct 2011, 09:51
Number of books owned: 0

Re: Software to join OCR single page PDF to multipage PDF

Post by jaffamuffin »

Hi

I think I tried PDFTK in the past, I'll try it again.

As for OS I have access to Linux / Windows so anything is OK, probably even a pref for a linux solution --- does anyone have any experience with WatchOCR?

Could I set up say 6 PCs and have a fast OCR setup?
Post Reply