Need some help with OCR

Post by itsme » 22 Sep 2011, 14:26

Hello,

Our DIY book scanner (2x S95 with CHDK) is built, but now we are stuck on processing the pictures. :(

We have a Windows computer and a high-quality JPG to test with. The JPG is 2.2 MB and contains only one page of a book. I've been searching and testing for some time, but I don't get a good result.

So, to be sure: which programs (OCR, Windows) do we need to get a searchable PDF? A link is also fine.

Thank you in advance

Re: Need some help with OCR

Post by quân » 19 Nov 2011, 22:30

You can use Tesseract, or one of its GUI frontends such as VietOCR, to recognize the image. Make sure your images are scanned or captured at a resolution of about 300 DPI. Once you have the text output, you can print it to a PDF with a virtual PDF printer such as PDFCreator.
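
If you prefer to script it, here is a rough sketch of the same idea in Python. It relies on two assumptions that are not from the posts above: that a recent Tesseract build is installed and on your PATH, and that it supports the `pdf` output option, which writes the page image with a hidden text layer so the PDF is searchable without a separate print step. The function names (`estimate_dpi`, `build_tesseract_command`, `make_searchable_pdf`) and the file name `page_001.jpg` are made up for illustration.

```python
import shutil
import subprocess
from pathlib import Path


def estimate_dpi(pixels_across, inches_across):
    """Effective resolution of a photographed page: pixels divided by inches."""
    return pixels_across / inches_across


def build_tesseract_command(image, lang="eng"):
    """Build the command line: tesseract <image> <output base> -l <lang> pdf.

    Tesseract appends ".pdf" to the output base by itself, so the base is
    the image name without its extension.
    """
    output_base = str(Path(image).with_suffix(""))
    return ["tesseract", image, output_base, "-l", lang, "pdf"]


def make_searchable_pdf(image, lang="eng"):
    """Run Tesseract on one image and return the path of the resulting PDF."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract was not found on PATH")
    # check=True raises CalledProcessError if Tesseract exits with an error.
    subprocess.run(build_tesseract_command(image, lang), check=True)
    return Path(image).with_suffix(".pdf")
```

For example, if the page fills about 2736 pixels of the photo and is 6 inches wide, `estimate_dpi(2736, 6.0)` gives 456, which is comfortably above 300. Calling `make_searchable_pdf("page_001.jpg")` should then leave a `page_001.pdf` next to the image, provided the installed Tesseract version supports PDF output.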