Tesseract OCR

Convert page images into searchable text. Talk about software, techniques, and new developments here.

Moderator: peterZ

Post Reply
User avatar
rob
Posts: 773
Joined: 03 Jun 2009, 13:50
E-book readers owned: iRex iLiad, Kindle 2
Number of books owned: 4000
Country: United States
Location: Maryland, United States
Contact:

Tesseract OCR

Post by rob »

Tesseract OCR is a Google project claiming to be "probably one of the most accurate open source OCR engines available." I tried it once a few months ago and wasn't terribly impressed.

But, if you'd like to try an open source OCR solution, check out this document.
The Singularity is Near. ~ http://halfbakedmaker.org ~ Follow me as I build the world's first all-mechanical steam-powered computer.
james415
Posts: 13
Joined: 04 Mar 2014, 00:52

Re: Tesseract OCR

Post by james415 »

I believe that is what bkrpr based their software on as well.

Cheers,
James
User avatar
rob
Posts: 773
Joined: 03 Jun 2009, 13:50
E-book readers owned: iRex iLiad, Kindle 2
Number of books owned: 4000
Country: United States
Location: Maryland, United States
Contact:

Re: Tesseract OCR

Post by rob »

Don't get me wrong -- I would love Tesseract to succeed. I found a misclassification bug with FineReader for certain output formats, I contacted the company, and they told me that the bug did exist, but they wouldn't fix the problem.
The Singularity is Near. ~ http://halfbakedmaker.org ~ Follow me as I build the world's first all-mechanical steam-powered computer.
Post Reply