Daniel Reetz, the founder of the DIY Book Scanner community, has recently started making videos of prototyping and shop tips. If you are tinkering with a book scanner (or any other project) in your home shop, these tips will come in handy. https://www.youtube.com/channel/UCn0gq8 ... g_8K1nfInQ

Search found 464 matches

by dtic
07 Apr 2010, 05:48
Forum: Scan Tailor
Topic: Scan Tailor
Replies: 357
Views: 214872

Re: Scan Tailor

5MB DJVU file for a 500 page, 600 DPI book produced by dtic is easier to believe, although I would expect a bit more, maybe 7 or 8 MB. Unfortunately his solution is quite complex and can only be recommended to geeky crowd. I believe it also wouldn't be able to handle halftone illustrations. Geeky s...
by dtic
05 Apr 2010, 07:33
Forum: Scan Tailor
Topic: Scan Tailor
Replies: 357
Views: 214872

Re: Scan Tailor

For djvu and OCR there is already fine FOSS available it seems to me. See http://www.diybookscanner.org/forum/viewtopic.php?f=3&t=319. I'm no programmer but I made a GUI frontend that transforms Scan Tailor output into OCR'ed djvu in one manual step. I will post it very shortly. I'd certainly like ...
by dtic
04 Apr 2010, 20:32
Forum: OCR/Optical Character Recognition
Topic: Adding positionally aware ocr to a djvu scan
Replies: 24
Views: 24598

Re: Adding positionally aware ocr to a djvu scan

TiffDjvuOCR - Windows GUI frontend to convert Scan Tailor tiff output into OCR'ed djvu
info + download: http://nod5.dcmembers.com/tiffdjvuocr.html

dtic = nod5 :)
by dtic
04 Apr 2010, 06:27
Forum: Scan Tailor
Topic: Scan Tailor
Replies: 357
Views: 214872

Re: Scan Tailor

Recent discussions here made me think there is a more important task at hand. That task would be assembiling post-processed pages into a PDF or DJVU file. I am going to write a simple GUI program to do just that. Right now this task requires either lots of efforts, or costly commercial software, or...
by dtic
29 Mar 2010, 12:56
Forum: OCR/Optical Character Recognition
Topic: Adding positionally aware ocr to a djvu scan
Replies: 24
Views: 24598

Re: Adding positionally aware ocr to a djvu scan

One important thing worth changing is the tesseract options that the blog used. Replacing "batch.nochop" with "batch" has resulted in far more accurate ocr text. I only wish tesseract had some documentation that could explain what "nochop" tries to do. Tested and agreed! There's a big difference.
by dtic
28 Mar 2010, 07:46
Forum: Cameras and Electronics
Topic: Minimum Camera for Book Scanning
Replies: 84
Views: 116019

Re: Minimum Camera for Book Scanning

How important is it to get two cameras of the same model? Would for example a powershot a710 + ixus 70 (both 7 megapixel) work about as well as two 710's?
by dtic
27 Mar 2010, 09:42
Forum: OCR/Optical Character Recognition
Topic: Adding positionally aware ocr to a djvu scan
Replies: 24
Views: 24598

Re: Adding positionally aware ocr to a djvu scan

Ok, I'll try switching to "batch" for some testruns and compare it. I'm completely new to tesseract (and to OCR in general) so I don't know what the commands are supposed to do differently. Here's the steps I work through: 1. loop for each tiff in folder: - djvulibre: cjb2.exe make djvu 2. djvulibre...
by dtic
27 Mar 2010, 07:36
Forum: OCR/Optical Character Recognition
Topic: Adding positionally aware ocr to a djvu scan
Replies: 24
Views: 24598

Re: Adding positionally aware ocr to a djvu scan

Thanks Strider1551, very useful! I've now made a windows script that follows the same steps (using the autohotkey language). It controls djvulibre, imagemagick, strawberry perl + the perl script above and tesseract. Takes scan tailor tiff files as input and outputs an OCR'ed djvu file. All with only...
by dtic
14 Mar 2010, 10:44
Forum: Scanners and Build Threads
Topic: Book (camera-based) Scanner versus Flatbed Scanner
Replies: 37
Views: 21640

Re: Book (camera-based) Scanner versus Flatbed Scanner

rob: Wow, that scanrobot is cool! Once you have a robot then scan time is no issue I guess. Has anyone here tried building an automatic page turner like that for a DIY camera scanner? possum: They needn't collide. Think of four sensors 10 cm apart attached to the same "rail". When you start to scan,...
by dtic
13 Mar 2010, 14:01
Forum: Scanners and Build Threads
Topic: Book (camera-based) Scanner versus Flatbed Scanner
Replies: 37
Views: 21640

Re: Book (camera-based) Scanner versus Flatbed Scanner

You've so far mentioned (1) the DIY dual camera scanner setup and (2) document feeder scanner for de-spined books. A third option: (3) using a high speed flatbed scanner and manually turn the pages. I have access to such a scanner and I suspect many people have without knowing about it. Many corpora...