Daniel Reetz, the founder of the DIY Book Scanner community, has recently started making videos of prototyping and shop tips. If you are tinkering with a book scanner (or any other project) in your home shop, these tips will come in handy. https://www.youtube.com/channel/UCn0gq8 ... g_8K1nfInQ

Search found 67 matches

by BruceG
11 Sep 2016, 05:24
Forum: HELP
Topic: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?
Replies: 20
Views: 9184

Re: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?

djvu to pdf http://djvu2pdf.com/ It took the 10 pages as one file. Not sure if there is a limit. It was the first in my google search Used InFix to re-size the page size. Other programs may also do this. Did the OCR again - no editing - instead of saving graphics -as is- reduced to 100dpi could have...
by BruceG
11 Sep 2016, 04:45
Forum: HELP
Topic: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?
Replies: 20
Views: 9184

Re: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?

John
I took the DJVU file & converted online to pdf (the ten pages together)
Omnipage would not except the file because of page size.
Changed size of pages to A4
Omnipage then excepted file for OCR
Results attached
I see that there are a few things need fixing
Ashley Book of Knots.pdf
OCR output from OmniPage
(1.82 MiB) Downloaded 165 times
by BruceG
07 Sep 2016, 17:55
Forum: HELP
Topic: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?
Replies: 20
Views: 9184

Re: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?

John When I said put the large pdf file through Omnipage, I meant to also save as a image file to reduce its size. I am not sure about V15 but I have a options button near save that has a lot of settings that you can play with. I suggest that you extract a few pages of the large file to play with. I...
by BruceG
07 Sep 2016, 01:47
Forum: HELP
Topic: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?
Replies: 20
Views: 9184

Re: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?

Create pdf and tiff
Can you print from DJVU? If you can, then print to pdf. Drivers (if that's what they are called) are freely available. You can then use Omnipage to produce tif. Omnipage will alow you to reduce the size of the pdf also.
by BruceG
05 Sep 2016, 19:21
Forum: HELP
Topic: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?
Replies: 20
Views: 9184

Re: How should I create OCR text from existing DJVU image-only file and make a searchable text layer in the DJVU file?

John I have been working on making epub books lately. Only for reading as a novel not searching. And not using DJVU. From pdf scans use Omnipage Ultimate to OCR and save as epub To edit the epub I use Calibre epub editing which is free. Usually some pages out of Omnipage do not flow into the next an...
by BruceG
22 Jul 2016, 22:15
Forum: Cameras and Electronics
Topic: Feedback for improvement of my scans
Replies: 2
Views: 1962

Re: Feedback for improvement of my scans

meelash I use a Nikon S6500 16meg camera and it produces jpeg file about 7000+ kb. The Canon I think is a 20meg camera so should be greater. The out put you have is only about 250 kb which is large for a OCR page but useless for a image page. Some where along the line the file size is being greatly ...
by BruceG
17 Jun 2016, 22:52
Forum: HELP
Topic: sheet music & tiff
Replies: 9
Views: 4921

Re: sheet music & tiff

Bart https://en.wikipedia.org/wiki/Music_OCR may give you a few ideas It is unclear ( from your post) if it is Scan Tailor that is producing the 'blurry and unusable' file or unpaper/ImageMagick. I am not familiar with these programs. Though I would expect there would be options as to the quality of...
by BruceG
13 May 2016, 04:11
Forum: Chat
Topic: Cataloging exhibition material - best way? (method/software)
Replies: 8
Views: 16248

Re: Cataloging exhibition material - best way? (method/software)

yello If you happen to have Acrobat you could save all the material with appropriate naming as pdf's. Then with Acrobat create a index/catalog of all the material, this can be added to with each exhibition. Searching is very quick. Picks any word as long as the doc has been OCRed. I use this method ...
by BruceG
10 May 2016, 05:44
Forum: Tutorials/How-To's
Topic: Converting Color/Grayscale Text Scans to Black & White
Replies: 34
Views: 36506

Re: Converting Color/Grayscale Text Scans to Black & White

Fabian Did you have a look at the B&W pdf version at Internet Archive. There some tint on some pages but not over the whole page like the normal pdf. Thus to Revisit OCR.pdf This OCR was with the B&W pdf. Using the tinted version I would have to remove it one page at a time after OCR. Either while f...
by BruceG
09 May 2016, 21:40
Forum: Tutorials/How-To's
Topic: Converting Color/Grayscale Text Scans to Black & White
Replies: 34
Views: 36506

Re: Converting Color/Grayscale Text Scans to Black & White

Fabian
Can you give a link to the book you have from Internet Archive. ie Dropbox etc.

The first entry about Photoshop will allow batch changes.