Search found 61 matches

by zbgns
18 Aug 2021, 13:11
Forum: Tutorials/How-To's
Topic: How to convert a book to serchable pdf using open source software
Replies: 33
Views: 166436

Re: How to convert a book to serchable pdf using open source software

Well, actually I am not able to say what is wrong. I expect that my lack of ability is the reason. When I try to install 'archive-hocr-tools' I get this error message: $ python3 -m pip install setup.py Defaulting to user installation because normal site-packages is not writeable ERROR: Coul...
by zbgns
19 Jul 2021, 12:05
Forum: Tutorials/How-To's
Topic: How to convert a book to serchable pdf using open source software
Replies: 33
Views: 166436

Re: How to convert a book to serchable pdf using open source software

Nice to see that. I was looking for a free implementation of MRC, and apparently here it is. Unfortunately, I was not able to test the program due to some dependency issues. Nevertheless, it looks very promising. Thank you for sharing information about this software with me.
by zbgns
26 May 2021, 05:14
Forum: Lighting
Topic: LED lamps 'Warm' or 'Cold' light - Does it matter?
Replies: 7
Views: 9661

Re: LED lamps 'Warm' or 'Cold' light - Does it matter?

I would rather suggest a "neutral" white (4600K) light source instead of a "warm" or "cold" white one. That is definitely less problematic and gives good results with the standard white balance algorithms in smartphone cameras. I also tried "warm" white LEDs (2700...
by zbgns
31 Jan 2021, 10:51
Forum: Scan Tailor
Topic: Combining Split output into PDF
Replies: 9
Views: 10082

Re: Combining Split output into PDF

What I don't understand, or rather don't know, is how to achieve the 2 steps after Scan Tailor: PDF size 50% JPEG -G4 and PDF at 150dpi scaling (I am on Linux). Can you explain what I have to do? What do you mean by 'PDF size 50% JPEG -G4'? Do you want to have JPEG compression applied to color ...
by zbgns
22 Jan 2021, 08:02
Forum: Scan Tailor
Topic: Combining Split output into PDF
Replies: 9
Views: 10082

Re: Combining Split output into PDF

More recent versions of Adobe Acrobat do a good job with Scan Tailor output and create such 'optimized' pdf files, where color and b&w content is segmented and compressed separately. The result seems to me to be similar to the ABBYY FineReader MRC compression model.
by zbgns
26 Nov 2020, 07:57
Forum: Tutorials/How-To's
Topic: How to convert a book to serchable pdf using open source software
Replies: 33
Views: 166436

Re: How to convert a book to serchable pdf using open source software

Actually, each book I created using the described method has a color front cover and back cover. The contents between the covers are binarized (B&W). Pictures in color may be added, but it would be necessary to manually convert them to an appropriate format and turn them into pdf, and afterwards ins...
by zbgns
24 Nov 2020, 15:40
Forum: Scan Tailor
Topic: output dimensions ratio
Replies: 2
Views: 5219

Re: output dimensions ratio

At the bottom, near the right-hand corner, you have information on the dimensions of the image. Width and height may be adjusted at the 'Margins' stage. You may change units by selecting 'Tools' and 'Units'.
by zbgns
14 May 2020, 14:21
Forum: Tutorials/How-To's
Topic: How to convert a book to serchable pdf using open source software
Replies: 33
Views: 166436

Re: How to convert a book to serchable pdf using open source software

My bad. I was convinced that OCRmyPDF supports jbig2 but apparently this applies only to regular pdfs.
by zbgns
06 May 2020, 09:34
Forum: Tutorials/How-To's
Topic: How to convert a book to serchable pdf using open source software
Replies: 33
Views: 166436

Re: How to convert a book to serchable pdf using open source software

Thank you for your comments and for sharing details of your workflow. Nice to see that someone found the thread I wrote useful. First I think the cover should be at the same size when scrolling the pdf file. I had some problems since I scanned the covers at higher resolutions. You are right. The scripts...
by zbgns
22 Feb 2020, 19:06
Forum: Tutorials/How-To's
Topic: From tiff-scans, ScanTailor and Tesseract to djvu-files - how?
Replies: 2
Views: 6751

Re: From tiff-scans, ScanTailor and Tesseract to djvu-files - how?

By far the most time-consuming part is the OCR. I am wondering if the -j option from ocrodjvu would speed this up (number of OCR threads)? Is there a relation between the threads and the CPU cores? What number of threads would be meaningful (I have an AMD CPU with 6 cores, and an NVIDIA GPU)? I gu...