Search found 63 matches
- 06 May 2020, 09:34
- Forum: Tutorials/How-To's
- Topic: How to convert a book to serchable pdf using open source software
- Replies: 33
- Views: 168940
Re: How to convert a book to serchable pdf using open source software
Thank you for your comments and for sharing the details of your workflow. Nice to see that someone found the thread I wrote useful. First, I think the cover should stay the same size when scrolling through the PDF file. I had some problems with this since I scanned the covers at higher resolutions. You are right. The scripts...
- 22 Feb 2020, 19:06
- Forum: Tutorials/How-To's
- Topic: From tiff-scans, ScanTailor and Tesseract to djvu-files - how?
- Replies: 2
- Views: 7258
Re: From tiff-scans, ScanTailor and Tesseract to djvu-files - how?
By far the most time-consuming part is the OCR. I am wondering whether the -j option of ocrodjvu (number of OCR threads) would speed this up. Is there a relation between the number of threads and the number of CPU cores? What number of threads would be meaningful (I have an AMD CPU with 6 cores and an NVIDIA GPU)? I gu...
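A minimal sketch of one way to pick a value for -j, assuming the OCR step is CPU-bound so that roughly one job per core is a sensible starting point. This is a rule of thumb, not something the thread confirms, and it says nothing about whether the GPU helps. The helper names `suggest_jobs` and `jobs_args` are hypothetical.

```python
import os


def suggest_jobs(reserve=1, cpu_count=None):
    """Suggest a number of parallel OCR jobs.

    Assumption: OCR is CPU-bound, so one job per core is a starting point.
    `reserve` cores are left free to keep the desktop responsive.
    """
    cores = cpu_count if cpu_count is not None else (os.cpu_count() or 1)
    return max(1, cores - reserve)


def jobs_args(jobs):
    """Build the -j argument pair to append to an ocrodjvu command line."""
    return ["-j", str(jobs)]


if __name__ == "__main__":
    # On the 6-core CPU mentioned in the post this suggests 5 jobs.
    print(jobs_args(suggest_jobs(cpu_count=6)))
```

Timing a small batch of pages with a few different values is the only reliable way to find the best setting for a given machine.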
- 20 Jan 2020, 18:15
- Forum: Scan Tailor
- Topic: What to do with a page with text and graphics
- Replies: 4
- Views: 8285
Re: What to do with a page with text and graphics
I attach some screenshots and hope you will find them useful. 1. Let's say there is a page with pictures and text (it doesn't matter that they are in grayscale; the same applies to color ones) Zrzut ekranu z 2020-01-17 22-00-59.png 2. In the "Output" stage: a. Change "Mode" from "Bla...
- 09 Jan 2020, 21:17
- Forum: Tutorials/How-To's
- Topic: How to convert a book to serchable pdf using open source software
- Replies: 33
- Views: 168940
Re: How to convert a book to serchable pdf using open source software
As you already wrote, it was a sorting problem caused by inconsistent naming of the files. BTW, this is not a Tesseract issue: Tesseract cannot process a batch of separate files directly, so a workaround is necessary, namely creating a list of the files in the right order for Tesseract to follow. This list was created ...
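The file-list workaround described above can be sketched as follows. This is only an illustration, not the script from the thread: the helper names `natural_key` and `write_file_list`, the `*.tif` pattern, and the file names in the usage note are all assumptions. It sorts file names so that embedded numbers compare numerically (page2 before page10), which is one way to cope with inconsistent zero-padding.

```python
import re
from pathlib import Path


def natural_key(name):
    """Sort key that compares digit runs as integers.

    re.split with a capturing group always alternates text, digits, text, ...
    so keys for different names line up type-for-type.
    """
    parts = re.split(r"(\d+)", name)
    return [int(p) if p.isdigit() else p.lower() for p in parts]


def write_file_list(image_dir, list_path, pattern="*.tif"):
    """Write one image path per line, in natural order, and return the paths."""
    paths = sorted(Path(image_dir).glob(pattern), key=lambda p: natural_key(p.name))
    Path(list_path).write_text("".join(f"{p}\n" for p in paths), encoding="utf-8")
    return paths


if __name__ == "__main__":
    for path in write_file_list(".", "filelist.txt"):
        print(path)
```

Tesseract accepts such a text file in place of a single image, for example `tesseract filelist.txt book pdf`, where `filelist.txt` and `book` are placeholder names.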
- 09 Jan 2020, 06:50
- Forum: Scan Tailor
- Topic: What to do with a page with text and graphics
- Replies: 4
- Views: 8285
Re: What to do with a page with text and graphics
Where pages contain both text and photos, it is possible to apply the "mixed output" (text areas are binarized but pictures remain in color). Picture areas should be selected and marked as picture zones, and "Rectangular picture shape" mode from ST Advanced is re...
- 27 Aug 2019, 07:25
- Forum: Tutorials/How-To's
- Topic: How to convert a book to serchable pdf using open source software
- Replies: 33
- Views: 168940
Re: How to convert a book to serchable pdf using open source software
As I use books in PDF format as a source for TTS, I also compared how they are read by Moon Reader Pro + Ivona TTS engine on my Android phone. The main problem was that there are erroneous additional paragraph breaks in random places, which makes listening less fluent and comfortable. I need to ...
- 06 Aug 2019, 12:38
- Forum: Tutorials/How-To's
- Topic: How to convert a book to serchable pdf using open source software
- Replies: 33
- Views: 168940
Re: How to convert a book to serchable pdf using open source software
Thanks! I am already using Scantailor for cropping, but am looking for software that would get rid of excess white space around text, and fingers, to edit my scans with before I start working with Scantailor. I use Scan Tailor Advanced for this, i.e. in order to manually select the area where STA looks a...
- 05 Aug 2019, 15:42
- Forum: Tutorials/How-To's
- Topic: How to convert a book to serchable pdf using open source software
- Replies: 33
- Views: 168940
Re: How to convert a book to serchable pdf using open source software
Does anyone know of useful software for cropping scans that runs on Mac Os? I think this thread more or less answers your question: https://forum.diybookscanner.org/viewtopic.php?f=24&p=21785&sid=0ab34fa7fd0f5ee70fc925d0d99c4f21#p21785 In short, I would recommend Scan Tailor Advanced. It...
- 18 Jul 2019, 03:30
- Forum: HELP
- Topic: How to crop and deskew pages using only free software?
- Replies: 8
- Views: 16951
Re: How to crop and deskew pages using only free software?
I do not expect Scan Tailor to provide perfect or even very good results without some manual corrections. It is a universal tool, but it seems you need something more customized and tailor-made. Maybe something based on OpenCV?
- 17 Jul 2019, 17:33
- Forum: Programs, Software releases, and more.
- Topic: djvu 1bit vs 8bits
- Replies: 6
- Views: 9557
Re: djvu 1bit vs 8bits
PDF in Abbyy. djvu with djvu small mod. I do not have much experience with ABBYY FineReader. However, I would guess that in the case of 1-bit pictures JBIG2 compression is applied and no MRC compression is necessary. In the case of grayscale images, algorithms do separation of foreground (letters) ...