Re: Make a djvu file and add ocr: DjVuToy; TiffDjvuOcr; CuneiDjVu
Posted: 21 Apr 2017, 05:35
Thanks a lot for the source code, really appreciate your effort!
A set of discussions about DIY Book Scanners
As will be seen from that thread, using free tools (MS Windows) one can not only create rather small DjVu files using even greyscale tiff files with greater definition (e.g. 400dpi up if needed) but a searchable sub-image text layer can be included, again using free software as therein shown. And the problem of creating a DjVu file including one or more pages of mixed text and pictures (otherwise resulting sometimes in artifacts spoiling the picture part) can be worked round by saving such page(s) e.g. in DjVuSolo as photo (as opposed to e.g. scanned, perfect) and then substituting them (for any such pages that may have been DjVu encoded as 'scanned' or other default modes) using the Edit function of e.g. DjVuToy. (This workaround being an inferior means to a similar (in result) end as the djvu_imager and djvu_small application suite[*], which I found had more steps to learn before practical implementation).
I don't know if licensing issues affect the permitted use of DjVu format by e.g. archive.org but their Luradoc compressed pdf files I find are a very inferior substitute; even a good multi-format reader like SumatraPDF stalls and halts in page turning while it labours to decompress whereas in my experience DjVu files scroll smoothly without such hesitation.
Last, I find DjVu metadata can now be viewed/edited using an MS Windows explorer extension:
DjVu Shell Extension Pack
I haven't yet tested whether Phil Harvey's updated exiftool can operate likewise on a DjVu file: