I think there is another problem with inserting the text layer.
Trying the tesseract engine:
Code:
% djvubind --ocr-engine=tesseract --tesseract-options="-l rus" -v
djvubind version 1.0.1
Executing with these parameters:
{'ocr_engine': 'tesseract', 'tesseract_options': '-l rus', 'verbose': True, 'cjb2_options': '-lossy', 'cuneiform_options': '-l ruseng', 'bitonal_encoder': 'minidjvu', 'color_encoder': 'csepdjvu', 'ocr': False, 'quiet': False, 'minidjvu_options': '--dpi 300 --pages-per-dict 80 --verbose', 'win_path': 'C:\\Program Files\\DjVuZone\\DjVuLibre\\;C:\\Program Files\\Tesseract-OCR;C:\\Program Files\\ImageMagick-6.6.5-Q16', 'cores': 2, 'csepdjvu_options': '', 'c44_options': ''}
* Collecting files to be processed.
* Analyzing image information.
Spawning 2 processing threads.
* Performing optical character recognition.
Spawning 2 processing threads.
* Encoding all information to book.djvu.
At the same time, I was monitoring the working directory:
Code:
% inotifywait -m -r --format '%:e %f' tst
...
CREATE 155_box.box
OPEN 155_box.box
MODIFY 155_box.box
MODIFY 155_box.box
CLOSE_WRITE:CLOSE 155_box.box
...
CLOSE_NOWRITE:CLOSE 155.tif
CREATE 155_txt.txt
OPEN 155_txt.txt
MODIFY 155_txt.txt
MODIFY 155_txt.txt
CLOSE_WRITE:CLOSE 155_txt.txt
OPEN 155_box.box
ACCESS 155_box.box
CLOSE_NOWRITE:CLOSE 155_box.box
OPEN 155_txt.txt
ACCESS 155_txt.txt
CLOSE_NOWRITE:CLOSE 155_txt.txt
DELETE 155_box.box
DELETE 155_txt.txt
...
CLOSE_NOWRITE:CLOSE 157.tif
CREATE enc_temp.djvu
OPEN enc_temp.djvu
MODIFY enc_temp.djvu
...
As far as I can see, the tesseract output files (the .box and .txt files) were deleted
before the djvu encoding started. The whole log is attached.
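
In case anyone wants to check the ordering in the full log without reading it by eye, here is a rough sketch in Python. It is not part of djvubind. The function name deleted_before_encoding is made up for illustration, and it assumes the log uses the same '%:e %f' format as above and that the appearance of enc_temp.djvu marks the start of encoding. It only reports which files were deleted before that point; it does not say whether that is the actual cause of the problem.

```python
def deleted_before_encoding(lines, marker="CREATE enc_temp.djvu"):
    """Return names of files that were deleted before the marker line.

    Assumes each line looks like 'EVENT filename', as produced by
    inotifywait --format '%:e %f'. The marker is an assumption: the
    first creation of enc_temp.djvu is taken as the start of encoding.
    """
    deleted = []
    for line in lines:
        line = line.strip()
        if line == marker:
            break  # encoding has started; stop collecting
        event, _, name = line.partition(" ")
        if event == "DELETE":
            deleted.append(name)
    return deleted


# A shortened excerpt of the inotifywait output quoted above.
log = """\
CREATE 155_box.box
CLOSE_WRITE:CLOSE 155_box.box
CREATE 155_txt.txt
CLOSE_WRITE:CLOSE 155_txt.txt
DELETE 155_box.box
DELETE 155_txt.txt
CREATE enc_temp.djvu
""".splitlines()

print(deleted_before_encoding(log))
```

For the full log, replace the excerpt with the lines read from the saved inotifywait output file.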