Getting Abbyy to Recognize Chapters/Divisions
Posted: 24 Oct 2012, 00:45
Hi,
I'm using Abbyy to OCR my book scans. I'm scanning both fiction (literary novels) and non-fiction (academic books). The OCR itself is working well and there are few recognition errors. I read the results in mobi or epub format, using Calibre to convert between them.
However, I sometimes find the fiction difficult to read because the chapter divisions are stripped from the books. Some books also have little breaks within chapters -- in the hard copies these are indicated by a skipped line in the text -- that mark scene changes, flashbacks, etc. These are also stripped. So other than paragraphs, the text I get is undivided. This actually makes the books less enjoyable, since I sometimes don't realize that the chapter or scene has changed and have a little moment of confusion until it becomes clear.
I would like Abbyy to preserve the chapter structure and the divisions within chapters. Most of the fiction books in question have at most a simple number indicating the new chapter. So a new chapter is indicated by a page where the text starts halfway down the page with a number above it, and divisions within chapters are indicated by a skipped line. I would like the epubs to similarly start a new chapter where a chapter starts and skip a line within chapters where appropriate.
How can I make this happen? If not with Abbyy, is there other software I can use instead of or along with Abbyy that will do this?