Auto vs Manual for "Split Pages"

Scan Tailor specific announcements, releases, workflows, tips, etc. NO FEATURE REQUESTS IN THIS FORUM, please.

Moderator: peterZ

User avatar
JonEP
Posts: 81
Joined: 19 Apr 2010, 15:09

Auto vs Manual for "Split Pages"

Post by JonEP »

Hi,

I'm trying out the new pre-release of ST. It's still great (no surprise!) -- I don't notice the difference too much, but I'll take your word for it!

I did have a case where ST failed to identify the split in the platen and thought I might mention it. Also, I tried to use the "manual" setting and apply to all pages, but I couldn't really tell the difference between manual and auto--in both cases, ST set the split line. I'd hoped to be able to set the split line myself manually in the first image, and then have ST use that same setting for all additional images (I'd selected to have it apply to all pages). I'm guessing that that must not be the function of manual. If not, I wonder, what is?

Thanks for a great program,
J
User avatar
daniel_reetz
Posts: 2812
Joined: 03 Jun 2009, 13:56
E-book readers owned: Used to have a PRS-500
Number of books owned: 600
Country: United States
Contact:

Re: Auto vs Manual for "Split Pages"

Post by daniel_reetz »

Hey jonEP, two thoughts... maybe this would go better in the Pre thread (easier for Tulon to find)... and I know Tulon really appreciates original sample pages that cause problems, including them would be a big help!
spamsickle
Posts: 596
Joined: 06 Jun 2009, 23:57

Re: Auto vs Manual for "Split Pages"

Post by spamsickle »

I think "manual" mode in "page split" only changes one page, and doesn't offer an "apply to" option that would change other pages. The "apply to" option just propagates the "hint" about how the page is split, not the actual coordinates of the split you specified.

In another post about a deskew problem, Tulon suggested manually changing one page, then editing the project file to apply that change to other pages. I've used that technique to apply a content selection to multiple pages, but I hadn't thought to use it for "split pages". I'd like to try it, because some of the scans I did before I knew about Scan Tailor (and which I'm going back and processing now) are getting more than 60% errors on the split page step, and those are usually the source of subsequent content selection errors as well. I'm worried that my "left" pages may not split the same way as my "right" pages, which would make editing the project file about as slow as going through and changing each page manually, but it may be that the two split lines are still in roughly the same place. If I do try it, I'll post results here.

And, if I ever get around to modifying Scan Tailor code itself, adding "apply to" for split pages will be right after adding "apply to" for content selection.
Last edited by spamsickle on 21 Oct 2010, 16:05, edited 2 times in total.
Tulon
Posts: 687
Joined: 03 Oct 2009, 06:13
Number of books owned: 0
Location: London, UK
Contact:

Re: Auto vs Manual for "Split Pages"

Post by Tulon »

Page splitting stage shouldn't make that many mistakes. Post a sample page if you want me to take a look.
Scan Tailor experimental doesn't output 96 DPI images. It's just what your software shows when DPI information is missing. Usually what you get is input DPI times the resolution enhancement factor.
User avatar
JonEP
Posts: 81
Joined: 19 Apr 2010, 15:09

Re: Auto vs Manual for "Split Pages"

Post by JonEP »

Thanks to you both, and continued thanks to Tulon

Wow, Spamsickle, if you ever got around to contributing a fixed content box to ST, and fixed split-pages, you would have a lot of grateful admirers!

Here's a sample tiff that didn't split (I've been having a lot of these--I imagine it's the reflection, although I'm not sure what I've done to my rig that it now creates a reflection that is more difficult for ST to interpret than previously.

N.B: I'd be happy for this to go in the Pre thread...
Attachments
IMG_1771.jpg
(329.14 KiB) Downloaded 142 times
univurshul
Posts: 496
Joined: 04 Mar 2014, 00:53

Re: Auto vs Manual for "Split Pages"

Post by univurshul »

x
Last edited by Anonymous on 15 Nov 2010, 11:14, edited 1 time in total.
spamsickle
Posts: 596
Joined: 06 Jun 2009, 23:57

Re: Auto vs Manual for "Split Pages"

Post by spamsickle »

A few of the books scanned with my first scanner cause the Page Split processing to make more mistakes than successes. It looks to me like reflections or misinterpreting book geometry accounts for most of these. Here's a screen shot of Scan Tailor's automatic page split results for a representative section:
PageSplit.jpg
PageSplit.jpg (73.01 KiB) Viewed 12174 times
For the page shown (and most right-hand pages), Scan Tailor appears to be identifying the reflection of the glossy book cover at the outer edge of the "target" page as its split line.

For the subsequent page (and most left-hand pages), it looks like the edge-on pages from the non-target side of the scanner are similarly confusing.

The originals can be found here.

I suspect I can crop out the parts of the image which are confusing Scan Tailor in a pre-processing step, but it may be possible to modify Page Split to handle such situations, if there is interest in doing so.
User avatar
Misty
Posts: 481
Joined: 06 Nov 2009, 12:20
Number of books owned: 0
Location: Frozen Wasteland

Re: Auto vs Manual for "Split Pages"

Post by Misty »

I'm curious. Does the page split algorithm have as much benefit to camera scans as for flatbed scans? My experience has typically been that pages manually set to have no splits process just as well as pages which were split, without incorrectly selecting anything outside the page for the content area. Since the page split algorithm does make mistakes sometimes, it seems to me that it can be more of a hindrance than a help for camera scanning.
The opinions expressed in this post are my own and do not necessarily represent those of the Canadian Museum for Human Rights.
Tulon
Posts: 687
Joined: 03 Oct 2009, 06:13
Number of books owned: 0
Location: London, UK
Contact:

Re: Auto vs Manual for "Split Pages"

Post by Tulon »

In both samples we have the same problem: they fail Scan Tailor's expectations. When you use "Page + offcut" mode, ST doesn't expect to see both vertical edges of the book. It expects to see the spine line and one or maybe even no vertical edges. You are giving it two. It's actually quite hard to tell a book's edge from its spine, so ST doesn't even try. In "Page + offcut" mode it does the following:
1. Find all more or less vertical lines.
2. Take the are between the leftmost and the rightmost of those. Cut off the rest.
Misty wrote:My experience has typically been that pages manually set to have no splits process just as well as pages which were split, without incorrectly selecting anything outside the page for the content area.
By not cutting off the neighbouring page, you make the life of "Select Content" stage harder. It may or may not be able to cope with that.
Scan Tailor experimental doesn't output 96 DPI images. It's just what your software shows when DPI information is missing. Usually what you get is input DPI times the resolution enhancement factor.
univurshul
Posts: 496
Joined: 04 Mar 2014, 00:53

Re: Auto vs Manual for "Split Pages"

Post by univurshul »

x
Last edited by Anonymous on 15 Nov 2010, 11:15, edited 3 times in total.
Post Reply