I recently did a project where I OCR'd a very rare book that I could only find i...

superpermutat0r · on Dec 20, 2019

That's very interesting given that Tesseract uses Leptonica. I'm not sure if they use it for dewarping but all of my little projects with Leptonica really worked well. Dewarping, binarizing, extracting individual elements etc.

https://github.com/DanBloomberg/leptonica

umvi · on Dec 20, 2019

Maybe I wasn't using tesseract to its fullest potential, but I had a really hard time getting it to do accurate OCR on warped pages. Straight pages worked perfectly.

dunham · on Dec 20, 2019

Also, the last time I checked tesseract liked stuff to be 200-300 dpi. (You don’t have to scan it at that resolution, but it helps if you scale it up.)
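
To make the scaling suggestion concrete, here is a rough sketch of the arithmetic only. The 300 dpi default is an assumption taken from the upper end of the range mentioned above, and the function names are hypothetical. The actual resampling would be done with whatever image library you already use, which is not shown here.

```python
def upscale_factor(current_dpi: float, target_dpi: float = 300.0) -> float:
    """Return the factor needed to bring a scan up to target_dpi.

    Never returns less than 1.0: scans that already meet the target
    are left at their original size rather than shrunk.
    """
    if current_dpi <= 0:
        raise ValueError("current_dpi must be positive")
    return max(1.0, target_dpi / current_dpi)


def scaled_size(width: int, height: int, current_dpi: float,
                target_dpi: float = 300.0) -> tuple[int, int]:
    """Return the pixel dimensions to resize to before running OCR."""
    factor = upscale_factor(current_dpi, target_dpi)
    return round(width * factor), round(height * factor)
```

For example, a 1000x1500 pixel scan made at 150 dpi would be resized to 2000x3000 pixels under these assumptions.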

Years ago, I dug up a couple of papers on the spine thing, but never got around to implementing it. I think you can estimate the curvature based on the shadow and dewarp.

I was just scanning some recipes to save typing, so it wasn’t really worth the effort.

umvi · on Dec 20, 2019

I ended up using this guy's tool[1] for dewarping, which worked pretty well. The tool was more a prototype than anything, but it was enough to finish my project.

[1] https://mzucker.github.io/2016/08/15/page-dewarping.html

khalilravanna · on Dec 20, 2019

I'm curious what the book was and the context for why you were reading it.

NicoJuicy · on Dec 20, 2019

Why didn't you combine it with a dictionary and then use ML to correct the letters based on the context?
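
For what it's worth, the dictionary half of that idea can be sketched with nothing but the Python standard library. This is only an illustration, not what anyone in the thread actually ran: the word list is a tiny made-up one, the function names are hypothetical, and the 0.6 similarity cutoff is just the default of `difflib.get_close_matches`. It looks at each word in isolation, so the context-aware ML part is not covered.

```python
import difflib

# Hypothetical vocabulary; a real one would be loaded from a word list file.
VOCAB = sorted({"book", "page", "recipe", "scanned"})


def correct_word(word: str, vocab: list[str] = VOCAB, cutoff: float = 0.6) -> str:
    """Replace an OCR'd word with its closest dictionary entry, if any."""
    lowered = word.lower()
    if lowered in vocab:
        return word  # already a known word, leave it alone
    matches = difflib.get_close_matches(lowered, vocab, n=1, cutoff=cutoff)
    # Fall back to the original token when nothing is similar enough.
    return matches[0] if matches else word


def correct_text(text: str) -> str:
    """Apply correct_word to every whitespace-separated token."""
    return " ".join(correct_word(token) for token in text.split())
```

Calling `correct_text("scaned recipc")` with this vocabulary maps both tokens to their nearest entries, while a token with no close match is passed through unchanged.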