The problem seems to be the fuzziness of old typed text where the ink has bled. A printed sheet in Times New Roman, e.g. from my local newspaper, OCRs almost perfectly under kooka. My old text actually looks better to me in the kooka preview than the modern text. (BTW kooka seems to work better if your scanner is switched on... sorry about ranting in the orig. message!)
The nearest I've got so far is around 50% correct, by using gimp's sharpening tool. That takes around 10 minutes per page. We *must* be able to do better than this.
Cheers, Steve.
On Monday 23 February 2004 17:15, Martin Mielke wrote:
Hi,
same problem here...
A text like, for example:
-- I'm trying to ocr some old typewritten ... --
would turn into something like:
--- -_- -__| ocr s0me old _|||_ ... ---
Regards, Martin
Hi. I'm trying to ocr some old typewritten documents using xsane and gocr. There are many errors, however. I've experimented with different brightness, gamma and contrast settings but can't seem to get acceptable quality. I've also tried kooka, but it won't scan or preview the text, and a good-quality scan I adjusted in gimp still doesn't give good results. Any advice, anyone?
Thanks, Steve.
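As a rough sketch of one alternative to hand-tuning brightness, gamma and contrast for every page: the scan could be binarized automatically before it goes to the OCR program. The example below uses Otsu's method, which nobody in this thread has tried, so treat it as an assumption rather than a known fix for bled typewriter ink. The function names `otsu_threshold` and `binarize` are made up for illustration, and the input is assumed to be a flat list of 8-bit grayscale values (0 = black, 255 = white). Loading the scan into that form is left out.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0-255) that maximizes between-class variance.

    Pixels with value <= t are treated as ink, the rest as paper.
    """
    # Histogram of the 256 possible gray levels.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0      # number of pixels at or below the candidate threshold
    sum_dark = 0    # sum of their gray values
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue            # no dark pixels yet, nothing to split
        w_light = total - w_dark
        if w_light == 0:
            break               # every pixel is already in the dark class
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance, up to a constant factor of 1 / total**2.
        var = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(pixels, t):
    """Map every pixel to pure black (0) or pure white (255)."""
    return [0 if p <= t else 255 for p in pixels]


if __name__ == "__main__":
    # Toy "page": two shades of ink and two shades of paper.
    page = [30] * 10 + [40] * 10 + [220] * 10 + [230] * 10
    t = otsu_threshold(page)
    print("threshold:", t)
    print("black pixels:", binarize(page, t).count(0))
```

A single global threshold like this is only a starting point. Where the ink has bled unevenly across the page, a threshold computed per region may behave differently, and whether either helps gocr on these documents would have to be checked against a real scan.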