Re: [opensuse] PDF OCR

12 Dec 2007

      On Wednesday 12 December 2007 10:52, Ken Schneider wrote:
...
Roger Oberholtzer pecked at the keyboard and wrote:
...
Hello
We have a network printer that will scan docs and send them as pdf docs
to an e-mail address in the company. Is there any software with OpenSUSE
10.3 that can do OCR from a PDF doc? I am guessing that the doc contains
tiff images of the scanned documents. Any and all pointers are welcome.
Have you tried pdftotext ?
I will happily recommend Tesseract.  

http://code.google.com/p/tesseract-ocr/

Here's a how-to on how to do PDF to text, though I've yet to be able to 
convert PDF to TIFF yet...

http://www.groklaw.net/articlebasic.php?story=20061210115516438

And a few more articles...

http://www.linuxjournal.com/article/9676

http://www.howtoforge.com/ocr_with_tesseract_on_ubuntu704

-- 
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
For additional commands, e-mail: opensuse+help@opensuse.org

Re: [opensuse] PDF OCR

Kai Ponte