Roger Oberholtzer pecked at the keyboard and wrote:
Hello
We have a network printer that will scan docs and send them as pdf docs to an e-mail address in the company. Is there any software with OpenSUSE 10.3 that can do OCR from a PDF doc? I am guessing that the doc contains tiff images of the scanned documents. Any and all pointers are welcome.
Have you tried pdftotext ? pc5:~ # pdftotext -h pdftotext version 3.02 Copyright 1996-2007 Glyph & Cog, LLC Usage: pdftotext [options] <PDF-file> [<text-file>] -f <int> : first page to convert -l <int> : last page to convert -layout : maintain original physical layout -raw : keep strings in content stream order -htmlmeta : generate a simple HTML file, including the meta information -enc <string> : output text encoding name -eol <string> : output end-of-line convention (unix, dos, or mac) -nopgbrk : don't insert page breaks between pages -opw <string> : owner password (for encrypted files) -upw <string> : user password (for encrypted files) -q : don't print any messages or errors -cfg <string> : configuration file to use in place of .xpdfrc -v : print copyright and version info -h : print usage information -help : print usage information --help : print usage information -? : print usage information -- Ken Schneider SuSe since Version 5.2, June 1998 -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org