On 02/06/2013 10:12 PM, Ken Schneider - openSUSE wrote:
On 02/06/2013 01:08 AM, Gustav Degreef pecked at the keyboard and wrote:
On 02/06/2013 09:46 PM, Karl Sinn wrote:
Hi,
Yes, you understood correctly. But why can okular read out other PDF documents? Are they not images? Excuse my ignorance. Thanks, Gustav.
Yes, they are not images, they are text.
It's like in a openoffice-document. You can write text "as text" or you can include text as a scanned image. If you include text as a scanned image you'll not be able to alter the text because it's not "text".
Karl
Thanks very much, I understand. Gustav.
You can try to run an OCR programme on the image to extract text.
Yes, I've used tesseract about three years ago. It was a command line program but produced very good results. However, I want to scan several books (200-400 pages each), and have them read out to me. For smaller jobs OCR is great, for these long documents having to OCR and verify the text is prohibitive in terms of time. I have a proprietary program (and hardware) that can scan books and produce a document which is eventually converted to speech. I was trying to get around using it because it stores in a proprietary format, requires proprietary hardware and also requires a win OS. I am now using FoxVox (tts plugin, Firefox), okular for PDF's (I was previously had to use acrobat reader) and jovie (with kmouth) to read out text docs. I'm trying to move even more towards open source tts tools, thanks for the comments, Gustav. -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse+owner@opensuse.org