Re: [opensuse] Text-to-speech from scanned documents (xsane)

6 Feb 2013

      On 02/06/2013 10:12 PM, Ken Schneider - openSUSE wrote:
...
On 02/06/2013 01:08 AM, Gustav Degreef pecked at the keyboard and wrote:
...
On 02/06/2013 09:46 PM, Karl Sinn wrote:
...
Hi,
...
Yes, you understood correctly.  But why can okular read out other PDF
documents? Are they not images? Excuse my ignorance.  Thanks, Gustav.
Yes, they are not images, they are text.
It's like in a openoffice-document. You can write text "as text" or
you can include text as a scanned image. If you include text as a
scanned image you'll not be able to alter the text because it's not
"text".
Karl
Thanks very much, I understand. Gustav.
You can try to run an OCR programme on the image to extract text.
Yes, I've used tesseract about three years ago.  It was a command line 
program but produced very good results. However, I want to scan several 
books (200-400 pages each), and have them read out to me. For smaller 
jobs OCR is great, for these long documents having to OCR and verify the 
text is prohibitive in terms of time.

I have a proprietary program (and hardware) that can scan  books and 
produce a document which is eventually converted to speech. I was trying 
to get around using it because it stores in a proprietary format, 
requires proprietary hardware and also requires a win OS.

I am now using FoxVox (tts plugin, Firefox), okular for PDF's (I was 
previously had to use acrobat reader) and jovie (with kmouth) to read 
out text docs.  I'm trying to move even more towards open source tts 
tools,  thanks for the comments, Gustav.
-- 
To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org
To contact the owner, e-mail: opensuse+owner@opensuse.org