Masaru, Maura,

On Sunday 28 August 2005 19:19, Masaru Nomiya wrote:
Hello,
...
Maura> What would be a sensible choice to get just a plain readable English text?
I downloaded an English PDF file, and just executed
# pdftotext profile.pdf
then I got a plain text file.
I frequently encounter the symptoms Maura describes. As I said, I've never taken the time to investigate what the issue is.
I also did
# pdftotext -enc UTF-8 profile.pdf
This gave me the same result. What's the matter, I wonder?
Could you show the output of the following commands:
# pdfinfo foo.pdf
and
# pdffonts foo.pdf
I tried these commands on a couple of PDF files, but nothing in their output indicates the encoding used for the text in the file. On the other hand, if you use the Document Properties command in the File menu of Adobe Reader 7 and view the Fonts tab, you can see the encodings. Of the few files I looked at, most of the encodings were listed as "Ansi". One other was listed as "Identity-H".
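For anyone who wants to gather the same diagnostics in one pass, here is a minimal sketch. It assumes the xpdf command-line tools (pdfinfo, pdffonts, pdftotext) are on the PATH; the function name pdf_diag is made up for illustration and is not part of xpdf.

```shell
#!/bin/sh
# pdf_diag (hypothetical helper): print document info, the font list, and the
# first lines of extracted text for one PDF file.
pdf_diag() {
    pdf=$1

    # Stop early if any of the tools is not installed.
    for tool in pdfinfo pdffonts pdftotext; do
        command -v "$tool" >/dev/null 2>&1 || {
            echo "missing tool: $tool" >&2
            return 1
        }
    done

    # Stop early if the file cannot be read.
    [ -r "$pdf" ] || {
        echo "cannot read: $pdf" >&2
        return 1
    }

    echo "== pdfinfo =="
    pdfinfo "$pdf"

    echo "== pdffonts =="
    pdffonts "$pdf"

    echo "== first 20 lines of extracted text =="
    # A text-file argument of "-" sends the extracted text to stdout.
    pdftotext -enc UTF-8 "$pdf" - | head -n 20
}
```

Calling it as "pdf_diag profile.pdf" would put all three outputs together, which may make it easier to compare a file that extracts cleanly with one that does not.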
Regards,
--- Masaru Nomiya
Randall Schulz