Randall R Schulz wrote:
Converting PDF to HTML is guaranteed to produce inferior results. I don't recommend it. Playing the kind of games you hint at is unlikely to benefit your end users.
Hi Randall, I agree about the results, but there are times when it's useful. I'm with a non-profit that sends out a monthly newsletter to about 700 members. This is printed in b/w and sent via snail mail, its usually about 5 double-sided pages. But I also place the source pdfs on the org's web site and keep the older versions there for archival purposes. This works well since the source is in color and there are frequently color photos that we can't afford to distribute in paper form. But it's nice to be able to index the archived newsletters for historical reference (it's a museum), but pdf doesn't work well for indexing text. So I use pdftohtml and offer it right next to the link pointing at each pdf edition. The text is indexable and the high quality pdf is right there with it. pdftohtml does a fairly good job, about the only time I've seen it mess up is when the source pdf has photo credits written at a 90-degree angle along the vertical sides of photos. They come out as horizontal lines. Regards, Lew -- To unsubscribe, e-mail: opensuse+unsubscribe@opensuse.org For additional commands, e-mail: opensuse+help@opensuse.org