If you look around the htdig website (www.htdig.org), you'll find info
on using word2txt, pdftotxt and other converters to make .doc, .pdf and
similar files searchable by normal search engines: when the search engine
finds a file to be indexed, it picks the suitable text converter, executes
it, and parses the output.
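
To make that flow concrete, here is a rough sketch of the idea in Python. It is not how htdig itself is implemented or configured. The names CONVERTERS, build_command, extract_text and index_words are made up for illustration, and the commands in the table (pdftotext and catdoc) are assumptions about which converters happen to be installed; substitute whatever you actually use.

```python
import subprocess
from pathlib import Path

# Hypothetical table: file extension -> converter command template.
# "{path}" is replaced by the file name. Both commands are assumptions
# about what is installed; each is expected to write plain text to stdout.
CONVERTERS = {
    ".pdf": ["pdftotext", "{path}", "-"],
    ".doc": ["catdoc", "{path}"],
}


def build_command(path, converters=CONVERTERS):
    """Return the converter command for this file, or None if there is none."""
    template = converters.get(Path(path).suffix.lower())
    if template is None:
        return None
    return [arg.replace("{path}", str(path)) for arg in template]


def extract_text(path, converters=CONVERTERS):
    """Run the matching converter and return its output as text."""
    command = build_command(path, converters)
    if command is None:
        # No converter registered: treat the file as plain text.
        return Path(path).read_text(errors="replace")
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout


def index_words(text):
    """Toy 'parser': the sorted set of lower-cased words in the text."""
    return sorted({word.lower() for word in text.split()})
```

A real indexer would call something like index_words(extract_text(path)) for every file it finds on the share and store the words against the file name.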
On Tue, 2002-07-23 at 23:37, Greg Freemyer wrote:
I've got a Samba fileserver working pretty well, but it would be nice if I
could have a web interface that allowed end-users to search for documents on
it. Sort of like a google interface to the fileserver.
The trouble is that a lot of the docs are in word, acrobat, etc.
Does anyone know an open source search engine that has converters? I've looked
at a couple of commercial search engines that would do the job, but I found the
prices shockingly high. (i.e. $20K for a 100,000 document index)
Deployment and Integration Specialist
Compaq ASE - Tru64 v4, v5
Compaq Master ASE - SAN Architect
The Norcross Group