If you look around the htdig website (www.htdig.org ?), you'll find info on using word2txt, pdftotxt, and similar converters to make .doc, .pdf, and other such files searchable by normal search engines. When the search engine finds a file to be indexed, it picks the suitable text converter, executes it, and parses the output.

Ewan

On Tue, 2002-07-23 at 23:37, Greg Freemyer wrote:
All,
I've got a Samba fileserver working pretty well, but it would be nice if I could have a web interface that allowed end-users to search for documents on the fileserver.
Sort of like a google interface to the fileserver.
The trouble is that a lot of the docs are in Word, Acrobat, etc.
Does anyone know of an open source search engine that has converters? I've looked at a couple of commercial search engines that would do the job, but I found the prices shockingly high (e.g. $20K for a 100,000 document index).
Greg Freemyer
Internet Engineer
Deployment and Integration Specialist
Compaq ASE - Tru64 v4, v5
Compaq Master ASE - SAN Architect
The Norcross Group
www.NorcrossGroup.com
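
For illustration, here is a minimal sketch of the converter approach described in the reply at the top: look up a converter by file extension, run it, and parse what it prints. This is not htdig's actual code or configuration. The names CONVERTERS, extract_text, and index_words are hypothetical, and the pdftotext and catdoc commands are assumptions standing in for whatever converters are installed on the server.

```python
import re
import subprocess
from pathlib import Path

# Hypothetical table: file extension -> converter command.
# "{path}" is replaced by the file name. Each command is assumed to
# write plain text to standard output. The tools listed here are
# assumptions, so substitute the converters you actually have.
CONVERTERS = {
    ".pdf": ["pdftotext", "{path}", "-"],
    ".doc": ["catdoc", "{path}"],
}


def extract_text(path, converters=CONVERTERS):
    """Run the converter registered for this file type and return its output."""
    path = Path(path)
    command = converters.get(path.suffix.lower())
    if command is None:
        # No converter registered: read the file as plain text.
        return path.read_text(errors="replace")
    argv = [arg.replace("{path}", str(path)) for arg in command]
    result = subprocess.run(argv, capture_output=True, text=True, check=True)
    return result.stdout


def index_words(path, converters=CONVERTERS):
    """Return the set of lowercase words found in the converted text."""
    text = extract_text(path, converters)
    return set(re.findall(r"[a-z0-9]+", text.lower()))
```

Calling index_words on each file under the Samba share would give the word sets a search front end could query. A real indexer would also need to handle converter failures and timeouts, which this sketch leaves to the caller (check=True raises on a non-zero exit).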