[opensuse-wiki] Sphinx vs. Lucene
Hello, I have installed both Sphinx and Lucene on the staging sites. They each have their advantages and disadvantages, so I want to see what everyone would like to see go to production. Here is a quick rundown of the advantages of each: Lucene: -Better integration with MediaWiki default search -Displays the custom "No results" message that Henne created if no results are found (Sphinx can probably be hacked to do this without too much trouble) -Slightly better relevance on some queries -Displays word count and timestamp for each result (some hacking can probably get this on Sphinx as well) Sphinx: -More frequent indexing possible (full indexing daily with incremental indexing every few minutes) -Written in C, no Java installation required -Because of the above, it can be moved live without having to wait on an OS upgrade -Spell checking through aspell (Lucene's is supposedly better, but I cannot get it to work at all) -Easier to customize the extension and better documentation Both search options are extremely fast, provide highly relevant results, support wildcard searches, and work well with multiple wikis (including language specific stemming). Needless to say, both are far superior to the default search. I am slightly more in favor of Sphinx right now. This is partly because it is easy to hack the extension, and partly because it could go live tomorrow if we want to. For those of you who can get to the staging site, Lucene is set as the default search, and Sphinx can be found at stage.opensuse.org/Special:SphinxSearch. Feel free to try them out and compare. -Matt -- To unsubscribe, e-mail: opensuse-wiki+unsubscribe@opensuse.org For additional commands, e-mail: opensuse-wiki+help@opensuse.org
Hey, On 27.08.2010 04:41, Matthew Ehle wrote:
I have installed both Sphinx and Lucene on the staging sites. They each have their advantages and disadvantages, so I want to see what everyone would like to see go to production.
I know that sphinx might be tempting right now but i would definitely go with lucene. Because we can then live off the development and maintenance of mediawiki instead of doing our own. I also think we should use the same style as the mediawiki search page because people are used to it. Henne -- Henne Vogelsang, openSUSE. Everybody has a plan, until they get hit. - Mike Tyson -- To unsubscribe, e-mail: opensuse-wiki+unsubscribe@opensuse.org For additional commands, e-mail: opensuse-wiki+help@opensuse.org
participants (2)
-
Henne Vogelsang
-
Matthew Ehle