I will look at doing this. For now, the custom search engine will at least not display undesirable results, but fixing the indexing would also benefit searches made from outside the wiki (e.g. from www.google.com).
 
-Matt

>>> Christian Boltz <opensuse@cboltz.de> 7/29/2010 3:45 AM >>>
Hello,

On Wednesday, 28 July 2010, Matthew Ehle wrote:
> It appears that when you get towards the end of the
>  useful search results, the search results start getting into the
>  history and diff pages.  I don't know if that's really something we
>  want, but it should be easy to change the CSE settings to fix that.

Don't fix the search results, fix the indexing ;-)

I'd propose to create a robots.txt with
    User-agent: *
    Disallow: /index.php

This should keep the page history, view source etc. out of search
engines. Articles will still be listed because they don't have index.php
in their URL.
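
A quick way to sanity-check such a rule before deploying it is Python's
standard urllib.robotparser. This is only a sketch: the "User-agent: *"
line and the two example paths are my assumptions, not taken from the
real wiki, and real crawlers may interpret rules slightly differently.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content; "User-agent: *" applies the rule to all crawlers.
ROBOTS_TXT = """\
User-agent: *
Disallow: /index.php
"""


def is_allowed(path, robots_txt=ROBOTS_TXT, agent="*"):
    """Return True if `agent` may fetch `path` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # Hypothetical URLs: a history view via index.php and a plain article.
    print(is_allowed("/index.php?title=Main_Page&action=history"))
    print(is_allowed("/Main_Page"))
```

Disallow is a prefix match, so everything starting with /index.php is
blocked, while article paths that never contain it stay crawlable.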

You may also want to add things like
    Disallow: /Special:Search
    Disallow: /Special:Random
and all their translated counterparts - http://de.wikipedia.org/robots.txt
has a nice list ;-)
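
Since the list of special pages grows with every translation, it may be
easier to generate the file from a list of paths. A rough sketch follows;
build_robots_txt and the PATHS list are hypothetical, and the German
entry is only an illustrative guess at a localized name, so check the
names your wiki actually uses.

```python
from urllib.robotparser import RobotFileParser


def build_robots_txt(disallowed_paths, agent="*"):
    """Build robots.txt text with one Disallow line per path prefix."""
    lines = [f"User-agent: {agent}"]
    lines += [f"Disallow: {path}" for path in disallowed_paths]
    return "\n".join(lines) + "\n"


# Illustrative list only; the last entry is an assumed localized name.
PATHS = [
    "/index.php",
    "/Special:Search",
    "/Special:Random",
    "/Spezial:Suche",
]

if __name__ == "__main__":
    robots_txt = build_robots_txt(PATHS)
    print(robots_txt, end="")

    # Re-parse the generated text to confirm the rules behave as intended.
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    print(parser.can_fetch("*", "/Special:Search?search=foo"))
    print(parser.can_fetch("*", "/Main_Page"))
```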


Regards,

Christian Boltz
--
Taken together, the whole process is called "branding": you take
a red-hot iron bearing the new form of expression and press it firmly
onto the company - and the company reacts like cattle. :-) [Ratti]