CMS

Ruediger Landmann r.landmann at redhat.com
Sat Jun 5 07:14:36 UTC 2010


On 06/05/2010 03:23 AM, Paul W. Frields wrote:
>
> I don't consider myself "outside" but the most glaring problem we have
> now is what has been induced by changing the entire site structure:
>
> * Broken links from Google that bring the user back to a front page
>
> * An embedded search that brings users to Google, which doesn't work
>    and often brings the user back... (etc.)
>
> Some of this might change as Google re-indexes content, but we really
> do need to keep a careful eye on how the user experience of finding
> our docs content is working.
>
>    

One of the new features of Publican 2.0 that I haven't mentioned yet is 
that it creates an XML sitemap for search engine bots to crawl. You can 
find d.fp.o.'s sitemap here:

http://docs.fedoraproject.org/Sitemap

I've fed this to Google, Yahoo, and Bing, and they're all slowly
re-indexing the site. The map now contains a little over 2,000 URLs and,
at the time of writing, Google has crawled about 350 of them.
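
For anyone who wants to track re-indexing progress themselves, here is a minimal sketch. It assumes the sitemap follows the standard sitemaps.org format (a urlset of url/loc entries), which I have not verified for Publican's output. The function names and the sample document are hypothetical, and the crawled count has to come from the search engine's own reporting. With the round figures above (2,000 URLs, 350 crawled), progress is roughly 17%.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol (assumed to apply here).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical two-entry sitemap used only for illustration.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://docs.example.org/en-US/index.html</loc></url>
  <url><loc>http://docs.example.org/en-US/guide/index.html</loc></url>
</urlset>"""


def sitemap_urls(xml_text):
    """Return the <loc> values listed in a sitemap document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


def crawl_progress(total_urls, crawled_urls):
    """Fraction of sitemap URLs reported as crawled so far."""
    if total_urls <= 0:
        raise ValueError("total_urls must be positive")
    return crawled_urls / total_urls


if __name__ == "__main__":
    urls = sitemap_urls(SAMPLE)
    print(len(urls), "URLs in sample sitemap")
    print("progress with rounded figures:", crawl_progress(2000, 350))
```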

The dilemma we face is when to turn off the 404 redirect. For the sake
of all the existing links scattered around the net (both on the Fedora
Project site and off it), we'd want to postpone this as long as
possible. On the other hand, any bot attempting to verify one of those
old links gets a page served up and probably concludes that the link is
valid; I suspect that if these links 404ed, they'd start to evaporate
from search results.
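
To make the distinction concrete, here is a rough sketch of how a link checker might tell the two behaviours apart. It assumes the redirect sends the client to a different path and ends in a 200 response, which may not match how the redirect on d.fp.o is actually configured. The function name, the labels, and the example URLs are all hypothetical.

```python
from urllib.parse import urlsplit


def classify_dead_link(requested_url, final_url, status):
    """Classify the answer to a request for a page that no longer exists.

    requested_url: the old link that was followed
    final_url:     the URL that finally answered, after any redirects
    status:        the HTTP status code of that final answer
    """
    if status in (404, 410):
        # The server says plainly that the page is gone.
        return "hard-404"
    requested_path = urlsplit(requested_url).path
    final_path = urlsplit(final_url).path
    if status == 200 and final_path != requested_path:
        # The client was sent elsewhere and got a page, so the old
        # link looks valid even though its content no longer exists.
        return "soft-404"
    return "served"


if __name__ == "__main__":
    old = "http://docs.example.org/old/page.html"
    print(classify_dead_link(old, old, 404))
    print(classify_dead_link(old, "http://docs.example.org/", 200))
```

Only the first case gives a crawler an unambiguous signal to drop the link.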

Given that existing links around the net point to (at the most recent)
the F12 versions of docs, there will be no need to keep the 404
redirect in place past October; however, if we want to start allowing
dead links to 404 out rather than poison search results, maybe we should
bring that date forward? The sooner we do this, the sooner search will
start working properly...

Cheers
Rudi


