With all the Google Fever going around the media lately, I'm somewhat surprised the "mainstream" press hasn't done any stories about the Google cloaking fiasco. Maybe it's one of those things that only search engine geeks like us would understand, or maybe it's just too far under the radar, but it did show up on
slashdot yesterday. Although, most of the Slashdot nerds didn't think it was a big deal.
Anyway, there's an interesting
thread about it on webmasterworld in which GoogleGuy explains the situation as follows:
Those pages were primarily intended for the Google Search Appliances that do site search on individual help center pages. For example, http://adwords.google.com/support has a search box, and that search is powered by a Google Search Appliance. In order to help the Google Search Appliance find answers to questions, the user support system checked for the user agent of "Googlebot" (the Google Search Appliance uses "Googlebot" as a user agent), and if it found it, it added additional information from the user support database into the title.
The issue is that in addition to being accessed via the internal site-search at each help center, these pages can be accessed by static links via the web. When the web-crawl Googlebot visits, the user support system thinks that it's the Google Search Appliance (the code only checks for "Googlebot") and adds these additional keywords.
That's the background, so let me talk about what we're doing. To be consistent with our guidelines, we're removing these pages from our index. I think the pages are already gone from most of our data centers--a search like [site:google.com/support] didn't return any of these pages when I checked. Once the pages are fully changed, people will have to follow the same procedure that anyone else would (email webmaster at google.com with the subject "Reinclusion request" to explain the situation).
So basically, Google gave their own pages the boot for not following their own guidelines. It sounds like a harmless mistake, but I still think it's hilarious. At least they fessed up to it and made the best of the situation by kicking their own pages out of the index. Although, I have a feeling they'll have a pretty easy time getting those (non-cloaked) pages back in the index.
If anyone missed this, I've got screenshots of the cloaked page and the SERPS page. Not much to see, but you can see the title stuffed with "useful" keywords.