This screenshot is doctored
I was curious – how many spam blogs does Google have on their Blogspot server? To get a feeling for this, I’m using the following URL, which brings me to a random blog each time I access it:
Now here’s what I found in chronological order. I’m defining “Spam” as any blog that apparently is automated, in order to link to and therefore promote a certain product (mostly, promote it in terms of Google rankings) or to show off Google AdSense. Anything that you would write manually, whether or not you promote a product, would not count as spam here.
|37.||247guide.blogspot.com||Spam (looks like a spam blog in the making)|
30 bad ones out of 50 overall – that makes it around 60% spam on Blogspot. Google itself shows there are around 7,500,000 pages hosted on Blogspot. If we extrapolate the number, we might estimate Google is hosting 4 million spam pages. (Of course, this number is by no means in any way precise.)
Even though I expected some amount of spam, I was surprised just how much it is. From the small sample I took it looks like on average, a site hosted at Google’s Blogspot is more likely to contain spam than anything else. If you’d consider Blogspot a community, it would be a very unhealthy one. If you consider it just another free web hosting service, than the amount of spam still reflects badly on Google. As for those spam blogs which try to boost their Google rankings, it’s ironic that this is done on Google’s own servers. That’s as ridiculous as a cleaning lady with very dirty shoes.
Google reacted in partly by introducing Captchas for signing up, and by allowing users to flag content they find questionable. Could this be too little too late? And do they really expect users to flag thousands of spam pages – or do they want to get an idea of general patters that make up this spam, in order to automatically remove large portions of it?
50 100 tries to the sample above and now get a lower number of 63 spam blogs in 150 counted blogs, making for 42% spam on random Blogspot URLs. Like I said (and still) the number is by no means precise.
>> More posts