Google Blogoscoped

Forum

google engineer responds to billion page index scam - can't wait for matt to blog again

Search-Engines-Web.com [PersonRank 10]

Thursday, June 22, 2006
18 years ago2,503 views

http://battellemedia.com/archives/002661.php

Finally a response from an Engineer – (Matt has resisted the temptation to take a Blog Vacation Break...)

So this will have to do – until He starts Blogging again.

NOW THAT'S WILLPOWER!!!!!!!!!

Ionut Alex. Chitu [PersonRank 10]

18 years ago #

Not true.

Adam has responded to the Digg post 4 days ago:
http://digg.com/technology/How_One_Spammer_Got_BILLIONS_of_Pages_into_Google_in_3_Weeks
(search for AdamLasnik)

Philipp Lenssen [PersonRank 10]

18 years ago #

Where's the response from an engineer in that post Search-Engines-Web?

Search-Engines-Web.com [PersonRank 10]

18 years ago #

I've long been a lurker / occasional commenter for quite some time here, and I figured I might as well offer a few clarifications on the "5 billion" issue :-).

I work with Matt Cutts and other engineers in the Search Quality Team at Google. And yes, we noticed that lots of subdomains got indexed last week – and sometimes listed in search results – that shouldn't have been. Compounding the issue, our result count estimates in these contexts was MANY orders of magnitude off. For example, the one site that supposedly had 5.5 billion pages in the index actually had under 1/100,000th of that.

So how did this happen? We pushed some corrupted data with our index. Once we diagnosed the problem, we started rolling the data back and pushed something better... and we've been putting in place checks so that this kind of thing doesn't happen again.

Posted by: Adam Lasnik June 20, 2006 10:44 PM

-----------------------------------------------------------------

   by AdamLasnik on 6/17/06
[comment buried, show commenthide comment] + 2 diggs Hey there,

I'm on the Search Quality Team at Google. We're aware of the situation that's been discussed here.

In particular, I wanted to note a couple of things:
1) Our engineers recently noticed that our site: queries (number of results listed for a search) were showing bizarre results. This has turned out to be tied to a bad data push, and we're fixing this right now.
2) I am *NOT* on the AdSense team, but I'd hope common sense would suggest to fellow Digg'ers that a click on an AdSense ad does not necessarily result in a charge to an advertiser and a credit to a publisher.

Search-Engines-Web.com [PersonRank 10]

18 years ago #

Just for the record to be TECHNICALLY accurate AL is a Search Evangelist

http://www.mattcutts.com/blog/better-conversations/

http://www.bladam.com/main/entry/now-a-googler/

I am Google's first "Search Evangelist," working to improve communications between Webmasters and Google. As you can imagine, this position is quite a challenge, an adventure, and an honor.

Philipp Lenssen [PersonRank 10]

18 years ago #

Thanks, I added it to the frontpage article!

Forum home

Advertisement

 
Blog  |  Forum     more >> Archive | Feed | Google's blogs | About
Advertisement

 

This site unofficially covers Google™ and more with some rights reserved. Join our forum!