My reasoning is simple: the Toolbar includes an advanced “show PageRank” option which, when turned on, must send a page’s URL to a Google server, so that this server will then be able to return the PageRank value. I think grabbing this value for other purposes as well would be mostly fair and square, too, because a) Google warns about the advanced feature, b) a page which it would index would be public anyway, and c) it would help get some “deep web” crawling going, in tune with Google’s mission & motivation.
Well, it turns out my suspicion was dead-wrong. Here’s the setup of the bet with Matt that was meant to prove this, as far as possible:
It was an illuminating experiment, because after some months, a search for blogoscoped55521384239 only resulted in the page where Matt and I discussed the bet (this part was obvious), but not the semi-hidden page on my server... which is located here. In other words, nope, the Google Toolbar won’t index pages (as far as this experiment was able to find proof for that, and I can’t think of any setup that would yield final proof – though Matt could walk up to the Toolbar team and ask, of course). This is not to say that toolbars by other vendors act the same way... if anyone wants to continue the experiment on say, the Yahoo toolbar, I’m curious about the results.
So congrats to Matt, and hope you or your friends enjoy the copy of my book! :)
>> More posts