Google Blogoscoped

Forum

Google Cricket Added to Robots.txt

Philipp Lenssen [PersonRank 10]

Thursday, March 8, 2007
17 years ago7,919 views

Google added yet another folder to their robots.txt file < http://www.google.com/robots.txt >, excluding it from web spiders: /cricket? and /cricket/. None of these URLs resolve to something on Google.com, however, other country domains show a JavaScript snippet. Apparently, this is used for the inclusion of a Google Desktop cricket live scores gadget, so no new product can be expected along these lines. See Digital Inspiration's post from 2006: < http://labnol.blogspot.com/2006/05/cricket-mania-google-bowls-googly.html > and http://www.google.co.in/cricket

TOMHTML [PersonRank 10]

17 years ago #

Cricket is the code name of Google Talk!!!
http://www.zorgloob.com/2006/11/inside-google-france-discussion-avec.asp

Suresh S [PersonRank 10]

17 years ago #

World cup going to start in west indies

Alaskan Carnivor [PersonRank 1]

17 years ago #

I'll take a stab.. It is a Google App/Google Pages Google Talk widget for a MNVO Called Leap Wireless. They have device/service called... drum roll please... Cricket .. .. details http://www.leapwireless.com/l1_our_cricket_service.htm .. Should port nicely on the Google Phone.. GooPhone maybe? :-)

/pd [PersonRank 10]

17 years ago #

the mobile space is on the move.. but whats up is intriguing..
yesterday I heard chatter that a new phone+service is going to be luanched

today, the bot.txt file is changed.. somethings in the air..

but cricket could also be a widget just for the WorldCup.. after all ealy 500Million folks in the asian subcontient track this event /daily.. so one can see the revenue potentials via this method!! :)_

Alaskan Carnivor [PersonRank 1]

17 years ago #

oops.. MVNO I meant to say..

Daniel Garcia [PersonRank 6]

17 years ago #

www.mycricket.com looks well!!

Alaskan Carnivor [PersonRank 1]

17 years ago #

Yep.. www.mycricket.com is owned by LeapWireless BTW..

Philipp Lenssen [PersonRank 10]

17 years ago #

One thing to remember, the chances are low I think that Google would add an internal code name to their public robots.txt, so it's probably not the code name... but who knows?!

Haochi [PersonRank 10]

17 years ago #

Cricket World Cup
Ionut got the scope. :)
http://googlesystem.blogspot.com/2007/03/special-google-page-for-cricket-world.html

http://www.google.co.in/cricket/contest.html

Ionut Alex. Chitu [PersonRank 10]

17 years ago #

Nope. It's the same URL from the first post of this thread.

Haochi [PersonRank 10]

17 years ago #

Oops.

Mambo [PersonRank 10]

17 years ago #

Is there no way to make robots.txt unaccessible to the public, but visible to Web crawlers?

Philipp Lenssen [PersonRank 10]

17 years ago #

And here's the cricket world cup logo with thanks to http://zorgloob.com :
http://www.google.co.za/logos/cricket07.gif
http://www.google.co.za/

> Is there no way to make robots.txt unaccessible
> to the public, but visible to Web crawlers?

Then you'd need not only an authentication system, but also a standards organization which decides just who is authorized to create web crawlers... that would be a serious problem for innovation. At this moment, everyone's allowed to write their own search engine and let it index parts of the web. (In practice you need some pretty robust servers to do so, but at least you are allowed to in theory.)

/pd [PersonRank 10]

17 years ago #

So How many cricket fans are in this forum ??

It will kewl to see a cricket gadget that I can place on my sidebar or hompy.. just like the ones we had for FIFA :)-

Tony Ruscoe [PersonRank 10]

17 years ago #

> Is there no way to make robots.txt unaccessible to
> the public, but visible to Web crawlers?

If you knew all the user agents you wanted to access it, this would be possible. But since there are so many user agents out there (both bot and non-bot clients) it wouldn't really be recommended.

Mott [PersonRank 1]

17 years ago #

If you look at http://www.google.co.in/cricket you'll realise now why /cricket was excluded in robots.txt!

It means what is says – cricket the game – not GTalk. I love how rumours get started – good to see the buzz about potential new Google products continues :)

Forum home

Advertisement

 
Blog  |  Forum     more >> Archive | Feed | Google's blogs | About
Advertisement

 

This site unofficially covers Google™ and more with some rights reserved. Join our forum!