I noticed that the home page of my website received an unusually high number of requests on 8 March 2007. On inspection of my logs, it seemed that Googlebot had apparently requested my home page a whopping 15,475 times in just 1 hour 24 minutes (between 06:52 and 08:16).
However... the requesting IP address was 75.126.156.50 – which doesn't seem to be one owned by Google: http://www.dnsstuff.com/tools/whois.ch?ip=75.126.156.50
Did anyone else notice this in their logs on that day? Is Googlebot spoofing popular? This is the first time I've seen anything noticeable in my logs, but I guess many other bots could easily spoof as Googlebot when they're crawling sites or "borrowing" content.
(By comparison, the "real" Googlebot had requested the same page 6 times that day.)
Edit: The same IP address also requested the same page on the previous day (7 March 2007) 8,878 times – again spoofing as Googlebot. |
Could you publish source IP and a thin slice of your log ? |
Btw.. whast the "Bot Obedience" rules for spidering ?? |
Funky .. could be feedfetcher playing up...!! |
You can let same IP and Same header success your one page for not more than 10 time one day. But they can build a transparent router that can fake the IP and Header at multi-names.
|
You can easily fake the user-agent, maybe a HTTrack-like software which fake is user-agent.
But you can fake IPs of googlebot too ;-) (Tony, watch your logs in some minutes ;-]) |
I see your requests TOMHTML, although they're not using a Google-owned IP address either.
(I know you can easily fake the user-agent – which is what I meant by "spoofing" – as I often do this myself when testing things.) |
It wasn't an IP address of Google??? :-S Contact me by mail, I'm sure there is something wrong.. |