Google notes that they only do this form submission for “GET” forms. A form using GET results in a parametrized URL like example.com/show?foo=bar. The guideline for webmasters is that a GET request should never actually change data on the server, such as triggering a user registration; for such things, webmasters should use POST, which Googlebot will not submit. Google also notes that they “omit any forms that have a password input or that use terms commonly associated with personal information such as logins, userids, contacts, etc.” Plus, Google says that the pages they find will not reduce the PageRank of other pages on the site.
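To make the rules above a bit more concrete, here is a minimal Python sketch of how a crawler might decide whether a form is safe to submit, and what URL a GET submission would produce. This is only an illustration of the stated criteria, not Google's actual code: the function names, the `(name, input_type)` field representation and the `SENSITIVE_TERMS` list are all assumptions made for the example, and Google has not published the real list of terms it uses.

```python
from urllib.parse import urlencode

# Hypothetical term list for illustration; the terms Google really uses are not public.
SENSITIVE_TERMS = ("login", "userid", "contact", "password")


def is_crawlable_form(method, fields):
    """Return True if a form passes the checks described in the post.

    `fields` is a list of (name, input_type) tuples, e.g. [("foo", "text")].
    """
    # Only GET forms are considered; POST may change data on the server.
    if method.lower() != "get":
        return False
    for name, input_type in fields:
        # Skip any form that has a password input.
        if input_type.lower() == "password":
            return False
        # Skip forms whose field names suggest personal information.
        if any(term in name.lower() for term in SENSITIVE_TERMS):
            return False
    return True


def build_get_url(action, values):
    """Build the parametrized URL that a GET submission would request."""
    return action + "?" + urlencode(values)


if __name__ == "__main__":
    form_fields = [("foo", "text")]
    if is_crawlable_form("GET", form_fields):
        print(build_get_url("http://example.com/show", {"foo": "bar"}))
```

The key point is that a GET submission is nothing more than a request for a URL, which is why it should be safe to crawl as long as the site follows the convention that GET never changes server data.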
With this move, Google digs a bit deeper than before, which may result in more relevant results for searchers and a smaller “deep web.” If webmasters have misconfigured their scripts or robots.txt files so that their site goes against net standards, it may also cause a bit of new confusion for some. On the other hand, the move has the potential to help webmasters with such misconfigurations, especially those who aren’t very knowledgeable about web accessibility or SEO and who don’t put up crawlable links to all their sub-pages. (Conversely, if Googlebot continues to get smarter about what it crawls, in the long run some web developers may see less incentive to remove small inaccessibilities on their sites.)
[Thanks Miss Universe! Sketch drawn by MMOArt.]