Google Blogoscoped

Forum

[Meta] how does blogoscoped work out a post is duplicated

dave [PersonRank 0]

Thursday, February 14, 2008
12 years ago2,305 views

I just tried to post here on google clock icons and it told me a duplicate post already exists – how the hell did it do that – the title wasn't identical by any means. Is this google blogoscoped AI at work?! (;

Philipp Lenssen [PersonRank 10]

12 years ago #

I believe I'm first breaking up the words in the title, then normalize the words (and ignore common words like "the"), and then check all blog and forum posts from the last two months in the database for multiple (over 3) matches among these words. There's a little bug right now with certain embolding of result titles and snippets which I still need to work on...

dave [PersonRank 0]

12 years ago #

well I was quite impressed! Mind you, it did stall for quite a while as it looked it up. I love following your experiments philipp, you're a creative kindred spirit!

This thread is locked as it's old... but you can create a new thread in the forum. 

Forum home

Advertisement

 
Blog  |  Forum     more >> Archive | Feed | Google's blogs | About
Advertisement

 

This site unofficially covers Google™ and more with some rights reserved. Join our forum!