Google Blogoscoped


PDF files with authors' names in SERPs

TOMHTML [PersonRank 10]

Wednesday, August 13, 2008
16 years ago, 10,423 views

When you search for PDF files, Google now displays the authors and the publication date of the PDF file, but only when that information is available.

It may be related to Google Scholar ("cited by...").

Have you ever seen that before?

The 1 comment above was made in the forum before this was blogged.

Philipp Lenssen [PersonRank 10]

16 years ago #

Ionut suspects the data source for this to be Google Scholar:

Christoph [PersonRank 1]

16 years ago #

I can't see the author's name in the examples mentioned above. Seems not to be rolled out to everybody.

Christoph [PersonRank 1]

16 years ago #

I don't see the author in the example query results, but I do see it on other result pages. Whether you can see the author seems to depend on the search.

For example I can see it here

but not here

even if the first result is the same document.

Scott [PersonRank 0]

16 years ago #

I see it on a .htm result when doing a search for BMI.

The result from the CDC shows "by P Room", which could be Press Room as I couldn't find a P Room anywhere on the page.

Stefan [PersonRank 0]

16 years ago #

I don't like this function. May not be that useful as many documents don't use real authors. I hope that they've got filters for the most popular writers of our century, e.g. 'GoLive' and 'Administrator'. ;-)

Juha-Matti Laurio [PersonRank 10]

16 years ago #

Works here, and this is absolutely a new feature.

When searching with the text is localized too:
"kirjoittanut P Belhouchat – 2004"

It appears that Google can dig up the year the document was saved, too.

Colin Colehour [PersonRank 10]

16 years ago #

This feature seems to work with PDF, PS, HTML, Doc ...




This must be a Google Scholar feature that was pushed to searches this month.

Steven [PersonRank 0]

16 years ago #

Not only PDF files.

Philipp Lenssen [PersonRank 10]

16 years ago #

(I added an update, thanks!)

Ionut Alex. Chitu [PersonRank 10]

16 years ago #

This has nothing to do with the PDF/DOC/HTML files. Google uses data from Google Scholar to improve the results for scientific papers.

Some people thought Google uses metadata from the files:

Philipp Lenssen [PersonRank 10]

16 years ago #

Ionut, where does the name "H Another" come from? (I show it in the update.)

Ionut Alex. Chitu [PersonRank 10]

16 years ago #

Well, these links might provide an answer:

"We rely on a document's layout to extract metadata, citations and other information which plays a significant role in relevance ranking."

H Another is most likely the result of a bug. As you can see from this search: Google doesn't find the author correctly all the time. There's an author called "H another error enters Diffey’s" for a text that includes in the middle:

"However another error enters Diffey’s calculations when he assumes that 600 to..."

There's another author called "HOMLT Another" from a text that includes "How One Mistake Leads To Another" as part of the title.
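To illustrate how a byline like "HOMLT Another" could arise, here is a minimal sketch. It assumes, purely for illustration, that a run of title-like words gets mistaken for a person's name and is then shortened citation-style (initials plus last word). The function name `abbreviate_author` is hypothetical, and nothing here is Google's actual extraction code.

```python
def abbreviate_author(words: str) -> str:
    """Collapse every word except the last to its initial.

    Hypothetical citation-style formatter, not Google's real logic:
    "Jane Q Public" would become "JQ Public".
    """
    parts = words.split()
    if not parts:
        return ""
    if len(parts) == 1:
        return parts[0]
    # Initials of all leading words, then the last word kept as the "surname".
    initials = "".join(p[0].upper() for p in parts[:-1])
    return f"{initials} {parts[-1]}"


# Text that is not a name at all still gets a plausible-looking byline.
for text in ("How One Mistake Leads To Another", "Press Room"):
    print(abbreviate_author(text))
```

Under that assumption, the title fragment yields "HOMLT Another", and Scott's "Press Room" guess would yield "P Room". The formatter itself works as intended; the error would come from the earlier step that labels the text as an author.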

Roger Browne [PersonRank 10]

16 years ago #

I can imagine that Knol's "Real Names" will appear in this field in the future.

Ionut Alex. Chitu [PersonRank 10]

16 years ago #

Another weird result:

(the page includes "Appendix B. Changes from CSS1")

The scholar results are hilarious:

Ionut Alex. Chitu [PersonRank 10]

16 years ago #

More examples of papers: 'Citing Articles via Google Scholar'

Juha-Matti Laurio [PersonRank 10]

16 years ago #

The example search

comparing and gives the author information here, when using link too.

Jonathan Wall [PersonRank 0]

16 years ago #

A similar feature that I just saw: is Google treating forum results differently? For example, when I search , I get results that say things like "20 posts – 8 authors" at the top.

Juha-Matti Laurio [PersonRank 10]

16 years ago #

I can't see this post and author information with the test URL listed from my country.


