Google Blogoscoped

Forum

New set of Google domains. Archive Search ?

Sohil [PersonRank 10]

Saturday, September 2, 2006
13 years ago5,145 views

From ZDnet

"The first thing I thought of when I saw domains like google-archivesearch.com (registered Friday) was the WayBackMachine. If my gut instinct is correct, it won't be long before Google launches a similar service that lets us peer into the history of the web."

blogs.zdnet.com/Google/index.p ...

Digged here
digg.com/tech_news/New_set_of_ ...

David Hetfield [PersonRank 10]

13 years ago #

does other companies provide this archive search thing?(like MS or Yahoo!) no, right?

Haochi [PersonRank 10]

13 years ago #

Perhaps "Archive" is referring to "Cached".

TOMHTML [PersonRank 10]

13 years ago #

or it's just a new feature of the "search history"?

Sohil [PersonRank 10]

13 years ago #

[put at-character here] David, no but Yahoo provides a link to Wayback Machine.

[put at-character here] Haochi, ya and Tom makes sense too.

Multiple Cached Versions of Pages. Is there a use for this ?

Garett Rogers [PersonRank 10]

13 years ago #

I imagine it will be similar to www.archive.org (the way back machine)... It's useful to a point – but mostly it's just fun to see what sites looked like years ago.

photoactive [PersonRank 10]

13 years ago #

Yes, or – this is a long shot – might it have something to do with the hinted-at new service in Google News, with Google paying the Associated Press? When the relationship was announced, uninventive me was scratching my head thinking if there really was a way in which Google News could be radically improved . . . The only thing I could come up with was a system of tracking stories back over time: using the AP stories as a reliable summary of what was actually going on on a given day might be a way of organising the history of an unfolding current event – the Israel/Lebanon war, for instance etc etc.

Sorry, probably way off target. But a news archive would be great.

NateDawg [PersonRank 10]

13 years ago #

[put at-character here]photoactive, you may be on the right track. It would make sense. Garett's got a pretty go breakdown on the theories.

blogs.zdnet.com/Google/?p=314

gary price [PersonRank 10]

13 years ago #

I believe this was first posted here:
blogoscoped.com/forum/66090.ht ...

Before sohil's post.

gary price [PersonRank 10]

13 years ago #

Sohil:

APOLOGIES. I was wrong. Your post was first.
I also have several comments on Resourceshelf.
resourceshelf.com/2006/09/02/y ...

and more here on BetaNews.
betanews.com/article/Finding_A ...

Again, apologies.

gary price [PersonRank 10]

13 years ago #

Several people outside of the U.S. have contacted me asking me about services for people who do not have access to the free library services I mention in my posts.

Libraries in Australia and Canada have many of the same services that are found in U.S. libraries. Remote access. Check locally.

Of course, I can't go nation by nation. Here are a look at a few resources available in the UK.

Elsewhere:
1) FindArticles.com
Thousands free, some fee-based

2) LexisNexis by Credit Card
Over 5000 sources. Free to search. Pay $3 per article.
w3.lexis.com/lne/login_en.asp? ...
If that article doesn't work, go to:
lexisnexis.co.uk/news_and_busi ...
and select credit card.

More here:
w3.lexis.com/LNE/ServiceSelect ... (source list)
and
web.lexis-nexis.com/executive/ ...
and

2) Dialog Select
50,000 sources. pay by credit card. Search free.
dialog.com/products/openaccess ...

3) Financial Times Level 2 UK (about 20 Pounds/month)
Access to over 500 global press sources
registration.ft.com/registrati ...

4) IngentaConnect. Over 20 million articles
Search is free.
ingentaconnect.com/

5) British Library Document Delivery Service
Articles direct from the BL.
bl.uk/services/document/articl ...

Tim Finin [PersonRank 1]

13 years ago #

The idea of a Web archive is a great one, and Brewster Kale richly deserves credit for creating the first. If Google does provide a similar service, maybe they will spur new thinking about how to better support searching over archives and what new services can be built given one. For example, knowing how often a Web page changes can have an impact on how much you trust it's information. Wikipedia's archiving of old versions of articles, for example, has been used for this [1].

I think there's a case for having multiple archives of Web pages, focusing on certain kinds of Web pages. At the beginning of 2005 we started a Semantic Web archive [2] by retaining copies of old versions of RDF documents when we discover and process a new version. We hope that it will provide a good resource that researchers can use to explore topics like how ontologies evolve.

[1] ebiquity.umbc.edu/paper/html/i ...
[2] swoogle.umbc.edu/index.php?opt ...

R. Rajesh Jeba Anbiah [PersonRank 1]

13 years ago #

FWIW, I suggested this feature a long time ago groups.google.com/group/google ...

Corsin Camichel [PersonRank 10]

13 years ago #

Has anybody tought about the amount of data this will cost? And I think Google will do it right, not like archive.org. They have very often import images that are missing.

Philipp Lenssen [PersonRank 10]

13 years ago #

I think next to spidering the HTML it makes sense to save & offer screenshots. Who knows if browsers in 2030 will be able to read HTML, JavaScript and Flash from 1999?

Art-One [PersonRank 10]

13 years ago #

Will we be able to read gif, jpeg or png in 2030?
(Sure they make C64 emulations nowadays too ;-))

Sohil [PersonRank 10]

13 years ago #

[put at-character here] Philip, then wouldn't PDF or ODF be better.

Google already uses PDF in Book Search and is a member of the ODF coalition (or something like that)

elyk [PersonRank 6]

13 years ago #

If I remember correctly, each time Google recrawl's the web, they donate the old cached images to archive.org. If they start their own service, would this mean that archive.org has to start doing their own crawling.

This thread is locked as it's old... but you can create a new thread in the forum. 

Forum home

Advertisement

 
Blog  |  Forum     more >> Archive | Feed | Google's blogs | About
Advertisement

 

This site unofficially covers Google™ and more with some rights reserved. Join our forum!