Google Blogoscoped

Forum

The Problems With the (Great) Wayback Machine  (View post)

Luke [PersonRank 0]

Thursday, May 14, 2009
11 years ago4,334 views

check out Ebay in 1997 web.archive.org/web/1997061400 ...

Son Dang [PersonRank 0]

11 years ago #

With the innumerable sites and myriad of browsers out there, this is the best you're going to get. As far as I know, there isn't another website out there that does this (Google's cache gets updated regularly).

The errors in the middle were probably due to the Archive crawling the site at the wrong time. And it's impossible to get one single view for a website b/c there are so many browsers out there.

Manoj Nahar [PersonRank 4]

11 years ago #

Since Google crawls the websites all the time, I think they are best placed to maintain archives of websites they crawl. They just need to hold on to the cache and not flush it.

Roger Browne [PersonRank 10]

11 years ago #

Longer-term, multiple archives are being set up. Already the Bibliotheca Alexandrina in Egypt runs the Wayback Machine user interface on top of its copy of the Internet Archive. There are plans for similar archives in Europe and elsewhere.

Over time, I expect these archives will diverge and develop their own specializations. For example, Alexandrina has started a project to scan and add their own Arabic books.

"Take me back" at Bibliotheca Alexandrina
bibalex.org/isis/frontend/arch ...

Bibliotheca Alexandrina Press Release (see last paragraph)
archive.org/about/bibalex_p_r. ...

Luka [PersonRank 10]

11 years ago #

What if Google had taken screen captures of sites each time they index the content ?

This could be a great way to monitor change but hard to do due to disk space problem.

This thread is locked as it's old... but you can create a new thread in the forum. 

Forum home

Advertisement

 
Blog  |  Forum     more >> Archive | Feed | Google's blogs | About
Advertisement

 

This site unofficially covers Google™ and more with some rights reserved. Join our forum!