Skip to main content
ad info

 
CNN.com technology > computing
  Editions | myCNN | Video | Audio | Headline News Brief | Feedback  

 

  Search
 
 

 
TECHNOLOGY
TOP STORIES

Consumer group: Online privacy protections fall short

Guide to a wired Super Bowl

Debate opens on making e-commerce law consistent

(MORE)

TOP STORIES

More than 11,000 killed in India quake

Mideast negotiators want to continue talks after Israeli elections

(MORE)

MARKETS
4:30pm ET, 4/16
144.70
8257.60
3.71
1394.72
10.90
879.91
 


WORLD

U.S.

POLITICS

LAW

ENTERTAINMENT

HEALTH

TRAVEL

FOOD

ARTS & STYLE



(MORE HEADLINES)
*
 
CNN Websites
Networks image


Company aims to preserve Web history

 larger 
Bytes of digital history: early version of CNN.com's nav bar (above); story page from August 1995 (below)  

July 7, 2000
Web posted at: 2:02 p.m. EDT (1802 GMT)

SAN FRANCISCO (CNN) -- The Internet provides a unique glimpse into the lives of ordinary people, much like newspapers of old, but little is being done to preserve Web pages for future historians. One non-profit company is trying to change that.

"We have a shadow of the world that we're able to capture and make available to the future," said Brewster Kahle, founder of the Internet Archive.

The Internet Archive is a massive collection of Web sites donated by the Alexa Internet, an arm of Amazon.com. It preserves those Web pages that would otherwise be wiped from computer memories and lost forever.

"The way we're able to pull this off is by having robots that go around and contact every Web server around the world periodically, and download each page -- each image -- off of every one of those sites," Kahle said.

The amount of information stored already surpasses in volume the entire contents of the Library of Congress.

Begun in 1996, the collection included only text until this year, when the Internet Archive began collecting images at a rate of about 200 images every five seconds.

But many Web pages already are lost.

"I don't known if John McCain this year saved some of the Web pages that documented the fact that he raised a tremendous amount of his campaign funds on the Internet," said David Allison, chairman of the Division of IT and Society at the Smithsonian Institution. "Maybe that moment in history is gone."

Some organizations activate a filter so companies like the Internet Archive can't archive their content.

For now, the Internet Archive stores part of its collection on the company's premises, but Kahle hopes the archives will be available online in a few years.

Why save the entire Internet, when some would argue that most of it is junk?

Referring to newspapers of the past, Kahle said, "If we had been selective, we probably would have kept all the articles and thrown away those ads, but it's the ads that the historians really like. That's what gives them a much better glimpse of what life was like."

CNN Science Correspondent Ann Kellan contributed to this report.



RELATED STORIES:
New Web site aims to help families
July 6, 2000
What's on Excite tonight?
July 3, 2000
Wanted: A Web for all of us
July 3, 2000
New domains at last
June 27, 2000
Voters can now register online
June 23, 2000

RELATED SITES:
The Internet Archive


Note: Pages will open in a new browser window
External sites are not endorsed by CNN Interactive.
 Search   

Back to the top  © 2001 Cable News Network. All Rights Reserved.
Terms under which this service is provided to you.
Read our privacy guidelines.