Skip to main content

The Internet Archive is now home to 10 petabytes of data

The Internet Archive is now home to 10 petabytes of data

Share this story

GOOGLE Connie Zhou data center server
GOOGLE Connie Zhou data center server

The Internet Archive is now storing 10,000,000,000,000,000 bytes of "cultural material." That equates to ten petabytes, or ten million gigabytes of information. The archive is tasked with preserving the internet for posterity, and hosts over a million public videos and almost 1.5 million audio files. It's also home to the Wayback Machine, which lets you see what sites were like back in 1997 (or what theverge.com pointed in 2003).

While ten petabytes may pale in comparison to the data held by internet giants like Facebook (which stores more than 100 petabytes of photo and video alone), it's still a significant milestone for the archive. Recent months have seen the non-profit expand its cataloguing efforts to more traditional media, and embrace BitTorrent links for faster downloads. It also announced that it's offering up 80 terabytes of data from a crawl that covered 2.7 billion pages to worthy research teams.