Amount of data stored doubles in three years

If you're feeling overwhelmed by information overload lately, you may not be alone. The amount of new information stored on various media such as hard drives has doubled in the past three years, to five exabytes of new information produced in 2002, according to a study released Tuesday by the University of California, Berkeley.

That's exabytes, as in one byte with 18 zeros behind it, six zeros more than a terabyte. The amount of information put into storage in 2002, five exabytes, was equal to the contents of a half a million new libraries, each containing a digitized version of the print collection of the entire U.S. Library of Congress, according to the study by professors Peter Lyman and Hal Varian of the UC Berkeley School of Information Management and Systems. The professors estimated that between two and three exabytes of information was generated in 1999.

Most of that data -- 92 percent of it -- was stored on magnetic media, primarily hard drives, the study estimates.

The study, a follow-up to a 2000 study by UC Berkeley, doesn't dwell on how people and companies process these massive amounts of information coming at them, Lyman said, but his next goal is to produce a study examining that very issue. "I'm going to spend the next year on the consumption of information," he said. "How do people make sense of this? How do they cope?"

The current study doesn't address the quality of information and how people choose good information sources, he added. Significant differences exist in the "accessibility and usability and trustworthiness" of information between various sources, Lyman noted. "We treated it all the same, simply to understand how much there was ... but when you get into consumption, the discrimination over the quality of information, and how you make that decision, really becomes important," he added.

With the amount of stored information growing at a rate of about 30 percent a year, a "real change in our human ecology" is taking place, said Lyman, who presented the study at a conference in Florida Tuesday. "Everything is public," he said. "Everything is on the record."

One problem with all this information being stored is that it's not always accurate, he added. As information passes through multiple hands, it can be condensed or mischaracterized. So commentaries or reports on a speech or a paper Lyman gave 20 years ago sometimes contain distortions, he said.

"There are multiple renditions, only one of which I remember," he added.

The study underscores the need for companies to smartly manage their information, said Gil Press, director of corporation information at EMC Corp., an information storage vendor and a sponsor of the study. But IT solutions aren't the only answer, because humans still need to look at information with a critical eye, he added.

"We are getting swamped, and we need better ways to organize and manage information," Press said. "Hopefully, information technology will never replace smart thinking and the human analytical thinking."

The amount of stored information is not all the information that's being produced. Electronic channels -- including TV, radio, the telephone and the Internet -- produced three and a half times as much information as was stored in 2002. Most of that information was exchanged through voice telephone calls and not recorded or stored, Lyman said. The telephone accounts for the largest percentage of information flow -- 17.3 exabytes if stored in digital form -- followed by e-mail, which generates about 400,000 terabytes of new information each year, the study's authors said.

The researchers estimated that the World Wide Web contains 172 terabytes of information on public pages.

The UC Berkeley researchers used various methods to estimate the amount of information generated and stored, including statistics such as hard drive and paper sales, publication statistics and a sampling of the Web. The research team's methods are described in more detail at

One surprise for Lyman was that while digital storage continues to grow, the use of paper to transmit information is not shrinking. His team estimated that the number of terabytes of information put on paper each year increased by 36 percent from 1999 to 2001, while the amount of data stored magnetically each year increased by 80 percent between 1999 and 2002.

North Americans each consume 11,916 sheets of paper each year, while residents of the European Union consume 7,280 sheets, the team estimated. The majority of that paper information is produced by office documents and mail, not in formally published titles such as books or newspapers.

Join the newsletter!


Sign up to gain exclusive access to email subscriptions, event invitations, competitions, giveaways, and much more.

Membership is free, and your security and privacy remain protected. View our privacy policy before signing up.

Error: Please check your email address.
Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Grant Gross

IDG News Service
Show Comments

Brand Post

Most Popular Reviews

Latest Articles


PCW Evaluation Team

Luke Hill


I need power and lots of it. As a Front End Web developer anything less just won’t cut it which is why the MSI GT75 is an outstanding laptop for me. It’s a sleek and futuristic looking, high quality, beast that has a touch of sci-fi flare about it.

Emily Tyson

MSI GE63 Raider

If you’re looking to invest in your next work horse laptop for work or home use, you can’t go wrong with the MSI GE63.

Laura Johnston

MSI GS65 Stealth Thin

If you can afford the price tag, it is well worth the money. It out performs any other laptop I have tried for gaming, and the transportable design and incredible display also make it ideal for work.

Andrew Teoh

Brother MFC-L9570CDW Multifunction Printer

Touch screen visibility and operation was great and easy to navigate. Each menu and sub-menu was in an understandable order and category

Louise Coady

Brother MFC-L9570CDW Multifunction Printer

The printer was convenient, produced clear and vibrant images and was very easy to use

Edwina Hargreaves

WD My Cloud Home

I would recommend this device for families and small businesses who want one safe place to store all their important digital content and a way to easily share it with friends, family, business partners, or customers.

Featured Content

Product Launch Showcase

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?