In Pictures: 9 open source Big Data technologies to watch

As companies store ever more data and look to mine it for actionable insights, Big Data is drawing heavy attention. Open source technology is at the core of most Big Data initiatives. Here are nine key open source Big Data technologies to keep an eye on.

Apache Hadoop

Apache Hadoop is an open source software framework for data-intensive distributed applications, originally created by Doug Cutting to support his work on Nutch, an open source Web search engine. To meet Nutch's multimachine processing requirements, Cutting implemented a MapReduce facility and a distributed file system that together became Hadoop. He named it after his son's toy elephant. Hadoop's distributed file system splits Big Data into pieces stored across a series of nodes running on commodity hardware, and MapReduce processes those pieces in parallel on the nodes that hold them. Hadoop is now among the most popular technologies for storing the structured, semi-structured and unstructured data that make up Big Data. Hadoop is available under the Apache License 2.0.
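
To make the MapReduce idea concrete, here is a minimal single-process sketch of a word count in Python. It is not Hadoop's API and does not run on a cluster: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample text are assumptions made up for illustration. Each string in the list stands in for a piece of data that would sit on a separate node.

```python
from collections import defaultdict


def map_phase(chunk):
    """Emit a (word, 1) pair for every word in one chunk of text."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Group emitted values by key, the step that sits between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all values seen for one key into a single total."""
    return key, sum(values)


def word_count(chunks):
    """Run map over every chunk, group the results, then reduce each group."""
    emitted = []
    for chunk in chunks:  # on a real cluster, chunks would be mapped in parallel
        emitted.extend(map_phase(chunk))
    grouped = shuffle(emitted)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input: two pieces of a larger data set.
chunks = ["big data is big", "open source big data"]
counts = word_count(chunks)
print(counts)
```

Because each chunk is mapped independently of the others, a framework can hand chunks to different machines, which is the property the sketch is meant to show.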

