In Pictures: 9 open source Big Data technologies to watch

With more and more companies storing more and more data and hoping to leverage it for actionable insights, Big Data is making a big splash these days. Open source technology is at the core of most Big Data initiatives. Here are nine key open source Big Data technologies to keep an eye on.

Cascading

An open source software abstraction layer for Hadoop, Cascading allows users to create and execute data processing workflows on Hadoop clusters using any JVM-based language. It is intended to hide the underlying complexity of MapReduce jobs. Cascading was designed by Chris Wensel as an alternative API to MapReduce. It is often used for ad targeting, log file analysis, bioinformatics, machine learning, predictive analytics, Web content mining and ETL applications. Commercial support for Cascading is offered by Concurrent, a company founded by Wensel after he developed Cascading. Enterprises that use Cascading include Twitter and Etsy. Cascading is available under the GNU General Public License.

3 of 9
VIEW ALL THUMBNAILS

Best Deals on PCWorld

TabletsView all »
NotebooksView all »
Mobile PhonesView all »
Printers & ScannersView all »
Networking, Wireless & VoIPView all »

Compare & Save

Deals powered by WhistleOut
Use WhistleOut's technology to compare:
Mobile phone plans & deals
Mobile phone models
Mobile phone carriers
Broadband plans & deals
Broadband providers
Deals powered by WhistleOut
WhistleOut

rhs_login_lockSign up to PC World Today for the latest news, reviews and galleries from PC World Australia.