In Pictures: 9 open source Big Data technologies to watch

With more and more companies storing more and more data and hoping to leverage it for actionable insights, Big Data is making a big splash these days. Open source technology is at the core of most Big Data initiatives. Here are nine key open source Big Data technologies to keep an eye on.

In Pictures: 9 open source Big Data technologies to watch prev next

Loading...

Cascading

An open source software abstraction layer for Hadoop, Cascading allows users to create and execute data processing workflows on Hadoop clusters using any JVM-based language. It is intended to hide the underlying complexity of MapReduce jobs. Cascading was designed by Chris Wensel as an alternative API to MapReduce. It is often used for ad targeting, log file analysis, bioinformatics, machine learning, predictive analytics, Web content mining and ETL applications. Commercial support for Cascading is offered by Concurrent, a company founded by Wensel after he developed Cascading. Enterprises that use Cascading include Twitter and Etsy. Cascading is available under the GNU General Public License.

Prev Next 3/9

Close

In Pictures: 9 open source Big Data technologies to watch

9 images
Shopping.com

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?