In Pictures: 15 high-impact Apache projects

From Harmony to Hadoop, Apache has been a powerful contributor to the open source ecosystem




Pig is used to analyze large data sets, featuring parallelization and a high-level language for data analysis algorithms. Developers can use Pig instead of writing Java code when using Hadoop.

“You can think of Pig as an abstraction layer on top of Hadoop,” says Daniel Dai, a committer on the project. Pig is so named because of its ability to eat everything data-wise, Dai says. “It consumes all kinds of data.”

Users can build their own functions for special-purpose processing. The forthcoming upgrade, Pig 0.11.0, will feature performance enhancements and two new operators: cube, for calculating aggregates across multiple dimensions, and rank, for ranking. Pig developers would like Pig to eventually be independent of Hadoop, but right now it is Hadoop-dependent, Dai says.
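To make the idea of "multiple dimension aggregates" concrete, here is a minimal sketch in plain Python of what a cube-style operation and a rank-style operation compute. It is an illustration only, not Pig Latin and not Pig's implementation; the function names `cube` and `rank`, the sample `sales` records, and the use of `None` to mark a rolled-up dimension are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import combinations


def cube(rows, dims, measure):
    """Sum `measure` for every subset of `dims`.

    A dimension that has been rolled up (aggregated away) is marked None.
    """
    totals = defaultdict(int)
    for row in rows:
        # Each row contributes to one group per subset of the dimensions.
        for size in range(len(dims) + 1):
            for kept in combinations(dims, size):
                key = tuple(row[d] if d in kept else None for d in dims)
                totals[key] += row[measure]
    return dict(totals)


def rank(rows, field):
    """Pair each row with a 1-based rank by descending `field`.

    Rows with equal values share a rank; the next rank skips ahead.
    """
    ordered = sorted(rows, key=lambda r: r[field], reverse=True)
    ranked = []
    for i, row in enumerate(ordered):
        if i > 0 and row[field] == ordered[i - 1][field]:
            position = ranked[-1][0]  # tie: reuse the previous rank
        else:
            position = i + 1
        ranked.append((position, row))
    return ranked


# Hypothetical sample data for illustration.
sales = [
    {"region": "east", "product": "apple", "units": 3},
    {"region": "east", "product": "pear", "units": 5},
    {"region": "west", "product": "apple", "units": 2},
]

totals = cube(sales, ["region", "product"], "units")
for key in sorted(totals, key=str):
    print(key, totals[key])

for position, row in rank(sales, "units"):
    print(position, row)
```

With two dimensions, every record feeds four groups: the full (region, product) pair, each dimension on its own, and the grand total. That fan-out is why this kind of aggregation benefits from the parallel execution Pig provides on Hadoop.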


