Getting data into the cluster is just the beginning of the fun. Hive is designed to regularize the process of extracting bits from all of the files in HBase. It offers an SQL-like language that will dive into the files and pull out the snippets your code needs. The data arrives in standard formats, and Hive turns it into a query-able stash.
The image at left shows a snippet of Hive code for creating a table, adding data, and selecting information.
Hive is distributed by the Apache project at http://hive.apache.org/