Google moves two cloud data analysis services out of beta

Google Cloud Dataflow and Google Pub/Sub are both now available as commercial services

Cloud Dataflow provides a unified computation model for batch and streaming processing

Cloud Dataflow provides a unified computation model for batch and streaming processing

Two Google big data toolsets have finally moved out of beta and into full commercial release, adding to its cloud portfolio a data analysis framework and a service for managing data streams in real-time.

Google Cloud Dataflow, which could serve as a possible replacement for Hadoop, provides a framework for fusing different sources of data within one processing pipeline. Google Cloud Pub/Sub is the company's service for managing data streams in real time.

The two services fill out Google's roster of cloud-based data analysis tools, joining Google BigQuery, a commercial service for analyzing large sets of unstructured data.

These services require less maintenance and operational oversight than in-house data processing systems, Google said in a blog post Wednesday.

Both services were announced at the Google I/O 2014 conference, and have been available as public beta trials for some time.

As full-fledged commercial offerings, these services are now fully integrated into the Google Cloud Platform, Google's collection of tools for orchestrating cloud-based operations.

Customers have been using the Google Cloud Platform for tasks such as financial fraud detection, genomics analysis, inventory management, click-stream analysis, and user interaction testing.

Google Dataflow provides a unified programming model for handling different sources of data, including both batch and streaming data sources, eliminating the need for complex ETL (extract, transform, and load) software.

Dataflow can also serve as a speedier alternative for crunching large amounts of unstructured data, compared to the batch-processing-oriented Hadoop, Google claimed.

Salesforce.com is using Dataflow to augment its Salesforce Wave business intelligence service, while digital marketing firm Qubit uses it to track customer web interactions in real time.

Google Cloud Pub/Sub can serve as a messaging system, providing a way for data analysis systems to work from a stream of fresh data as it is generated. It can handle up to a million messages a second, which it can push to other Google analysis services such as Dataflow.

The beta version of the service has already delivered over a trillion messages to users.

Pub/Sub starts at $0.40 for the first 250 million messages, with the cost going down for greater usage. Cloud Dataflow pricing is based on a per job basis, depending on the time it takes to complete an operation and the amount of data that must be moved around.

Google also announced that it supports Cloudera Hadoop distributions in its cloud. Users can run copies of Cloudera Express and the Cloudera Enterprise Hadoop distributions on Google Cloud Platform.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com

Join the newsletter!

Error: Please check your email address.
Rocket to Success - Your 10 Tips for Smarter ERP System Selection

Tags cloud computinginternetGoogleManaged Services

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Joab Jackson

IDG News Service
Show Comments

Cool Tech

Breitling Superocean Heritage Chronographe 44

Learn more >

SanDisk MicroSDXC™ for Nintendo® Switch™

Learn more >

Toys for Boys

Family Friendly

Panasonic 4K UHD Blu-Ray Player and Full HD Recorder with Netflix - UBT1GL-K

Learn more >

Stocking Stuffer

Razer DeathAdder Expert Ergonomic Gaming Mouse

Learn more >

Christmas Gift Guide

Click for more ›

Most Popular Reviews

Latest Articles

Resources

PCW Evaluation Team

Edwina Hargreaves

WD My Cloud Home

I would recommend this device for families and small businesses who want one safe place to store all their important digital content and a way to easily share it with friends, family, business partners, or customers.

Walid Mikhael

Brother QL-820NWB Professional Label Printer

It’s easy to set up, it’s compact and quiet when printing and to top if off, the print quality is excellent. This is hands down the best printer I’ve used for printing labels.

Ben Ramsden

Sharp PN-40TC1 Huddle Board

Brainstorming, innovation, problem solving, and negotiation have all become much more productive and valuable if people can easily collaborate in real time with minimal friction.

Sarah Ieroianni

Brother QL-820NWB Professional Label Printer

The print quality also does not disappoint, it’s clear, bold, doesn’t smudge and the text is perfectly sized.

Ratchada Dunn

Sharp PN-40TC1 Huddle Board

The Huddle Board’s built in program; Sharp Touch Viewing software allows us to easily manipulate and edit our documents (jpegs and PDFs) all at the same time on the dashboard.

George Khoury

Sharp PN-40TC1 Huddle Board

The biggest perks for me would be that it comes with easy to use and comprehensive programs that make the collaboration process a whole lot more intuitive and organic

Featured Content

Product Launch Showcase

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?