Microsoft expands Azure Data Lake with new big data tools

A new, dynamically scalable analytics service is built on Apache YARN

Microsoft had its sights set squarely on big data when it introduced its Azure Data Lake earlier this year, and on Monday it broadened that effort with new tools designed to make big data processing and analytics simpler and more accessible.

First, what Microsoft originally called Azure Data Lake has now been renamed Azure Data Lake Store, offering a single repository for data of any size and type -- including unstructured, semi-structured and structured -- without requiring application changes as data scales.

Data can be securely shared there and made accessible for processing and analytics. It can be acquired in real-time from sensors and devices for Internet of Things (IoT) applications, for example, or from online shopping websites, all without restrictions on account or file size.

Available in preview later this year, the store is compatible with the Hadoop Distributed File System (HDFS), so Hadoop distributions such as Hortonworks, MapR and Cloudera can readily access the data for processing and analytics, Microsoft said.

Second, Azure Data Lake Analytics adds to the storage portion of Azure Data Lake with a new, dynamically scalable analytics service built on Apache YARN that will also be available in preview later this year.

The new analytics service includes the U-SQL query language, whose scalable and distributed query capability allows users to efficiently analyze data in the Azure Data Lake Store and across SQL Servers in Azure, Azure SQL Database and Azure SQL Data Warehouse, Microsoft said.

Finally, Microsoft's Azure HDInsight is now included in Azure Data Lake as well, offering a fully managed Apache Hadoop cluster service with open-source analytics engines including Hive, Spark, HBase and Storm. As of Monday, managed clusters on Linux are generally available with a service-level agreement (SLA) specifying 99.9 percent uptime.

Also supporting the Azure Data Lake are Azure Data Lake Tools for Visual Studio, which provide an integrated development environment that spans the Azure Data Lake, and leading Hadoop applications from independent software vendors spanning security, governance, data preparation and analytics, Microsoft said.

Pricing details were not immediately available.

Join the PC World newsletter!

Error: Please check your email address.

Tags Microsoft

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Katherine Noyes

IDG News Service
Show Comments

Father’s Day Gift Guide

Most Popular Reviews

Latest News Articles

Resources

GGG Evaluation Team

Kathy Cassidy

STYLISTIC Q702

First impression on unpacking the Q702 test unit was the solid feel and clean, minimalist styling.

Anthony Grifoni

STYLISTIC Q572

For work use, Microsoft Word and Excel programs pre-installed on the device are adequate for preparing short documents.

Steph Mundell

LIFEBOOK UH574

The Fujitsu LifeBook UH574 allowed for great mobility without being obnoxiously heavy or clunky. Its twelve hours of battery life did not disappoint.

Andrew Mitsi

STYLISTIC Q702

The screen was particularly good. It is bright and visible from most angles, however heat is an issue, particularly around the Windows button on the front, and on the back where the battery housing is located.

Simon Harriott

STYLISTIC Q702

My first impression after unboxing the Q702 is that it is a nice looking unit. Styling is somewhat minimalist but very effective. The tablet part, once detached, has a nice weight, and no buttons or switches are located in awkward or intrusive positions.

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?