IBM prepares Spark for machine learning

IBM has contributed SystemML language to the Spark community and will offer Spark as a Bluemix service

IBM is putting considerable resources behind Apache Software Foundation's Spark to ready the platform for machine learning duties such as pattern recognition and object classification.

The company plans to offer Spark as a service, and has devoted 3,500 researchers and developers to help in its upkeep and further development.

It is also contributing some of its own software to the Apache project, namely SystemML, a programming language for machine learning tasks, and will work with Databricks, the company that has largely shepherded the development of Spark to date. In machine learning, computer systems can refine their performance on given tasks as they acquire new information.

"Spark represents for us a whole new way of working with data," said Joel Horowitz, director of marketing for IBM analytics. "It is a very powerful in-memory compute engine with a very easy-to-use interface for data scientists and developers."

Spark, which many view as a successor to the Hadoop big data processing platform, is well suited for machine learning tasks, which typically require large clusters of computers to execute.

The latest version of the platform released last week extends it to run machine-learning algorithms.

"Machine learning is a very powerful technique of extracting the essence of value from data," Horowitz said. Machine learning algorithms are especially good at tasks such as automated classification and helping devices sense their surroundings with greater sophistication, he said. Such tasks were previously considered to be too compute-intensive to be carried out on a single server. Spark can coordinate multiple computers to work in tandem.

IBM already offers a number of platform services based on machine learning algorithms, such as language translation and data visualization. The Spark service, which will be available by the end of this month, will allow developers to build and run their own machine learning algorithms, Horowitz said.

Spark will be available on the IBM Bluemix, a set of platform services for developers. The Spark service will provide an easy way to load data, examine the data, and pass the results back to another application, all without the work of setting up the supporting infrastructure.

In the past year, the Spark has grown in popularity, as more organizations have incorporated big-data-level analysis into their operations. Companies such as eBay, NASA, Opentable and Yahoo have all used Spark to make sense of large collections of data. About 17 percent of 3,000 Java professionals noted that they were running Spark in their operations, according to a December 2014 survey conducted by Java tool provider TypeSafe.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com

Join the PC World newsletter!

Error: Please check your email address.

Tags applicationsIBMsoftwaredata mining

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Joab Jackson

IDG News Service
Show Comments

Essentials

Lexar® JumpDrive® S57 USB 3.0 flash drive

Learn more >

Microsoft L5V-00027 Sculpt Ergonomic Keyboard Desktop

Learn more >

Mobile

Lexar® JumpDrive® S45 USB 3.0 flash drive 

Learn more >

Exec

Audio-Technica ATH-ANC70 Noise Cancelling Headphones

Learn more >

Lexar® Professional 1800x microSDHC™/microSDXC™ UHS-II cards 

Learn more >

Lexar® JumpDrive® C20c USB Type-C flash drive 

Learn more >

HD Pan/Tilt Wi-Fi Camera with Night Vision NC450

Learn more >

Budget

Back To Business Guide

Click for more ›

Most Popular Reviews

Latest News Articles

Resources

PCW Evaluation Team

Michael Hargreaves

Windows 10 for Business / Dell XPS 13

I’d happily recommend this touchscreen laptop and Windows 10 as a great way to get serious work done at a desk or on the road.

Aysha Strobbe

Windows 10 / HP Spectre x360

Ultimately, I think the Windows 10 environment is excellent for me as it caters for so many different uses. The inclusion of the Xbox app is also great for when you need some downtime too!

Mark Escubio

Windows 10 / Lenovo Yoga 910

For me, the Xbox Play Anywhere is a great new feature as it allows you to play your current Xbox games with higher resolutions and better graphics without forking out extra cash for another copy. Although available titles are still scarce, but I’m sure it will grow in time.

Kathy Cassidy

STYLISTIC Q702

First impression on unpacking the Q702 test unit was the solid feel and clean, minimalist styling.

Anthony Grifoni

STYLISTIC Q572

For work use, Microsoft Word and Excel programs pre-installed on the device are adequate for preparing short documents.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?