Facebook engineers identify Graph Search's big data challenges

"There's still a lot of work we have to do," engineers said

Facebooks engineers have many challenges ahead of them as they work to scale up Graph Search, the sites new social search tool. One stumbling block: an over-abundance of data to sift through.

Take the example of searching for Japanese restaurants in New York City liked by people from Japan. A search that would seem to generate hundreds if not thousands of results only spits back two measly businesses.

The search engine, in its current beta form, simply does not have the processing power to sift through the millions of connections among Japanese people on the site to perform the search, Facebook engineers said Thursday during a small media briefing at the companys headquarters in Menlo Park, California.

Theres still a lot of work we have to do, said software engineer Michael Curtiss. A query like this is very difficult computationally, to start with the 100 million in Japan, and then in a fraction of a second to sort through all the pages liked by people in Japan, he said.

This is virtually intractable in the limited amount of time that we have, said the engineer, who helped to design the sites Unicorn search engine that provides Graph Searchs infrastructure. What we end up having to do is cut out possibly good results.

Facebook is taking a variety of approaches to solve this and other big data problems associated with Graph Search.

One strategy involves a concept in computer databases known as query optimization, to improve the speed and efficiency of certain types of searches.

In the case of the Japanese restaurant search, the technique could be applied to start first with the restaurants that are liked instead of starting with Japan, and then filtering down the likes by people, Facebook engineers said.

The company is also addressing the challenges at the hardware level, by adding additional flash memory and other new features to the servers it uses at data centers, to accommodate the increase in search traffic caused by Graph Search.

We need to do extra work in data centers, buying new hardware platforms, [with] new types of servers being put up to support the computational needs of Unicorn, said Soren Lassen, who led the search infrastructure team behind Graph Search.

Facebook began rolling out Graph Search last month to a limited number of users in the U.S. The search tool is designed to let people comb through the social networks 1 trillion connections among users to search for people, places, photos and interests using phrases in plain English.

In principle, nothing can stop users from typing in a query that is unusually long, such as Employers of friends of my friends who live in New York and who like Downton Abbey, engineers said, since Graph Search uses cues such as Likes and check-ins to more easily rank the results.

Eventually Graph Search will incorporate other metrics such as user comments and status updates to compile and rank results, but thats further down the line, the company said.

Zach Miners covers social networking, search and general technology news for IDG News Service. Follow Zach on Twitter at @zachminers. Zach's e-mail address is zach_miners@idg.com

Join the newsletter!

Error: Please check your email address.
Rocket to Success - Your 10 Tips for Smarter ERP System Selection

Tags Internet-based applications and servicessocial networkingsocial mediainternetsearch enginesFacebook

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Zach Miners

IDG News Service
Show Comments

Cool Tech

SanDisk MicroSDXC™ for Nintendo® Switch™

Learn more >

Breitling Superocean Heritage Chronographe 44

Learn more >

Toys for Boys

Family Friendly

Panasonic 4K UHD Blu-Ray Player and Full HD Recorder with Netflix - UBT1GL-K

Learn more >

Stocking Stuffer

Razer DeathAdder Expert Ergonomic Gaming Mouse

Learn more >

Christmas Gift Guide

Click for more ›

Most Popular Reviews

Latest Articles

Resources

PCW Evaluation Team

Walid Mikhael

Brother QL-820NWB Professional Label Printer

It’s easy to set up, it’s compact and quiet when printing and to top if off, the print quality is excellent. This is hands down the best printer I’ve used for printing labels.

Ben Ramsden

Sharp PN-40TC1 Huddle Board

Brainstorming, innovation, problem solving, and negotiation have all become much more productive and valuable if people can easily collaborate in real time with minimal friction.

Sarah Ieroianni

Brother QL-820NWB Professional Label Printer

The print quality also does not disappoint, it’s clear, bold, doesn’t smudge and the text is perfectly sized.

Ratchada Dunn

Sharp PN-40TC1 Huddle Board

The Huddle Board’s built in program; Sharp Touch Viewing software allows us to easily manipulate and edit our documents (jpegs and PDFs) all at the same time on the dashboard.

George Khoury

Sharp PN-40TC1 Huddle Board

The biggest perks for me would be that it comes with easy to use and comprehensive programs that make the collaboration process a whole lot more intuitive and organic

David Coyle

Brother PocketJet PJ-773 A4 Portable Thermal Printer

I rate the printer as a 5 out of 5 stars as it has been able to fit seamlessly into my busy and mobile lifestyle.

Featured Content

Product Launch Showcase

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?