What will it take to make AI sound human?

'It's a matter of being personalized,' says CMU professor Alan Black

Pepper the robot appears on stage with a Softbank executive at CES in Las Vegas on Jan. 7, 2016 Credit: James Niccolai

Pepper the robot appears on stage with a Softbank executive at CES in Las Vegas on Jan. 7, 2016 Credit: James Niccolai

Conversation fillers such as "hmm" and "uh-huh" may seem like insignificant parts of human conversation, but they're critical to improving communication between humans and artificial intelligence.

So argues Alan Black, a professor in the Language Technologies Institute at the Carnegie Mellon School of Computer Science, who specializes in speech synthesis and ways to make artificially intelligent speech sound more real.

Both Siri and Cortana incorporate aspects of Black's work, he says. But for the most part, such technologies still boil down to a pretty simple pattern: The human speaks, then the machine processes that speech and answers.

"It's not really how humans interact," Black said in an interview on Friday. "It's a stilted kind of interaction."

Key to making such conversations more natural are pauses, fillers, laughs and the ability of speakers to anticipate and complete each other's sentences -- all of which help build rapport and trust.

"Laughing is part of communication," he said. "Machines don't do that -- if they did, it would be unbelievably creepy -- but ultimately they should."

Black and his students are working on those areas.

"You need mm-hmm, back channels, hesitations and fillers, and so far our speech synthesizers can't do that," Black said. "If a system does say 'uh-huh,' it sounds like a robot."

Technologies using synthetic voices typically use speech recorded by humans "in a little room reading sentences," he explained. That, in turn, is "why they sound bored."

Working with students, Black is experimenting with using voices recorded in dialog, so that even if you just capture and use one side, it's clear the speakers are engaged. The idea is to model and incorporate the variance in human responses rather than using the same response all the time -- otherwise, humans can tell it's fake, Black said.

Ultimately, good AI will also know your views on certain topics, such as which candidate you support or oppose in a political race, so it won't say something offensive.

"On a higher level, it's a matter of being personalized," Black said. "That can be creepy, but it can also be appropriate, and it's important for trust. It's all about building this thing that's close to what humans expect and makes it easier to have this conversation."

Looking ahead, another big issue is how to get people to learn to do new things with their devices. There's basic interaction happening now with technologies like Siri and Cortana, but the next challenge is to get users to turn to AI first for answers, Black said.

Some users have been embarrassed talking to their phones but more comfortable talking to Amazon Echo because all they have to do is speak out loud in their homes. "People are treating it differently," he said. "It's there in the room with you."

Join the PC World newsletter!

Error: Please check your email address.

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Katherine Noyes

IDG News Service
Show Comments

Essentials

Lexar® JumpDrive® S57 USB 3.0 flash drive

Learn more >

Microsoft L5V-00027 Sculpt Ergonomic Keyboard Desktop

Learn more >

Mobile

Lexar® JumpDrive® S45 USB 3.0 flash drive 

Learn more >

Exec

Lexar® Professional 1800x microSDHC™/microSDXC™ UHS-II cards 

Learn more >

HD Pan/Tilt Wi-Fi Camera with Night Vision NC450

Learn more >

Lexar® JumpDrive® C20c USB Type-C flash drive 

Learn more >

Audio-Technica ATH-ANC70 Noise Cancelling Headphones

Learn more >

Budget

Back To Business Guide

Click for more ›

Most Popular Reviews

Latest News Articles

Resources

PCW Evaluation Team

Azadeh Williams

HP OfficeJet Pro 8730

A smarter way to print for busy small business owners, combining speedy printing with scanning and copying, making it easier to produce high quality documents and images at a touch of a button.

Andrew Grant

HP OfficeJet Pro 8730

I've had a multifunction printer in the office going on 10 years now. It was a neat bit of kit back in the day -- print, copy, scan, fax -- when printing over WiFi felt a bit like magic. It’s seen better days though and an upgrade’s well overdue. This HP OfficeJet Pro 8730 looks like it ticks all the same boxes: print, copy, scan, and fax. (Really? Does anyone fax anything any more? I guess it's good to know the facility’s there, just in case.) Printing over WiFi is more-or- less standard these days.

Ed Dawson

HP OfficeJet Pro 8730

As a freelance writer who is always on the go, I like my technology to be both efficient and effective so I can do my job well. The HP OfficeJet Pro 8730 Inkjet Printer ticks all the boxes in terms of form factor, performance and user interface.

Michael Hargreaves

Windows 10 for Business / Dell XPS 13

I’d happily recommend this touchscreen laptop and Windows 10 as a great way to get serious work done at a desk or on the road.

Aysha Strobbe

Windows 10 / HP Spectre x360

Ultimately, I think the Windows 10 environment is excellent for me as it caters for so many different uses. The inclusion of the Xbox app is also great for when you need some downtime too!

Mark Escubio

Windows 10 / Lenovo Yoga 910

For me, the Xbox Play Anywhere is a great new feature as it allows you to play your current Xbox games with higher resolutions and better graphics without forking out extra cash for another copy. Although available titles are still scarce, but I’m sure it will grow in time.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?