
Scientists develop child-like synthetic voice for children who can't speak


February 21, 2012

Norwegian researchers have devised a new way of creating a child-like synthetic voice for children who are unable to speak (Photo via Shutterstock)

You may think that Stephen Hawking's synthesized voice sounds a little ... unusual, but imagine how much weirder it would be to hear a child using that same adult voice to communicate. Many children who are unable to speak, however, have no choice but to use assistive devices that rely on just such a voice. Now, help may be on the way. Norwegian researchers have developed a new method of creating synthetic speech that actually sounds as if it is being spoken by a child. Such technology could also allow computers to better recognize words spoken to them by young users.

One of the systems is the result of a collaboration between software company Lingit and Media LT, a company that develops devices for assisted living.

Initially, the researchers created a master voice, which was made by combining recordings of multiple adult speakers reciting several thousand phrases - enough to create a workable library of words and sounds. Then, they recorded a single child reciting a smaller number of phrases, which were selected to include the sounds that are most essential to the Norwegian language.

When a computer compared the master voice to the child's voice, using the phrases as a point of reference, it was able to alter the master voice to make it sound like that of a child. "The result sounds rather like a child with unusual elocution skills, but it's still much better than the voice of an adult," said Lingit's Dr. Torbjørn Nordgård.

Over at the Norwegian University of Science and Technology, meanwhile, synthetic children's speech is being used to teach computer voice recognition systems to better understand the voices of children.

Presently, most voice recognition systems are tailored toward adult speech, and often have difficulty recognizing words spoken by younger users. In order for these systems to get the hang of children's voices, they would need to be "trained" on recordings of children speaking - recordings that aren't nearly as plentiful as those of adult speech.

To remedy this situation, the researchers created a synthetic child's voice of their own. In their case, they analyzed how children's shorter vocal tracts affect the frequency distribution of their speech energy. They then altered the energy distribution of an adult speech program to achieve a child-like sound. "We could apply our conversion technique to a large database of adult speech and generate a functional database of artificial childlike voices," explained Prof. Torbjørn Svendsen. "We then used this to train a separate speech recognition program for children."
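The article does not spell out the researchers' actual conversion technique. As a rough illustration of the general idea only, the sketch below stretches a magnitude spectrum along the frequency axis so that its energy peaks move upward, which is approximately what a shorter vocal tract does to speech resonances. The function name `warp_spectrum`, the factor `alpha` and the toy spectrum are all assumptions made for this example, not details from the research.

```python
import numpy as np

def warp_spectrum(magnitude, alpha):
    """Stretch a magnitude spectrum along the frequency axis by `alpha`.

    alpha > 1 shifts spectral peaks to higher frequencies. This is a
    simple linear warp chosen for illustration; it is NOT claimed to be
    the method the researchers used.
    """
    bins = np.arange(len(magnitude))
    # Output bin k takes its energy from input position k / alpha,
    # using linear interpolation between neighbouring input bins.
    source = bins / alpha
    return np.interp(source, bins, magnitude)

# Toy "adult" spectrum: one resonance peak centred on bin 100 (hypothetical data).
bins = np.arange(512)
adult = np.exp(-0.5 * ((bins - 100) / 5.0) ** 2)

# Hypothetical warp factor of 1.2, standing in for a shorter vocal tract.
child_like = warp_spectrum(adult, alpha=1.2)

print(int(np.argmax(adult)), int(np.argmax(child_like)))
```

In this toy case the peak moves from bin 100 to bin 120. A real system would apply such a transformation frame by frame across a whole speech database, and would likely use a more carefully fitted warping function.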

When tested, a prototype version of the program had an error rate 50 to 70 percent lower than traditional "adult-oriented" programs.
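To make that figure concrete, the short calculation below reads "50 to 70 percent lower" as a relative reduction, which is an assumption about the article's wording. The 40 percent baseline is a made-up number used purely for illustration; the source gives no absolute error rates.

```python
def reduced_error_rate(baseline, relative_reduction):
    """Error rate left after a relative reduction (both given as fractions)."""
    return baseline * (1.0 - relative_reduction)

# Hypothetical baseline: an adult-oriented recognizer gets 40% of a child's words wrong.
baseline = 0.40
low = reduced_error_rate(baseline, 0.50)   # 50 percent lower
high = reduced_error_rate(baseline, 0.70)  # 70 percent lower

print(f"{high:.2f} to {low:.2f}")
```

Under those assumed numbers, the child-trained program would get roughly 12 to 20 percent of words wrong instead of 40 percent.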

Source: The Research Council of Norway

About the Author
Ben Coxworth
An experienced freelance writer, videographer and television producer, Ben's interest in all forms of innovation is particularly fanatical when it comes to human-powered transportation, film-making gear, environmentally-friendly technologies and anything that's designed to go underwater. He lives in Edmonton, Alberta, where he spends a lot of time going over the handlebars of his mountain bike, hanging out in off-leash parks, and wishing the Pacific Ocean wasn't so far away.