 
                    Originally published by Wired Middle East on January 18, 2022
MBZUAI student Karima Kadaoui is developing algorithms to help speech-impaired people communicate and navigate their way through society.
 The idea behind the project is to have a person with speech impediments talk and have an application to translate what they said in a way that is understandable by other people.
							
						
							 MBZUAI master's student 
						
Kadaoui, a master’s student at MBZUAI, is drawing on her machine-learning expertise to build an app for the speech-impaired to communicate with others. “The idea behind the project is to have a person with speech impediments talk and have an application to translate what they said in a way that is understandable by other people,” she explains.
It’s a project designed to help people with all sorts of conditions, including strokes and cerebral palsy. People with speech impairments sometimes struggle to control the muscles used to talk, making their speech difficult to understand. The AI that Kadaoui is developing could help improve communication and allow the speech-impaired to participate in society more smoothly. As voice-enabled technologies grow increasingly important in our daily lives, Kadaoui also hopes that such a solution can ultimately be plugged into speech recognition systems like Siri and Google Assistant.
Kadaoui isn’t the first to recognize the gaps in speech recognition. Since 2019, Google has been working on developing algorithms that adapt speech recognition to people suffering from a stroke or other conditions that can lead to speech impairment. Amazon meanwhile integrated the Israeli startup Voiceitt’s app into Alexa last June, building a personalized AI-powered models that can understand specific requests from speech-impaired users.
Still, it’s a problem that’s easier identified than solved. Algorithms are only as good as the data they are trained with, and the phrases they come across most frequently become patterns for learning how to speak. This can be a problem when it comes to refining algorithms for the speech impaired, who can often struggle to speak for long periods of time, according to Kadaoui. So researchers need lots of audio samples (and manual transcriptions of the speech) to make associations between sounds and words.
Google has tackled the problem by prioritizing scripted speech. It sends its volunteers some 1500 phrases to read and record for its database, including a mix of unique units of speech, and those repeated to better train the algorithms. So far, the company says it has gathered some 1400 hours of data from more than a thousand volunteers, allowing researchers to refine their algorithms to understand different types of speech.
Kadaoui and her team are contemplating a similar fix. “We thought of making an app where a person can read a sentence that’s given to them and on their own time, record the sentences,” she says, noting that such an approach will also make submitting the data easier for the speech-impaired volunteers giving the team audio samples. “Eventually, we’ll get this continuously growing data set for speech.”
Baloch offered insights and inspiration to the MBZUAI community during a fireside chat hosted by the Incubation.....
Unlike most pitch events, the IEC's Build It platform is designed for products and prototypes already in.....
MBZUAI students and researchers got under the hood of Careem’s AI strategy during an interactive session with.....