Scientists at KIT are developing silent speech interfaces, which allow humans to communicate with each other by speaking silently. The system could support silent phone conversations or help people who have lost their voice.
The computer scientists currently use electrodes which are attached to the skin. In the future, such electrodes might be incorporated into cell phones (Photo: Deutsche Messe Hannover)
Telephoning silently without disturbing any bystanders? Speaking in a foreign tongue? Giving a voice to people who have lost theirs through illness or accident? Computer scientists at the Karlsruhe Institute of Technology (KIT) are working on a groundbreaking technology that opens up a variety of possibilities. “Speech Recognition by Electromyography” is the official name of a new approach that recognizes speech from the electrical activity of the facial muscles. The application scenarios are manifold and could change human communication habits in the future.
Professor Tanja Schultz heads the Cognitive Systems Lab at KIT and is responsible for the project: “I came up with the idea a couple of years ago when I was sitting on a train. I grew annoyed by a fellow passenger who was loudly speaking into a cell phone and started thinking of ways to change this. Silent speech interfaces seem to be a great solution for this.” Schultz and her team developed a prototype that uses surface electrodes to capture the electrical potentials of the articulatory muscles and recognizes speech from them. This makes it possible to recognize and transmit silently uttered speech. The technology is based on electromyography, i.e. the capturing and recording of electrical potentials that arise from muscle activity.
Speech is produced by the contraction of muscles that move our articulatory apparatus. The electrical potentials generated by these contractions are captured by surface electrodes attached to the skin. Analyzing and processing these signals with suitable pattern matching algorithms makes it possible to reconstruct the corresponding movement of the articulatory muscles and to deduce what has been said. The recognized speech is output as text or synthesized as an acoustic signal. Since electromyography records the muscle activity rather than acoustic signals, speech can be recognized even if it is uttered silently, without any sound production.
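To make the pattern matching step more concrete, the following Python sketch shows one very simple way such a pipeline could look. It is not the KIT system: the two features (signal energy and zero-crossing rate per frame), the nearest-template matching, the synthetic noise signals standing in for real electrode recordings, and all function names are assumptions chosen purely for illustration.

```python
import math
import random

def frame_features(signal, frame_len=50):
    """Split one channel into frames; return (RMS, zero-crossing rate) per frame."""
    features = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if a * b < 0)
        features.append((rms, crossings / (frame_len - 1)))
    return features

def distance(feats_a, feats_b):
    """Sum of Euclidean distances between two equally long feature sequences."""
    return sum(math.dist(a, b) for a, b in zip(feats_a, feats_b))

def recognize(signal, templates):
    """Return the label of the stored template closest to the signal."""
    feats = frame_features(signal)
    return min(templates, key=lambda label: distance(feats, templates[label]))

def synthetic_emg(envelope, rng, samples_per_step=50):
    """Toy stand-in for a recording: noise scaled by a muscle activation envelope."""
    return [level * rng.gauss(0.0, 1.0)
            for level in envelope
            for _ in range(samples_per_step)]

# Hypothetical activation patterns for two silently uttered words.
ENVELOPES = {
    "yes": [0.1, 1.0, 1.0, 0.1],
    "no": [1.0, 0.1, 0.1, 1.0],
}

# "Training": store one feature template per word.
templates = {
    word: frame_features(synthetic_emg(env, random.Random(seed)))
    for seed, (word, env) in enumerate(ENVELOPES.items())
}

# "Recognition": a new, differently seeded utterance of the word "yes".
utterance = synthetic_emg(ENVELOPES["yes"], random.Random(7))
result = recognize(utterance, templates)
print(result)
```

A real recognizer would work on several electrode channels at once and on continuous speech with a large vocabulary, which calls for statistical models rather than fixed templates. The sketch only illustrates the basic idea that muscle activity alone, without any sound, carries enough structure to tell utterances apart.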
The accuracy of the system is very encouraging: “The prototype currently can recognize more than 2000 words vocabulary and give up to 90% accuracy”, states Schultz, who believes that in five, maybe ten years, this will be a usable, everyday technology.
The computer scientists at KIT envision several practical applications for Silent Speech Recognition:
1) Silent Telephony: Silent speech recognition allows for silent communication without disturbing any bystanders.