Voice Activity Detection For Speech Recogniction System On Mobile Devices

Posted on:2015-08-18

Degree:Master

Type:Thesis

Country:China

Candidate:N Jiang

Full Text:PDF

GTID:2298330452464045

Subject:Electronic Science and Technology

Abstract/Summary:

PDF Full Text Request

In recent years, speech recognition systems started to be widely used inmobile devices. As a requisite step in speech processing, voice activedetection (VAD) contributes to distinguish the speech interval from thenon-speech interval in the streaming digital signals. Based on the detectionresult, speech system could abandon the non-speech data to benefit bothperformance and energy consume. Especially on embedded device, it requiresa less complexity but still effective VAD solution.Firstly, the pre-processing step is provided to prepare for featureextraction, containing framing, pre-emphasis and speech enhancement. Thena low-power solution is proposed. The feature is selected from classictime-domain and newest popular features to represent speech distinction. Thedecision strategy is optimized based on threshold comparison. Experimentalresult shows it works well for high SNR (≥10dB) environment.Then, to deal with low SNR cases, an algorithm combined theMFCC-GMM model classifier and time-domain feature in innovation ways isproposed. It brings robustness performance in low signal noise ratio popularnoise environment. And in annoying babble noise situation, the proposedalgorithm has lower false-alarming rate than the other VAD solutions.At the end, a sample application, named “wake on voice” in androiddevice is implemented to testify the practicability and adaptability of theproposed solutions, while the APIs are packaged and reusable in any mobilespeech system...

Keywords/Search Tags:

mobile application, speech signals processing, voice activedetection, GMM

PDF Full Text Request

Related items

1	Prosthetic voice controlled by muscle electromyographic signals
2	Distributed Speech Recognition And Voice XML Standardlanguage In Vivid-Ring Application
3	Distributed Speech Recognition And Voice XML Standardlanguage In Vivid-Ring Application
4	Distributed Speech Recognition And Voice Xml Standardlanguage In Vivid-ring Application
5	Research And Implementation Of Real-time Speech Processing Technology For Mobile Internet
6	Research On Embedded Speech Synthesis Technology
7	Mobile Voice Over Ip Communication System Audio Codecs
8	The Research Of The Speech Encoding & Decoding In The Digital Processing Of Speech Singnals
9	Design And Realization Of One-Shot Vehicular Voice-User Interface System
10	Modular Implementation Of The Mobile End Of The Cochlear Speech Processing Function