Research On Adaption Technique In Continuous Speech Keyword Spotting System

Posted on:2007-02-27

Degree:Master

Type:Thesis

Country:China

Candidate:L Zhu

Full Text:PDF

GTID:2178360185485541

Subject:Computer Science and Technology

Abstract/Summary:

Automatic speech recognition, which is commonly divided into continuous speech recognition and keyword spotting, is increasingly used in everyday life. Compared with continuous speech recognition, keyword spotting makes spoken dialogue more natural: the system understands the user's intent by catching the keywords that carry the important information in an utterance, without having to recognize every word accurately. Keyword spotting also copes well with problems of spoken language, such as non-standard or incoherent speech.

When the speech used for training differs substantially from the speech used for testing, system performance degrades significantly. Adaptation techniques narrow the gap between the system's models and a given speaker by adjusting the system parameters with a small amount of speech from that speaker, which raises the recognition rate.

This thesis focuses on applying speaker adaptation and speaker normalization techniques to a keyword spotting system, in the following respects:

1. A baseline keyword spotting system based on the Continuous Hidden Markov Model (CHMM) is constructed. The design of the baseline is discussed in detail, including speech preprocessing, feature extraction, acoustic model construction and training, keyword detection, and keyword verification. The baseline is then evaluated, and the evaluation motivates adding an adaptation module to it.

2. Both speaker adaptation and speaker normalization are investigated, and the idea of combining the two is proposed. Experimental results indicate that a model trained with speaker normalization is more speaker-independent, and that adaptation starting from this model achieves higher recognition rates. Several combinations of speaker normalization methods and speaker adaptation methods are compared and validated. We select the scheme that combines speaker adaptive training (SAT) with constrained maximum likelihood linear regression (CMLLR).
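
As a rough illustration of the kind of adaptation named above: in its standard formulation, CMLLR applies a speaker-specific affine transform x' = A x + b to each feature vector and adds log|det A| to the acoustic log-likelihood. The sketch below only shows how such a transform would be applied when scoring one frame against one diagonal-covariance Gaussian. It is not taken from the thesis; the function names are hypothetical, and estimating A and b from adaptation data is not shown.

```python
import math


def apply_affine(A, b, x):
    """Return A x + b for a square matrix A (list of rows) and vectors b, x."""
    return [sum(a * xj for a, xj in zip(row, x)) + bi for row, bi in zip(A, b)]


def log_abs_det(A):
    """log|det A| by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] for row in A]  # work on a copy so the caller's matrix is untouched
    total = 0.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if M[pivot][col] == 0.0:
            raise ValueError("transform matrix is singular")
        M[col], M[pivot] = M[pivot], M[col]
        total += math.log(abs(M[col][col]))
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n):
                M[r][c] -= factor * M[col][c]
    return total


def diag_gauss_loglik(x, mean, var):
    """Log-density of x under a Gaussian with diagonal covariance."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def cmllr_loglik(x, A, b, mean, var):
    """Score a frame after the feature-space transform, including the Jacobian term."""
    return diag_gauss_loglik(apply_affine(A, b, x), mean, var) + log_abs_det(A)
```

With the identity matrix for A and a zero vector for b, `cmllr_loglik` reduces to the unadapted score, which is a convenient sanity check for any estimated transform.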

Keywords/Search Tags:

Keyword Spotting, Speaker adaptation, Speaker normalization, Constrained Maximum Likelihood Linear Regression, Interactive Voice Response

Related items

1	Telephone Channel Natural Voice Keywords Detection
2	Speaker Recognition Based On Multi-domain Analysis
3	Research On Way Of Speaking Reliability In Voiceprint Recognition
4	Research Of Small Vocabulary, Speaker-independent Chinese Keyword Spotting Algorithm
5	Research On Adaptive Methods For Text-independent Speaker Recognition
6	Detection Based On Keywords Speaker Adaptation Research
7	Speaker Adaptation Technology And Its Key Words In The Telephone Channel Detection System Applications
8	Research On I-vector Based Speaker Normalization For Speech Recognition
9	Research On Speaker Representation Based On MG Training Criteria
10	Research On Improvement Of Speaker Recognition Algorithms Based On Hand-held Device