Research On Confidence Measure In Speech Keyword Spotting

Posted on:2016-03-08

Degree:Master

Type:Thesis

Country:China

Candidate:P Wang

Full Text:PDF

GTID:2308330482479211

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

As an important technology of speech recognition, keyword spotting has broad application space and great practical value. Although there are a lot of researches about the keyword spotting technology, errors are still inevitable in the recognition results, due to the influence of adverse factors, such as environmental noise, pronunciation differences of speakers and so on. Confidence measure can be used to evaluate the reliability of the candidate keywords objectively without reference alignment. Therefore it is crucial for the keyword spotting. According to the different properties of hypothesized words, the methods of confidence measure and normalization for In-Vocabulary (IV) and Out-of-Vocabulary (OOV) words are studied respectively in this paper. Research contributions of this thesis are as follows:(1) To solve the problem of insufficient use of contextual information, a confidence measure method for In-Vocabulary words based on optimization of context consistency is presented. Firstly, the reliability of the hypothesized words is evaluated by the context consistency. Then an adaptive sliding window is used to segment the semantic fragments of the speech recognition results to remove consistency interference, after which the context consistency of hypothesized words can be forced to calculate just in its own semantic fragment. Furthermore, the distance information between words is introduced to calculate context consistency to distinguish the influence of diverse position context words. Experimental results show that the proposed optimal measures could improve the accuracy of context consistency greatly.(2) As some In-Vocabulary words may be recognized as Out-of-Vocabulary words in the soft match-based OOV detection, we propose a confidence measure method for OOV based on error types in the speech recognition. The errors of recognition are estimated according to the properties of hypothesized words. Then the recognition error types are turned into features and a distinguish model is trained to detect the IV/OOV positions. If the OOV candidate based on soft match is found in the IV area, it can be treated as a false alarm. Otherwise the OOV candidate is in the OOV area, a fused confidence measure will be employed to calculate the reliability of the hypothesized words. Experimental results show that the confidence measure based on extended errors can improve the performance of OOV words detection greatly.(3) As different hypothesized words have different properties, the same confidence scores may represent different confidence level. A term-dependent confidence normalization method based on ATWV (Actual Term-Weighted Value) optimization is proposed. Firstly, we adjust the wordsâ€™confidence score according to their frequency in the test. Then the confidence bias in ATWV optimization is compensated in a linear and discriminative way respectively. The linear compensation adjusts confidence by adding weighted and translation factors. While the discriminative compensation convert confidence score to classification posterior probability, which meets the requirements of ATWV optimization, by discriminative model training. Experimental results show that the proposed confidence normalization is effective for the keyword spotting.

Keywords/Search Tags:

Confidence Measure, In-Vocabulary Words Detection, Optimized Context Consistency, Out-of-Vocabulary Words Detection, Extended Errors in Speech Recognition, Term-dependent Normalization, ATWV Optimization, Bias Compensation

PDF Full Text Request

Related items

1	Research On Out-of-vocabulary Spoken Term Detection
2	Research On Confidence Measure For Chinese Spoken Term Detection
3	Learning Out-of-Vocabulary Words in Automatic Speech Recognition
4	Research And Design Of Small Vocabulary And Speaker-independent Isolated Words Speech Recognition System
5	Small Vocabulary Isolated Words Speech Recognition Method Research
6	A Study Of An Irrelevant Variability Normalization Based Large Vocabulary Continuous Speech Recognition
7	Research Methods Of Speech Recognition Of Specific Two Words Chinese Vocabulary Based On Spectrogram
8	Syllables and concepts in large vocabulary speech recognition
9	Research On Human Behavior Recognition Based On Kinect And Bag-of-words Model
10	Research On The Translation Of Out Of Vocabulary Words In The Neural Machine Translation For Chinese And English Patent Corpus