Study On Robust Feature Extraction Method Of Speech And Audio-based Context Recognition

Posted on:2010-08-16

Degree:Master

Type:Thesis

Country:China

Candidate:W T Gao

Full Text:PDF

GTID:2178360302460669

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Audio signal contains a rich of information, and has the advantages of non-contact, nature and low-cost of capture equipment, which makes audio-based recognition technology has a good prospect in the human-computer interaction and the intelligence of electronic products. In practical applications, speech signals are often disturbed by environments which reduce the speaker recognition accuracy. The robust feature extraction methods can effectively reduce affection of noises and improve the speaker's recognition accuracy. Research on robust feature extraction methods has become a popular area in audio-based recognition filed. Audio-based context recognition technology can make the computer automatically sense environmental characteristics and have auditory intelligence. Now, audio-based context recognition has gained more concern from researchers.Based on previous researchers' efforts and achievements, this thesis is mainly about robust feature extraction methods in speaker recognition, and features selection methods, model optimization method in audio-based context recognition.Cepstrum coefficients extraction method based on the minimum variance distortionless response and perceptual information is studied. This method is an improved algorithm of Mel frequency cepstrum coefficients which can effectively improve the speaker recognition system in noisy conditions.A feature selection algorithm based on local discriminant basis is proposed. This method selects the feature subset using two discriminant criterions and evaluates the feature subset with the recognition accuracy. After gained the best size of feature subset in training scheme, feature subset of the audio samples can be online received in the recognition scheme. Experiments in audio-based context recognition show that the local discriminant basis based feature selection method can effectively improve the recognition accuracy.A discriminative training algorithm that uses support vector machines to improve the classification accuracy of the continuous hidden markov model (HMM) is studied. The algorithm trains HMM models as a baseline system and uses support vector machines training scheme to rescore the parameters of HMM. Experiments show that support vector machine rescoring of hidden markov models typically improve the recognition accuracy compared to the original HMM.

Keywords/Search Tags:

Robust Audio Features, Features Selection, Local Discriminant Base, Hidden Markov Model, Support Vector Machine

PDF Full Text Request

Related items

1	Automatic Audio Stream Classification Based On Hidden Markov Model And Support Vector Machine
2	Online Signature Verification Based On Multi-dimensional Force Information
3	Face Recognition Algorithms Based On Image Reconstruction And Features Fusion
4	Research On Human Action Recognition Algorithm Based On 3D Data
5	L_p Regular Izat Ion In Support Vector Machine For Features Selection
6	Enhancement Of Feature Discriminant Ability With Synergistic Solution For Scene Categorization
7	Research On The Methods Of Machine Recognition Of Faces
8	Research On Image Splicing Detection By Statistical Feature Analysis
9	Study On Content-based Audio Retrieval Technology
10	Research On Stock Selection Model Based On Linear Discriminant Analysis And Weighted Support Vector Machine