Research And Application On Simultaneous Recognition Of Both Speech And Speaker

Posted on:2016-08-14

Degree:Master

Type:Thesis

Country:China

Candidate:J Fang

GTID:2298330467961850

Subject:Computer application technology

Abstract/Summary:

As computer technology has become widespread, speech recognition has attracted growing attention. Speech is one of the most popular means of human-computer interaction, and speech recognition is critical to voice interaction between people and machines. Some environments call for methods that can accurately identify both the speech and the speaker and that can also run on embedded systems, such as in-car and intelligent home systems. This thesis analyzes speech recognition and speaker identification as applied to intelligent home systems. The research mainly covers the following:

(1) We study voice activity detection and feature extraction, which are used to preprocess the voice signal. With the aim of recognizing speech and identifying the speaker's group at the same time, we explore several speaker adaptation methods and further study the mechanism for simultaneous speech and speaker recognition proposed by Herbig in 2011.

(2) We combine ensemble learning with speech recognition, based on Bagging and GMM, to improve the speech recognition rate and its stability. To reduce space consumption, we use SQ (Soft Quantization) to integrate the speech models, which makes the speech recognition system better suited to embedded systems with limited resources. Compared with a voting mechanism, this method improves the speech recognition rate and stability when only a small number of speech models is available. To recognize speech and identify the speaker's group at the same time, we use SQ to integrate the speech models and the speaker's group models, so that the optimal decoder can be computed in real time for each frame of voice and a vote is cast for the model with the highest SQ score. Comparing the votes of the models completes speaker's group identification, while the optimal decoders complete speech recognition. When 6 speech recognition models were integrated, the average speech recognition rate reached 88% and the average speaker's group recognition rate reached 81.56%. The experimental results confirm that simultaneous recognition of speech and speaker's group is feasible in certain environments.

(3) For the intelligent home environment, we use the method of simultaneous speech and speaker's group recognition to implement a simultaneous speech and speaker recognition system. When 5 speech recognition models were integrated, the speech recognition rate reached 96.64% and the speaker's group recognition rate reached 88.24%. The experimental results show that this method is suitable for simultaneous speech and speaker recognition in the intelligent home environment.
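The frame-level voting described in item (2) can be illustrated with a small sketch. This is not the thesis's implementation: the abstract does not define the SQ score, so the code assumes a soft score equal to each model's per-frame GMM likelihood normalized across all models. The names `soft_scores` and `recognise_group`, the toy models, and the toy frames are all hypothetical. In a full system the per-frame winner would also select the decoder used for speech recognition; here it is only recorded.

```python
import math
from collections import Counter


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature frame x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of frame x under a GMM given as (weight, mean, var) tuples."""
    logs = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    top = max(logs)
    # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(value - top) for value in logs))


def soft_scores(x, models):
    """ASSUMPTION: stand-in for the SQ score, a softmax over model log-likelihoods."""
    ll = {name: gmm_log_likelihood(x, gmm) for name, gmm in models.items()}
    top = max(ll.values())
    z = sum(math.exp(v - top) for v in ll.values())
    return {name: math.exp(v - top) / z for name, v in ll.items()}


def recognise_group(frames, models):
    """Each frame votes for its best-scoring model; the most-voted model wins."""
    votes = Counter()
    per_frame_best = []
    for x in frames:
        scores = soft_scores(x, models)
        best = max(scores, key=scores.get)
        votes[best] += 1
        per_frame_best.append(best)  # would pick the decoder for this frame
    return votes.most_common(1)[0][0], per_frame_best


# Hypothetical speaker's group models, one single-component GMM each.
models = {
    "group_a": [(1.0, [0.0, 0.0], [1.0, 1.0])],
    "group_b": [(1.0, [3.0, 3.0], [1.0, 1.0])],
}
# Hypothetical two-dimensional feature frames from one utterance.
frames = [[0.1, -0.2], [0.3, 0.1], [2.9, 3.2], [-0.4, 0.2]]

group, path = recognise_group(frames, models)
print(group, path)
```

Voting per frame rather than per utterance keeps only a running vote count in memory, which fits the limited-resource embedded setting the abstract targets.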

Keywords/Search Tags:

speech recognition, speaker identification, ensemble learning, speaker's group recognition, SQ (Soft Quantization), voting mechanism, embedded system, Bagging

Related items

1	Design And Research Of Speaker Recognition System Based On Speech Enhancement
2	Research On The Performance Of Speech Features In Gender-based Speaker Recognition
3	Research And Speaker Recognition System
4	Design And Implementation Of Embedded Speech Recognition System Based On Deep Learning
5	Research On Speaker Recognition System And Its Influence On Stuffy Nose
6	Any Text Speaker Recognition System
7	Research On Speaker Recognition Based On Vector Quantization (VQ)
8	Research And Implementation Of Speaker Recognition Method For Anti-playback Fake Speech
9	Text-independent Speaker Recognition Method And System Based On Spatial Distribution Of Speech Features
10	The Research Of Speaker Recognition