Speaker Recognition Under Short Utterance Based On Support Vector Machine

Posted on:2013-12-17

Degree:Master

Type:Thesis

Country:China

Candidate:H M Jin

Full Text:PDF

GTID:2248330371485173

Subject:Communication and Information System

Abstract/Summary:

Speaker recognition technology is a biometric method that uses speech signal torecognize different speakers. Due to its special merits in terms of flexibilityï¼Œeconomyï¼Œ accuracy and safetyï¼Œ speaker recognition technology owns abroadapplication future in identity verification. At presentï¼Œ speaker recognition systemperforms well under quiet laboratory environment and with sufficient voice. However,the performance of system will rapidly decrease when there are fewer of the speakerâ€™svoice data in practical applications. Meanwhile, the actual application environmentcan not be predicted, resulting in the test speech and training speech do not match, andmaking the speaker recognition become more difficulty. To solve the above problem,this paper considered the system performance under the short utterance and noiserobustness.The work of this paper focuses on the method of speaker recognition under shortutteranceï¼Œmainly researching on the support vector machine algorithm. Combinedwith GMM, the one-class SVM-GMM model was proposed to increase the systemperformance under short utterance. Multiple kernel mapping was introduced, themultiple kernel SVM-GMM was proposed for speaker recognition with shortutterance. For noise robustness, an extended spectral subtraction based on the cycleWiener filter and a noisy speech endpoint detection methods combination of SAP anddynamic threshold method were proposed for voice enhancement and speech endpointdetection, which can improve the quality of the input speech signal, further improvethe system performance.The main works of this paper includes:An algorithm based on one-class SVM was proposed for speaker recognitionwith short utterance. Then a new method called one-class SVM-GMM combinedone-class SVM with GMM was proposed. This method effectively reduces the timesof training and testing, consequently improve the practicality of the system.Multiple kernel mapping was introduced, the multiple kernel SVM-GMM was proposed for speaker recognition with short utterance.It combined KL function andcommon kernel function to achieve multiple kernel mapping. Used this mapping canmapped speakerâ€™s feature to a characteristic space which combined by multiple featurespace. In the new feature space, the feature of speaker was expressed better, thus theperformance of speaker recognition system was improved.Considering the influence of noise existed in the actual application on theperformance of speaker recognition system, we used an extended spectral subtractionbased on the cycle Wiener filter for voice enhancement in the speech signalpretreatment process. This algorithm can cut noise and improve the signal-to-noiseratio of the speech signal. A noisy speech endpoint detection methods combination ofSAP and dynamic threshold method was used for endpoint detection under noiseenvironment. We combined the method mentioned above with multiple kernelSVM-GMM to further improve the performance of speaker recognition under shortutterance in noise environment.

Keywords/Search Tags:

short utterance, speaker recognition, support vector machines, multiplekernel map, expanded spectral subtraction

Related items

1	Research And Implementation Of Speaker Recognition Method Based On Short Utterance
2	Research On Speaker Recognition Over Short Utterance And Varying Channels
3	Research On Robust Speaker Recognition Over Noisy Short Utterance
4	Study On Speaker Recognition In Short Utterance Condition
5	Phoneme Category Based Short Utterance Speaker Recognition
6	Multiple Kernel Support Vector Machine And Its Application In Face Recognition
7	Study Of Speaker Recognition System Based On Support Vector Machines
8	The Study Of Support Vector Machine And Its Application On Speaker Recognition
9	A Short Speech Speaker Recognition Methods And Applications
10	Research And Development Of Speaker Recognition System