Research On Speaker Recognition Based On SVM And Deep Learning

Posted on:2020-01-11

Degree:Master

Type:Thesis

Country:China

Candidate:J A Zhou

GTID:2438330596497510

Subject:Electronic and communication engineering

Abstract/Summary:

With the continuous development of speech recognition technology, speaker recognition has received increasing attention as an important method of identity authentication. Traditional speaker recognition usually uses MFCC, LPCC, etc. as feature parameters, with recognition algorithms based on the hidden Markov model, vector quantization, and Gaussian models. However, speaker recognition still needs to improve in recognition accuracy, identifiable sample size, and recognition speed. This thesis mainly studies the following aspects:

(1) The model and principles of speaker recognition are described in detail. The speech preprocessing stage is studied closely, and the work of each preprocessing step is discussed. The specific calculation process of a series of parameters such as MFCC is introduced. The mainstream speaker recognition methods are then studied, and four different speaker recognition models are explored, confirming the limitations of the mainstream methods.

(2) An improved speaker recognition method based on the support vector machine (SVM) and Mel-frequency cepstral coefficients (MFCC) is proposed. For feature extraction, Mel-frequency cepstral coefficients are used and the speech feature parameters are improved: four improved audio feature parameters are added to the traditional feature quantities. The kernel function types and parameters of the SVM model are then analyzed. Experimental and simulation results show that the recognition rate of the improved speaker recognition system is 21% higher than before.

(3) A speaker recognition system based on a CNN and spectrograms is studied. The speaker's voice information itself is used as the input feature so that the original information is retained: the voice signal is converted into a two-dimensional spectrogram, which is formatted as input and processed to obtain distinct voice patterns. These patterns are fed into a convolutional neural network to construct a speaker recognition system, which achieves a recognition rate of 91.2% in performance tests.
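
The preprocessing and spectrogram steps described in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the abstract does not give the pre-emphasis coefficient, window type, frame length, or hop size, so the values below (0.97, Hamming, 64 samples, 32 samples) are assumptions chosen for illustration, and all function names are hypothetical. The sketch uses only the Python standard library and a naive DFT, so it is suitable for short signals only. The Mel conversion uses the common formula mel = 2595 * log10(1 + f / 700).

```python
import cmath
import math


def pre_emphasis(signal, alpha=0.97):
    """Boost high frequencies: y[n] = x[n] - alpha * x[n-1] (alpha is an assumed value)."""
    return [signal[0]] + [signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))]


def frame_signal(signal, frame_len, hop):
    """Split the signal into overlapping frames, dropping any incomplete tail."""
    if len(signal) < frame_len:
        return []
    count = 1 + (len(signal) - frame_len) // hop
    return [signal[i * hop: i * hop + frame_len] for i in range(count)]


def hamming(n_points):
    """Hamming window coefficients for a frame of n_points samples."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (n_points - 1)) for n in range(n_points)]


def magnitude_spectrum(frame):
    """Naive O(N^2) DFT magnitudes for bins 0..N/2."""
    n_points = len(frame)
    magnitudes = []
    for k in range(n_points // 2 + 1):
        acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / n_points)
                  for n in range(n_points))
        magnitudes.append(abs(acc))
    return magnitudes


def spectrogram(signal, frame_len=64, hop=32):
    """Return a 2-D list: one row of magnitude bins per windowed frame."""
    window = hamming(frame_len)
    frames = frame_signal(pre_emphasis(signal), frame_len, hop)
    return [magnitude_spectrum([s * w for s, w in zip(frame, window)]) for frame in frames]


def hz_to_mel(hz):
    """Convert a frequency in Hz to the Mel scale used when computing MFCCs."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


if __name__ == "__main__":
    # A 1 kHz test tone sampled at 8 kHz stands in for a speech signal.
    tone = [math.sin(2 * math.pi * 1000 * n / 8000) for n in range(256)]
    spec = spectrogram(tone)
    print(len(spec), "frames x", len(spec[0]), "frequency bins")
```

Each row of the result is one time frame and each column one frequency bin, which is the two-dimensional layout a convolutional network would take as input. In practice an FFT routine would replace the naive DFT.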

Keywords/Search Tags:

Speaker recognition, Neural network, Deep learning, Convolution, Feature extraction

Related items

1	Research On Speaker Identification Based On Deep Learning
2	Research On End-to-end Speaker Recognition Based On Raw Waveform
3	Research On Feature Extraction And Model Algorithm For Speaker Recognition
4	Research On Speaker Recognition Algorithm Based On Deep Neural Network
5	Research Of SAR Feature Extraction And Target Recognition Based On Deep Learning
6	Research On Speaker Recognition Based On Discriminative Feature Learning
7	Research On Speaker Recognition Method Based On Deep Learning
8	Research On Text Detection And Recognition Method Of Natural Scene Based On Deep Learning
9	Fluid Cells Based On Convolution Neural Network Image Visible Part Of Feature Recognition Method Research
10	Study On Speaker Recognition Based On Deep Learning