Research On Voiceprint Recognition Method Based On Deep Learning

Posted on:2020-04-21

Degree:Master

Type:Thesis

Country:China

Candidate:Y Y Dong

Full Text:PDF

GTID:2428330572985657

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

Voiceprint recognition is an important branch in the field of biometric identification.It can verify the identity of an undetermined speaker by signal preprocessing,reasonable feature extraction and an effective recognition model.It is widely applied in criminal investigation,human-computer interaction voiceprint password verification,bank voiceprint identity verification,human rehabilitation index verification and other fields.Based on the technical needs of forensic identity verification,the paper collected actual sound signal,while the signal will inevitably have strong environmental noise and stable white Gaussian equipment noise.Adaptive Wiener filtering is adopted to suppress noise effectively,and eliminate or reduce the influence to the voiceprint recognition later.After analyzed the time series and spatial features,the length of spectrum is determined to be 4 seconds to limit the spectral size as the features of voiceprint recognition.Then use GMM-UBM with opening mandarin date set Surfingtech to recognize the identity.The results show that the accuracy of voiceprint recognition is higher in the condition of determinate Wiener filter and spectrum features.On this basis,in order to reduce the redundant information in spectrum to improve the efficiency of deep learning networks in training and learning.Proposes the word-embedding spectral map dimension reduction method.And combined with the LSTM which with good timing characteristics of signal capture,a voiceprint based on word embedded LSTM using deep learning is proposed.The experimental results show the improve of the accuracy and efficiency in obviously the method above.Since the LSTM only uses the time series features of the voiceprint spectrum,the spatial characteristics are not considered.To improve the accuracy of voiceprint recognition more,making full use of the characteristics of CNN to capture the spatial information features,a idea combining LSTM and CNN is proposed.Through experimental verification,it is concluded that the spatial information of the spectrum is captured by CNN first,and the timing characteristics captured by LSTM is better for the accuracy.The results show that the word embedding dimension reduction spectrum,combined with CNN_LSTM,can get the accuracy to 97.42% in the standard speech data test set,and 95.48% in the date set with noise removed.,which was 4.25% higher than the word embedding dimension reduction LSTM voiceprint recognition method.

Keywords/Search Tags:

Wiener filtering, spectrum, word embedding, deep learning, voiceprint recognition

PDF Full Text Request

Related items

1	Research On Voiceprint Recognition Technology Based On Deep Learning
2	A File Encryption Method Based On Voiceprint Recognition
3	Small Sample Voiceprint Recognition Based On Deep Learning
4	Research And Implementation Of Voiceprint Recognition System Based On Deep Learning
5	Research On Voiceprint Recognition Algorithm Based On Deep Learning
6	Study On Voiceprint Recognition Algorithm Base On Deep Learning
7	Research On The Representation Of Word Embedding Based On Knowledge Fusion
8	Research On Key Technologies Of Voiceprint Recognition Based On Deep Learning
9	Study On The Deep Learning Models And Algorithms Of DGA Domain Name Detection Based On Word Embedding Methods
10	Deep Contextual Word Embedding In Natural Language Processing