Study On Deep Learning-based Speaker Recognition

Posted on:2015-09-26

Degree:Master

Type:Thesis

Country:China

Candidate:G S Geng

Full Text:PDF

GTID:2298330467985793

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Speaker recognition is called voiceprint identification. It is a kind of authentication technology. Speaker recognition technology has many advantages, including high user acceptance, low equipment costs, strong scalability and easy to transplantation. It is widely used in military field, bank system, internet security and judicial security. Speaker recognition technology is related to our life closely and has great research value and practicality.This thesis mainly studies the speaker recognition system with deep learning model. Some basic system performance testing is completed and discussed, and this paper modified speech feature parameters and statistical method to obtain a higher speaker recognition system rate. What are this paper talking about is as follows:(1)The basic performance of system based on deep learning. The deep learning model is introduced in speaker recognition system. The impact of the different length of speech units on speaker recognition system rate is studied. On the same test condition, the impact of different speech features on speaker recognition system rate is also studied. The impact of different layers and nodes of deep learning model on system recognition rate is studied. The accuracy and reliability of deep learning model applied on speaker recognition system is proved.(2)Based on human auditory characteristics, we apply a new speech feature by combining MFCC with GFCC to speaker recognition system to improve the recognition rate.(3)Considering the traditional system statistics algorithm for multi-speaker recognition leads to misjudgment, we proposed a modified statistics algorithm for multi-speaker recognition system. The effectiveness of modified method is proved by experiments.

Keywords/Search Tags:

Speaker Recognition, Deep Learning, Restricted Boltzmann Machine, Mel-Frequency Cepstral Coefficients, Gammatone Frequency Cepstrum Coefficients

PDF Full Text Request

Related items

1	Study On Deep Learning-Based Speech Quality Assessment
2	The Research Of Speaker Recognition Based On Vector Quantization
3	Speaker Recognition Technology In Noise Environment
4	Discrimination Based On Support Vector Machine Speaker
5	Research And Implementation Of Deep Belief Networks Based Speaker Recognition
6	Hidden Markov Model Based Automatic Speech Recognition Using Mel Frequency Cepstral Coefficients In Nepalese
7	Speaker Recognition Based On Support Vector Machine
8	Study Of Speech Recognition System For Mandarin Digit Based On HMM
9	Anti-noise Power Normalized Cepstral Coefficients For Two-level Robust Environmental Sounds Recognition In Real Noisy Conditions
10	Research On Robust Speaker Recognition Technology Based On GMM-UBM