Research On Speaker Recognition Algorithm Based On Cepstrum Feature

Posted on:2018-10-17

Degree:Master

Type:Thesis

Country:China

Candidate:M N Zhao

Full Text:PDF

GTID:2348330518476584

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Language is the most basic way of human information exchange,and with the development of information technology,human-computer interaction has become a new and urgent demand in science and technology promotion.Speakers identification technology with fast,effective and low cost advantages,has become widely accepted one of the important biometric authentic technology.The technology is widely used in the information authentication,judicial investigation,e-commerce,document security and other aspects.Speaker recognition,also known as voiceprint recognition,is one of the important areas of speech signal processing technology.Although the speaker recognition technology has achieved some results in theoretical research,the actual environment with high noise and high distortion can cause the recognition performance of the speaker recognition system to drop sharply.Speech feature parameter extraction is the most critical part of the speaker recognition algorithm.Therefore,the current research hotspot is how to extract the recognition performance and distinguish the superior performance characteristics in the high noise environment.Based on the existing speaker recognition algorithm,this paper makes an in-depth study on the feature extraction and weighting,filter design and other aspects of signal feature extraction,and puts forward their own solutions.1.In view of the high variance and latency of the traditional Mel spectrum,the Mel filter group simulates the poor performance of the human ear,the poor resistance of the MFCC,and the static characteristics of the Mel spectrum characteristic parameters.Multi-window estimation and Gammatone filter group,the speaker 's personality feature information is preserved to the maximum extent while reducing the variance and noise reduction of the spectrum,so as to improve the recognition performance of the speaker recognition algorithm.2.For the traditional Mel filter group,the distribution of low frequency and high frequency sparse distribution is not consistent with the spectrum distribution of the effective information of the normal sound,and the number of fixed filters is not suitable for changing the speech signal.The design is based on the Fisher principle and the Gammatone filter.Adapting to the improved filter bank,and propose a speech recognition algorithm based on the adaptive improved filter bank.

Keywords/Search Tags:

gammatone filter banks, weighted function, multi-window estimation, F principles

PDF Full Text Request

Related items

1	Auditory Filter Banks In Speech Recognition System
2	The Research Of Front-end Filter For Speaker Independent Robust Speech Recognition
3	Design Of Uniform And Nonuniform Filter Banks
4	Development Of Tactile Auditory Substitution System Based On Gammatone Filter
5	Research On Channelization Techniques For Software Defined Radio
6	The Design Of Sub-band Filter Banks Based On Cosine-Modulation
7	Study On The Design And Applications Of Cosine Modulated Filer Banks
8	Design Algorithms Of Two-dimensional NPR Cosine Modulated Filter Banks
9	Research On The Design And Applications Of Filter Banks
10	Study On Monopulse Angle Estimation For Wideband Linear Frequency Modulation Radars