Speech Recognition Research Based On Neural Network And Acoustic Characteristics Of Hybrid Structures

Posted on:2018-02-03

Degree:Master

Type:Thesis

Country:China

Candidate:H J Xu

Full Text:PDF

GTID:2428330569475085

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

In recent years,computer technology has a rapid develop,and neural network has also ushered in a new round of upsurge,based on neural network hybrid structure model and acoustic feature extraction have made remarkable achievements.Since the first time in2011,Microsoft Research Institute has used deep neural network(DNN)in the large-scale speech recognition task to obtain significant results to enhance the neural network in the field of speech recognition gradually more and more attention from all walks of life.The purpose of speech recognition technology is to let the machine can understand people say,due to computer technology has the rapid development,people once again on the speech recognition technology hope.Firstly,this paper introduces some generalizations of speech recognition technology,and then discusses the feature extraction of speech recognition.In view of the traditional static characteristic parameters(LPCC,MFCC)and dynamic characteristic parameters(LPCC,MFCC(LPMFCC),the corresponding theory and algorithm are analyzed.At the same time,the progress of the proposed neural network is introduced.Secondly,it discusses the acoustic modeling of neural network and its role in speech recognition.It summarizes the development process of acoustic modeling of neural network,focusing on the details of Restricted Boltzmann Machine(RBM),commonly used learning Algorithms,and evaluation methods are described systematically.The speech recognition experiment based on backward propagation algorithm(Back-Propagation)is studied by using MFCC and(LPMFCC,MFCC)mixed parameters in BP neural network and DBN neural network acoustic model.The results show that the linear prediction plume The characteristics of the cepstrumn coefficient can be better than the Meier cepstrum coefficient,and the BP neural network in the recognition system can be optimized and shortened The training time,improve the recognition performance.RBM model is easy to learn in depth learning algorithm,this model of the algorithm to overcome the direct multi-layer network training efficiency.In this paper,the Deep Belief Nets(DBN)model is built by RBM stack in speech recognition experiment.With the deep belief network,the continuous multi-frame speech feature can be used together,because the multi-layer structure of the human brain can be simulated,so the information feature extraction can be carried out step by step.In the DBN acoustic model,we use the time-adjusted MFCC and(LPMFCC,MFCC)mixed parameters as input data.The experimental process is based on RBM setting rules to optimize the network model,enhance the model learning effect,and with the traditional BP neural network The model is compared and found to achieve a better recognition effect.

Keywords/Search Tags:

LPMFCC, Back Propagation, Restricted Boltzmann Machine, Deep Belief Nets

PDF Full Text Request

Related items

1	Research On Chinese Information Extraction Based On Deep Belief Nets
2	Deep Learning Models And Applications Based On The Restricted Boltzmann Machine
3	The Improvement Of Restricted Boltzmann Machine And Its Application
4	Research Of Restricted Boltzmann Machines Based On Smooth L₀ Norm And Its Application
5	Image Classification Method Based On Abandoned Stacked Restricted Boltzmann Machine
6	Research On Medical Image Classification Method Based On Restricted Boltzmann Machine
7	Retrieval And Recommendation System Of Resources Based On Deep Belief Networks
8	Research On Parallel Sparse Deep Belief Networks
9	Deep Belief Network And Its Application In Handwritten Font Recognition
10	Research On Training Algorithm Of Restricted Boltzmann Machine Under Classification Rate Criterion