Deep Networks With Stochastic Depth For Acoustic Modelling

Posted on:2019-11-19

Degree:Master

Type:Thesis

Country:China

Candidate:D S Chen

Full Text:PDF

GTID:2428330566986907

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

The natural communication of machine and human beings has always been the dream of humankind.In the past 50 years,great progress has been made in the research of speech recognition technology.Especially since 2011,with the improvement of deep learning theory,the drastic increase of computer performance as well as the accumulation of massive voice training data,speech recognition begin to enter commercial age.Compared with traditional networks,the deeper neural network has better ability of model fitting and expression,but it lacks the applications in speech recognition at present.There are two reasons,on the one hand,due to the effect of gradient vanishing phenomenon,the gradient values decay continuously during the process of error back propagation,which make it difficult to update the parameters close to the network input layer.On the other hand,the deeper network has a large number of parameters,which leads to a lot of time in training and testing.This paper concentrates on the research of stochastic depth network algorithm in acoustics modeling.By adopting the residual network and randomly dropping some residual network blocks during the training process,the gradient vanishing is alleviated.And then combined with the model compression algorithm,under the condition of controlling the scale of the model parameters,a deeper speech acoustic model is trained,which ultimately leads to lower word error rates.The experimental results show that compared with the traditional network,deep networks with stochastic depth can improve the recognition rate of the speech system little when the data is limited,but it can significantly improve the recognition rate of the system when the training data is sufficient.At the same time,the application of model compression technology can effectively reduce the parameters and the amount of calculation during operation.The conclusions of this research have important reference to the design of speech acoustic modeling.

Keywords/Search Tags:

Speech recognition, Acoustic model, Stochastic depth network, Deep residual network, Model compression

PDF Full Text Request

Related items

1	A Study On The Extraction Of Speech Depth In Tibetan Language And Its Speech Recognition
2	Research On The Compression Of Neural Networks Based Acoustic Models For Speech Recognition
3	Research On Uyghur Speech Recognition Based On Deep Learning
4	Research And Application Of Speech Recognition Technology Based On Deep Neural Network
5	Research On Mongolian Speech Recognition Acoustic Model Based On Deep Learning
6	Research On Mandarin Speech Recognition Technology Based On Deep Neural Network
7	Research On Acoustic Model Of Speech Recognition In Educational Scene Based On Deep Learning
8	Research On Speech Recognition Based On Convolutional Neural Networks
9	Uyghur Speech Recognition Based On Deep Recurrent Neural Network
10	Deep Neural Networb For Chinese Speech Recognition