Convolutional Neural Networks for Speaker-Independent Speech Recognition

Posted on:2012-04-22

Degree:M.E

Type:Thesis

University:The Cooper Union for the Advancement of Science and Art

Candidate:Belilovsky, Eugene

Full Text:PDF

GTID:2468390011963114

Subject:Engineering

Abstract/Summary:

In this work we analyze a neural network structure capable of achieving a degree of invariance to speaker vocal tracts for speech recognition applications. It will be shown that invariance to a speaker's pitch can be built into the classification stage of the speech recognition process using convolutional neural networks, whereas in the past attempts have been made to achieve invariance on the feature set used in the classification stage. We conduct experiments for the segment-level phoneme classification task using convolutional neural networks and compare them to neural network structures previously used in speech recognition, primarily the time-delayed neural network and the standard multilayer perceptron. The results show that convolutional neural networks can in many cases achieve superior performance than the classical structures.

Keywords/Search Tags:

Convolutional neural networks, Speech recognition

Related items

1	Research On In-car Speech Recognition Based On One-dimensional Convolutional Neural Networks
2	Research On End-to-end Speech Recognition Based On Convolutional Neural Networks
3	Research On Speech Recognition Based On Convolutional Neural Networks
4	Convolutional Neural Networks for Speaker-Independent Speech Recognition
5	Research Of Speech Emotion Recognition Method Based On Convolutional Recurrent Neural Networks
6	Chinese Speech Recognition Based On Deep Convolution Neural Networks
7	Research On Speech Bandwidth Extension Methods Using Neural Networks
8	Research On Multi-dimensional Speech Recognition Technology Based On Multi-task Neural Network
9	The Research Of Speech Emotion Recognition Based On CNNs
10	Research On The Silent Speech Recognition Algorithm Based On SEMG Signal