Acoustic Model Of Speech Recognition Based On Lightweight Neural Network And Its Application In Robot

Posted on:2021-09-04

Degree:Master

Type:Thesis

Country:China

Candidate:J Y Wu

Full Text:PDF

GTID:2518306104480034

Subject:Mechanical engineering

Abstract/Summary:

PDF Full Text Request

Currently,speech recognition has been widely researched and applied,and coupling neural networks to acoustic models can achieve better system performance.However,in practical applications,the computing power and storage capacity that the hardware can provide are limited.Therefore,while ensuring the performance of the speech recognition system,how to adjust to adapt to small storage,hardware equipment with low computing power has become a problem that needs to be studied and solved.And,this paper studies two lightweight neural networks,couples them to acoustic models,and builds and optimizes a speech recognition system for adaptive robot hardware devices.First,the sparse neural network is applied to the speech recognition acoustic model.A step-cut method is proposed.This method determines the size of the neurons in the neural network by specific evaluation indicators,and obtains a lightweight neural network by pruning and retraining.The method was verified on the Wall Street Journal speech data set.With only a slight decrease in system performance,the acoustic model size was reduced from 77 M to 21 M,greatly reducing the storage and computing power required by the speech recognition system.Secondly,the structural growth type lightweight neural network is studied.Contrary to the idea of sparse neural networks,structural growth neural networks control the size and performance of neural networks by gradually increasing the structure of small neural networks.Aiming at the actual application scenarios of speech recognition,based on the theory of Ada Boost algorithm,a new Con Boost structure-increasing lightweight neural network is proposed.The method was verified on the Wall Street Journal's voice data set.The results show that the Con Boost algorithm enables the neural network to reduce the number of connections,but instead gets a 8.2% performance improvement.Then,a speech recognition system suitable for embedded devices on the robot is designed and built.On the basis of the above research,for the hardware equipment provided by the robot,using the proposed Con Boost structural growth lightweight neural network,and based on the Kaldi speech recognition toolkit,the speech recognition system was designed and optimized,including the feature processing module Acoustic model,dictionary and language model combined decoder.The size of the acoustic model is only9.9M,which meets the hardware requirements.Finally,the research work and results of the full text are summarized,and several points worthy of further research are put forward.

Keywords/Search Tags:

Automatic Speech Recognition, Lightweight neural network, Acoustic Model, Robots

PDF Full Text Request

Related items

1	The Study On Acoustic Model Based Neural Netword In Mongolian Speech Recognition System
2	Research On Acoustic Modeling In Low Resource Speech Recognition Based On Transfer Learning
3	Research On Speech Recognition Based On Convolutional Neural Networks
4	Uyghur Speech Recognition Based On Deep Recurrent Neural Network
5	Ensemble acoustic modeling in Automatic Speech Recognition
6	Research And Design Of Lightweight Anti-noise Speech Recognition Algorithm
7	Research On Mandarin Speech Recognition Technology Based On Deep Neural Network
8	Spoken Dialogue System For Service Robots And Research About Language Model
9	Research On Acoustic Modeling Of Speech Recognition Based On Recurrent Neural Network
10	Structured Recurrent Neural Network And Its Applications In Automatic Speech Recognition