Hardware Implementation Based On Low-resource Speech Recognition System

Posted on:2022-07-04

Degree:Master

Type:Thesis

Country:China

Candidate:J Lei

Full Text:PDF

GTID:2518306317999419

Subject:Microelectronics and Solid State Electronics

Abstract/Summary:

PDF Full Text Request

As an important branch of artificial intelligence machine learning,language recognition technology has an important position in Internet of Things technology and software development,and in ordinary acoustic models,Upon a low-resource database condition,traditional acoustic GMM-HMM model can't achieve a satisfying recognition rate and has large parameter scale.In order to solve those problems,a speech recognition BN-SGMM-HMM model is proposed in this article.In the acoustic feature aspect,a DNN-based BN(Bottle Neck)feature is extracted which improves the system's discriminability and robustness capability;meanwhile,the Dropout strategy is employed to prevent over-fitting problem during the training process.In the acoustic model aspect,the SGMM(Subspace Gauss Mixture Model)is adopted to decrease the parameter scale.The improvements in these two aspects have also improved the recognition rate of low-resource speech recognition systems.The experiments in this paper prove that the implement of BN-SGMM-HMM low-resource speech recognition system can train the better recognition effect under limited training corpus.While In the hardware implementation part,the open-source Chinese corpus is used for training based on the BN-SGMM-HMM acoustic model,and the trained acoustic model is implemented on the Raspberry Pi,and the microphone is used as the voice input through the Kaldi internal decoder.Recognize the input voice,and finally display the recognition result on the terminal.The innovation of the language recognition system lies in:In terms of software development,the BN-SGMM-HMM acoustic model is used as the basic model and the Kaldi speech recognition toolkit is used to train the model,and has internal feature extraction scripts and language model generation tools,which has changed the need for experienced engineers in speech recognition development in the past.This situation reduces the cycle of speech recognition system developers;in terms of hardware migration,because the hardware implementation uses the open source hardware Raspberry Pi,the user is extensive and the internal environment is open source,compared to other ARM-based development boards and ASICs Reduce the cost of development cycle and tape out.

Keywords/Search Tags:

Speech Recognition, Bottleneck Feature, Subspace Gauss Mixture Model, Dropout, Low-resource

PDF Full Text Request

Related items

1	Applied Gaussian Mixture Model In Speech Emotion Recognition Research
2	Research On Speaker Recognition Algorithm Based On Deep Neural Network
3	Research On Phone Feature Recognition Based On Deep Learning
4	Research On Gauss Mixture Clustering Algorithms In Image Retrieval
5	Research And Speaker Recognition System
6	The Application Of Feature Compensation Method Based On Probability Model In Speech Recognition
7	Research On Feature Extraction And Model Algorithm For Speaker Recognition
8	Research And Implementation Of Gaussian Mixture Model-based Speech Emotion Recognition
9	Research On Speech Emotion Recognition Based On Deep Features Fusion And Joint Decision
10	Research On Attention-Based End-to-End Speech Recognition