Data Augmentation For Chinese Language Models Based On Generative Adversarial Networks

Posted on:2019-12-06

Degree:Master

Type:Thesis

Country:China

Candidate:P F Zhao

Full Text:PDF

GTID:2428330590973914

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Chinese Language Model(CLM)is a mathematical model used to express the probability distribution of Chinese word sequences.It is one of the key technologies of speech recognition system.Its quality directly affects the overall performance of the system.Due to the scarcity of high-quality Chinese data sets and the diversity of Chinese vocabulary combinations in natural language scenarios,data sparsity often occurs in the trained Chinese language models.In order to solve the problem,commonly two methods are generally adopted: one is to expand the text corpus used for training,that is,to enhance the text data,and the other is to improve the smoothing algorithm for the data distribution of the text corpus used for training.However,due to the limitations and shortcomings of algorithms,more research is focused on data augmentation of training corpus to improve the performance of language models.For the problem of data scarcity in Chinese language model,we proposed an improved text retelling model based on GAN to augment Chinese text corpus,which is named xGAN,train a new language model,and employ hierarchical LSTM network and multi-level reward method to enhance the processing capability of long Chinese text sequence,diversify the generation and the capability to distinguish text sequences.Experiment results show that our model can deal with Chinese long text sequence well,and can reward the output of text sequence differentially.It solves the problem of insufficient feedback from discriminators in common classification models.Besides,we employ text retelling to augment the text data,and use the sampled data and data generated by xGAN to train language models.The two language models are interpolated to improve the robustness of the language model for parameter estimation of unknown data.The new language model is used to select one from the candidate results of speech recognition as the final recognition result.The performance of our language model on the open data set THCHS30 and AISHELL is better than the popular data training language models and the speech recognition effect is improved.

Keywords/Search Tags:

deep learning, generative adversarial networks, data enhancement, voice recognition

PDF Full Text Request

Related items

1	Research On Facial Expression Recognition Based On Generative Adversarial Networks
2	Research On Speech Enhancement Model Based On Improved Generative Adversarial Networks
3	Research On Unsupervised Image Enhancement Based On Generative Adversarial Networks
4	Research And Implementation Of Dark Image Denoising And Enhancement Algorithm Based On Generative Adversarial Networks
5	Research And Application Of Image Recognition Method Based On Deep Generative Adversarial Networks
6	Underwater Target Recognition Based On Generative Adversarial Networks
7	Research On Auto-encoders And Generative Adversarial Network Based Speech Enhancement
8	Single Channel Speech Enhancement Based On Generative Adversarial Networks
9	Image Data Enhancement Algorithm Based On Improved Generative Adversarial Networks
10	Malicious URL Detection And Data Enhancement Based On Deep Learning