The Mongolian Speech Recognition System Based On Depth Of Neural Network

Posted on:2017-04-30

Degree:Master

Type:Thesis

Country:China

Candidate:Y N Guo

Full Text:PDF

GTID:2308330485961098

Subject:Computer technology

Abstract/Summary:

Speech recognition technology, also known as automatic speech recognition (ASR), aims to convert the lexical content of human speech into computer-readable input such as key presses, binary codes, or character sequences. In recent years, speech recognition has advanced rapidly for the major languages and has entered many application areas; Apple's Siri is a familiar example. Unfortunately, speech services of comparable quality are not yet available for minority languages. Research on Mongolian speech recognition is therefore important both for scientific and technological progress and for the development of Mongolian language and culture, and it advances speech information processing for China's minority languages.
A growing number of open-source tools for language and speech processing are publicly available, but most of them support only a closed vocabulary. For applications that must handle unrestricted speech input, no vocabulary, however large, can cover every word. The open-source speech recognition toolkit developed at RWTH Aachen University (RWTH ASR, abbreviated RASR) can combine sub-word units into new words that are not in the vocabulary, so that out-of-vocabulary words can be recognized during decoding and large-vocabulary continuous speech recognition becomes possible.
This thesis introduces the theory of speech recognition and describes RASR, the open-source toolkit from RWTH Aachen University, Germany, for building large-vocabulary acoustic models and for decoding. It configures the signal analysis, estimates Gaussian mixture models and phonetic decision trees, and combines them with a deep neural network (DNN) to obtain an open-vocabulary ASR system.
The main work of the thesis is to train a neural network on a large amount of speech data to obtain the acoustic model. It also describes in detail how to carry out continuous speech recognition with the open-source RASR toolkit, focusing on the configuration and implementation of training and recognition.
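The open-vocabulary idea mentioned in the abstract, recombining recognized sub-word units into full words, can be illustrated with a minimal sketch. This is not RASR code and does not reflect the thesis's actual configuration: the function name `merge_subword_units` and the trailing `+` continuation marker are assumptions made only for this illustration.

```python
from typing import List

# Hypothetical convention (an assumption, not taken from RASR or the thesis):
# a trailing "+" marks a fragment that continues into the next unit.
CONTINUATION_MARK = "+"


def merge_subword_units(units: List[str], mark: str = CONTINUATION_MARK) -> List[str]:
    """Join consecutive sub-word fragments into whole words.

    A unit ending with `mark` is glued to the unit that follows it;
    any other unit closes the current word.
    """
    words: List[str] = []
    pending = ""  # fragments collected for the word currently being built
    for unit in units:
        if unit.endswith(mark):
            # Strip the marker and keep collecting fragments.
            pending += unit[: -len(mark)]
        else:
            # This unit ends the word; emit it together with earlier fragments.
            words.append(pending + unit)
            pending = ""
    if pending:
        # The sequence ended on a continuation fragment; emit what was collected.
        words.append(pending)
    return words
```

With this convention, a recognizer output such as `["mon+", "gol", "speech"]` would be merged into `["mongol", "speech"]`, so a word missing from the full-word vocabulary can still appear in the transcript as long as its fragments are in the lexicon.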

Keywords/Search Tags:

RASR, HMM, Neural network, Speech recognition

Related items

1	The Mongolian Speech Recognition System Based On Depth Of Neural Network
2	Studying On Chinese Digital Speech Recognition Technology Based On Neural Network
3	Study Of Speech Recognition Algorithm Based On HMM And Neural Network
4	Research On Speech Emotion Recognition Model Based On Deep Neural Network
5	Time Delay Neural Network Based Automatic Speech Recognition
6	Noise Robust Speech Recognition Research Based On Regression Deep Neural Network
7	Neural Network-based Chinese Speech Emotion Recognition
8	Research On Mandarin Speech Recognition Technology Based On Deep Neural Network
9	Study Of Speech Recognition For Mandarin Digit Based On Characteristics Of Hearing And Neural Networks
10	Research Of Algorithm In Identifying The Speech Recognition Based On Neural Network And HMM