Technology Of Tibetan Speech Recognition Based On Fast Walsh Transform

Posted on:2012-02-22

Degree:Master

Type:Thesis

Country:China

Candidate:Q F Liu

Full Text:PDF

GTID:2218330338967407

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

The research of Tibetan speech recognition is still in the initial stage due to the various reasons. Because there is a large population use Tibetan and development of the technology can promote academic exchange and connection between Tibet and outside world what makes great sense in promoting national unity and stability. So it plays extremely important role in wide range applications of speech identification systems.Identification speed is one of the most prime targets for the recognition of isolated Tibetan speech. But it will be unable to meet the real-time requirement when the vocabulary increases greatly. The fast Walsh transform was applied to extract the feature parameters instead of MFCC to solve this problem. It shortens the duration of parameters calculation and the systems improved obviously.For the recognition of continuous Tibetan speech, the precision of dividing the continuous speech into units determines the recognition effect. MFCC_FWT and screened twice based on Wavelet Transform segmentation algorithm were applied to process the continuous speech of Tibetan. So the continuous speech was divided into units and it could be identified.The main work and contributions are as follows:1. Analysis the Tibetan pronunciation features and sentences'syntactic features firstly. Introduce the basic principles of Tibetan speech recognition system. In-depth presented the principles of pre-processing and endpoint detection technology.2. Introduced the algorithm of MFCC. Introduce the fast Walsh transform into it to improve the computational efficiency. After the improvement, the computed speed increased greatly. And it can also ensure the effectiveness of characteristic parameters.3. Present the both DTW and HMM recognition algorithms and applied them into the isolated word Tibetan speech recognition system of medium vocabulary. DTW algorithm is rather simple and effective for isolated word recognition of a particular person. HMM algorithm has strong modeling capability, so it can conveniently represent any voice-based element. HMM is suitable for both isolated word and continuous speech recognition systems.4. MFCC FWT and screened twice based on Wavelet Transform were applied to the segmentation algorithm of continuous speech in Tibetan for the first time. Then recognize the continuous speech after it was divided into the isolated units. It greatly simplifies the system's realization difficulities for continuous Tibetan speech.

Keywords/Search Tags:

Tibetan Speech Recognition, MFCC, Fast Walsh Transform, Correlation, Continuous Tibetan Speech Segmentation

PDF Full Text Request

Related items

1	Research On Tibetan Non-specific Continuous Speech Recognition Based On Deep Learning
2	Research And System Realization Of Tibetan Continuous Speech Recognition Technology
3	Research On Tibetan Speech Recognition Based On Speech Spectral Features
4	Research On Tibetan Speech Recognition Based On Bidirectional Recurrent Neural Network
5	Research On Statistical Parametric Mandarin-Tibetan Cross-lingual Speech Synthesis
6	Research On Online Tibetan Speech Recognition System
7	Design Of End-to-end Ando Tibetan Speech Recognition System Based On Deep Learning
8	Research On End-to-end Tibetan Speech Recognition Based On Deep Learnin
9	Research On Tibetan Lhasa Dialect Speech Recognition
10	Research On Audio And Video Speech Recognition In Tibetan Lhasa Dialect