Hidden Markov Model Based Automatic Speech Recognition Using Mel Frequency Cepstral Coefficients In Nepalese

Posted on:2007-08-27

Degree:Master

Type:Thesis

Institution:University

Candidate:Tsering Shrestha

GTID:2178360185965489

Subject:Computer Science and Technology

Abstract/Summary:

Nepalese (also called Nepali) is a language of the northern part of South Asia, spoken mainly in Nepal, Bhutan and India. This project implements automatic speech recognition (ASR) for Nepalese, motivated by the fact that little research has been done in this area compared with the abundance of material available for other languages such as English. The recognizer uses hidden Markov models (HMMs) with Mel Frequency Cepstral Coefficient (MFCC) analysis. Although HMMs are applicable to many other pattern recognition tasks, they have gained a prominent place in ASR. The system, designed using HTK [1 HTKBook], works in three stages. First, a preprocessing stage converts the speech waveform into feature vectors. Second, the recognizer is trained. Finally, the trained recognizer is used to decode new speech data. The building blocks of the system are phoneme-level statistical models. Word-level acoustic models are formed by concatenating phone-level models according to a pronunciation dictionary. These word models are then combined with a language model, which constrains utterances to valid word sequences.
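
The concatenation of phone-level models into word-level models can be illustrated with a minimal Python sketch. It is not taken from the thesis and does not use HTK's own tools. The three-state phone topology, the romanized example words, their phone symbols, and the self-loop probability are all assumptions chosen for illustration.

```python
# Assumption: each phone model has 3 emitting states (a common left-to-right topology).
STATES_PER_PHONE = 3

# Hypothetical pronunciation dictionary; words and phone symbols are illustrative only.
PRONUNCIATIONS = {
    "pani": ["p", "a", "n", "i"],
    "ghar": ["gh", "a", "r"],
}


def word_states(word, lexicon=PRONUNCIATIONS, states_per_phone=STATES_PER_PHONE):
    """Return the ordered emitting-state labels of a word-level model.

    The word model is the concatenation of its phone models, in the order
    given by the pronunciation dictionary. A label such as "a_2" refers to
    state 2 of the shared model for phone "a".
    """
    return [
        f"{phone}_{k}"
        for phone in lexicon[word]
        for k in range(1, states_per_phone + 1)
    ]


def left_to_right_transitions(n_states, self_loop=0.6):
    """Build a left-to-right transition table for n_states emitting states.

    Row i holds the probabilities of leaving state i. Each state either
    stays where it is (self_loop) or advances to the next state. The extra
    last column stands for the exit from the word model.
    """
    rows = []
    for i in range(n_states):
        row = [0.0] * (n_states + 1)
        row[i] = self_loop
        row[i + 1] = 1.0 - self_loop
        rows.append(row)
    return rows


if __name__ == "__main__":
    states = word_states("ghar")
    print(states)
    for row in left_to_right_transitions(len(states)):
        print(row)
```

In a trained system the transition probabilities and the output distribution of each state would be estimated from speech data rather than fixed by hand, and the exit column of one word model would connect to the entry of the next word permitted by the language model.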

Keywords/Search Tags:

Coefficients

Related items

1	Design of IIR filters with canonical signed-digit (CSD) coefficients using genetic algorithms
2	The Research On Steganography Algorithm Based On ±1 DCT Coefficients For H.264 Video
3	Research On Music Genre Similarity Detection Algorithm
4	The Research Of Speaker Recognition Based On Vector Quantization
5	Software Design And Implementation Of Voiceprint Recognition Module Based On ARM
6	Design Of Broadband Beamformers With Sparse Tap Coefficients
7	Design of one-dimensional and two-dimensional filters with finite-wordlength coefficients using genetic algorithms
8	Clustering analysis of Zernike coefficients from high order aberration patients
9	The Research Of Key Techniques For Function Mining And Time Series Analysis By Gene Expression Programming
10	Speaker Recognition Technology In Noise Environment