A Study Of Modeling Of Inter-Frame Correlationin Segment-Based Models

Posted on:2004-08-17

Degree:Master

Type:Thesis

Country:China

Candidate:D F Sun

Full Text:PDF

GTID:2168360092999359

Subject:Basic mathematics

Abstract/Summary:

PDF Full Text Request

Acoustic modeling is very important for improving the performance of speech recognition system. HMM is widely used in speech recognition system presently. But one of the assumptions of HMM is state-conditioned stationarity of the observation vectors, implying that each state is a stationary source generating independent identically distribute (i.i.d.) observation vectors. This assumption of stationary is not valid as time sequence of the observation vectors is highly correlated. Thus, modeling of time correlation of a sequence of observation has become the most challenging topics of current research for acoustic modeling. In this thesis, we discuss the modeling of inter-frame correlation for speech recognition in the framework of stochastic segment model. Including: (1) We describe the motivation of stochastic segment model and develop linear dynamical system model for speech recognition. Stochastic segment model can better capture the dynamics structure over the segment of speech. Thus, we look at speech on a segmental level rather than on a frame-by-frame basis. Moreover, many experiments show linear models are adequate in modeling of inter-frame correlation. So we think that linear segment-based models can better characterize speech signal. (2) The EM algorithm for linear system identification is implemented and an initialization of the EM algorithm is proposed by using statistical analysis. EM algorithm has become one of the methods of choice for ML estimation. But it often converges slowly. If model parameter is suitably initialized, the speed of convergence is quicker. The initialization method of the algorithm proposed can quicken speed of convergence, and the algorithm is numerically very stable.(3) We proposed two ideas: one is the smoothing of trajectory and the other is context modeling. Smoothing of trajectory imply that linear dynamical system is stochastic trajectory modeling. By context modeling, we can provide an exact mathematical model for each syllable. The experiment results show that both smoothing of trajectory and context modeling can improve the performance of the system...

Keywords/Search Tags:

segment-based model, inter-frame correlation, linear model, speech recognition

PDF Full Text Request

Related items

1	Speech Recognition Algorithm
2	Research And Implementation Of Chinese Speech Keyword Recognition Algorithm
3	Application And Research Of PAD Emotion Model In Speech Emotion Recognition
4	Study Of Mandarin Digit Speech Recognition Algorithm Based On HMM Model
5	Research On Screen Content Video Coding Based On Inter Frame Correlation
6	Trajectory Model And The Vq Model-based Speaker Recognition Study
7	Research On QSHVC Inter-Frame Fast Algorithm Based On Mode And Depth Probability
8	Video Compressed Sensing Algorithm Based On The Inter-frame Correlation
9	Research On Key Technologies Of Digital Image Correlation Methods In Special Application Scenarios
10	Research On Anti-Noise Of Speech Recognition Based On Continuous Hidden Markov Model