Run-time information fusion in large vocabulary continuous speech recognition

Posted on:2005-05-07

Degree:Ph.D

Type:Thesis

University:OGI School of Science & Engineering

Candidate:Zheng, Chengyi

Full Text:PDF

GTID:2458390008988106

Subject:Computer Science

Abstract/Summary:

Continuous speech recognition systems are environmentally sensitive and suffer from the great variability of speech. In order to achieve recognition robustness, there's a strong interest among researchers on how to fuse different information sources for speech recognition. A common problem of those approaches is that complementary information is lost either before or after recognition.; To avoid this unrecoverable information loss, and to better utilize this complementary information, we proposed a run time information fusion scheme. The hypothesis of this thesis is that by performing fusion at different levels and stages of a Large Vocabulary Continuous Speech Recognition (LVCSR) system, especially inside the decoder, more reliable and efficient fusion is possible.; The hypothesis is first tested in a speech segmentation task, which is essential to the performance of an LVCSR system. Furthermore, three different approaches of run time fusion are proposed and implemented inside an LVCSR decoder. The experiments demonstrate the effectiveness and potential of these approaches.

Keywords/Search Tags:

Speech recognition, Fusion, Information, LVCSR

Related items

1	Noise Robust Speech Recognition Research Based On Regression Deep Neural Network
2	Real-time speaker -independent large vocabulary continuous speech recognition
3	Research On Speech Emotion Recognition Based On Multimodal Information Fusion
4	Research Of Speech Recognition Method Based On Audio-visual Information Fusion
5	Information fusion for robust audio-visual speech recognition
6	Research And Implementation Of Speech Emotion Recognition Algorithm Based On Fusion
7	Research On Feature Fusion Method Of Speech Emotion Recognition Based On Deep Learning
8	Research On Speech Phoneme Recognition Based On Deep Learning
9	Study On Cross-modal Speech Recognition Methods With Fusion Lipreading
10	Discriminative Training For Large Vocabulary Continuous Speech Recognition