The Research About The Acoustic Model

Posted on:2010-04-14

Degree:Master

Type:Thesis

Country:China

Candidate:G X Wang

Full Text:PDF

GTID:2178360278965704

Subject:Pattern Recognition and Intelligent Systems

Abstract/Summary:

In the information age, we have to face a large number of audio and video, and a problem that how to class the similar information and find the useful part. This is also the trend of continuous speech recognition.Broadcasting Speech contains the following features: complex background environment, speaker independent and massive amount of data. We need the system using a little data to build a base line, and then selecting some unlabeled but most informative samples to annotate them, and adding the newly transcribed samples to the training set to update the acoustic model. In this way, we can greatly reduce the number of samples transcribed. In this paper, we analyze the features of broadcasting speech, select some rules for building the broadcasting speech data base and the transcribe system. At the same time, we design an active learning algorithm and build an active learning system, then comparing the random selection and K-L distance for the initial sample selection, as well as balancing random selection other training samples, the maximum likelihood (MLE) and the posterior probability. We find out using K-L distance and the posterior probability based on confusion network select the sample can greatly reduce the sample transcribed and improve system efficiency. In addition, this article also has a comparing about vowel sound element model and phoneme element model for continuous speech recognition performance. The results shows that vowels is more suitable for Chinese acoustic modeling.

Keywords/Search Tags:

Broadcasting Speech, Broadcasting Speech Database, Transcribe System, Active Learning, Sample Selection, Posterior, Probability, Element Model

Related items

1	Research And Implementation Of Emergency Broadcasting System And Speech Enhancement
2	Speech Communication Under The Vision Of Broadcasting And Hosting Art Professional Talent Training Mode Reform
3	The Research Of Sensitive Information Detection And Retrieval Algorithm Over Encrypted Speech For Chinese
4	A Study Of Active Learning For Acoustic Modeling In Speech Recognition
5	The Study Of Real-time Speech Enhancement Algorithm For Broadcasting Station
6	The Improvement And Application Of G.723.1 In Digital AM Broadcasting System
7	Sichuan Television Broadcasting System Research And Design
8	Study On Key Technologies Of Active Learning In Division Classification Model
9	Digital Construction Of Broadcasting System For Anhui Vocational College Of Grain Engineering
10	A Study Of Key Technologies To Freely Spoken Mandarin Speech Evaluation