Research On Speech Automatic Retrieval Technology For Broadcast News

Posted on:2014-03-14

Degree:Master

Type:Thesis

Country:China

Candidate:Z T Zhang

Full Text:PDF

GTID:2298330422490408

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Speech retrieval means for the users’ queries, searching from the speechdatabase and rerurning the docunment which satisfy the query request. Astechnology advances and increasingly development of Internet, the multimediainformation which people have accessed has grown exponentially, in whichbroadcast news speech occupies a large proportion, and easy to get, hasimportant scientific value. Speech retrieval technology can help people quicklyfind what they need from the massive data, so the technology has tremendoussignificance and practical value.Speech retrieval technology is actually the effective integration of speechrecognition technology and information retrieval technology, however, combinedsimply with the two techniques will lead retrieval performance is heavilydependent on the accuracy of recognition system, so the researchers will focus onmulti-candidate formal recognition results. The common recognition form ofmultiple candidates contains N-best Lattice and confusion network, wherein theN-best not contain all candidate results, and Lattice structure containing moreredundant and not conducive to the establish the index, so the confusion networkis often to be used. In addition, on the choice of the recognition units, if weselect words as the basic unit, the OOV probem will not be avoided. So the studyof Chinese speech retrieval technique often use in lattice structure based onsyllable.In this thesis, the automatic speech recognition system use thecontext-dependent techniques to train acoustic model, analyzes the impact ofmodel parameter sharing strategy during training and the number of Gaussianmixture, and combine language model to establish a large vocabulary continuousspeech recognition system. Then use the vector space model on the One Bestrecognition results to complete the baseline speech retrieval system. Later, in theform of a grid containing multiple candidate results, generating confusionnetwork which has more compact structure and through improved vector spacemodel, build a speech retrieval system. In addition, in the selection ofrecognition units, if the word is selected, it can not solve the problem “Out OfVocabulary”, and the system’s recall rate is low, while on syllables unit, it willreduce the accuracy of retrieval system, so this thesis presents a multi-unitshybrid retrieval system based on the backward integration.Experimental results show that the use of context acoustic modelingtechniques can adapt to the changes of pronunciation, improve the accuracy and robustness of the recognition system. The improved vector space model canbetter reflect the feature vectors in the proportion of the document, it canimprove the retrieval precision of retrieval system; rather more the use ofmulti-units hybrid retrieval system can not only solve the problem of OOVfundamentally, but also makes the system accuracy and recall rat e reached a goodbalance.

Keywords/Search Tags:

speech retrieval, speech recognition, vector space model, confusion network

PDF Full Text Request

Related items

1	Research On Mandarin Speech Retrieval Technique Based On Confusion Network
2	Research On Recognition Error Processing In Speech Retrieval
3	Research Of Problems In Spoken Term Detection
4	Design And Implementation Of Korean Speech Retrieval System
5	Research On Confusion Network And Side Information For Speech Recognition
6	Research And Applications Of Speech Keyword Recognition Technology
7	The Research On Speaker-Independent Greeting Speech Recognition Based On The HDSVCT
8	Emotional Speech Conversion And Recognition Based On The Three-dimensional PAD Model
9	Research On New Method Of Rubust Speech Recognition
10	Research And Implementation Of Speech Emotion Recognition Based On Feature Selection And Confusion