Research On Pronunciation Space Modeling Of Non-native Speakers

Posted on:2013-04-17

Degree:Master

Type:Thesis

Country:China

Candidate:B H Li

Full Text:PDF

GTID:2348330518989650

Subject:Information and Communication Engineering Communication and Information Systems

Abstract/Summary:

PDF Full Text Request

In this thesis,we research on acoustic modeling for the Xinjiang region of non-native speaking people of Mandarin speech recognition technology.The research of the Xinjiang non-native speaker of Mandarin is not only theoretical meaning,but also has great practical significance.Taking the Uighur speak Mandarin,we attempt to research that how to use a few non-native language speaking people the Mandarin expected data build for non-native speaker of Chinese Mandarin identify.In this paper,the following points:1.Select the best model of the main vowel and initials-finals.As the establishment of standard Mandarin acoustic models,we will use two methods based the main vowel and initials-finals to make the model,and compare the priority of the two models with the experimental comparison.We can get that under the same conditions,the performance of initials-finals is significantly better than the main vowel,and when the number of Gaussian mixture reaches 32,the performance of initials-finals modeling can get the ideal level.2.The method of getting the map rules based on initials-finals.On the problem of dramatic decline in rates when the standard model is applied to Xinjiang non-native speaker,pronunciation dictionary adaptive methods is be used to improve the recognition rate.Compared with Non-native speaker new model which is needed a large number of corpus to be established,we just need to a small amount of non-native speaker corpus on the relative recognition rate increased by 10.44%.3.Access to multi-pronunciation dictionary,to be expanded from the syllable and phoneme level,and select three pruning strategies to make the pronunciation variation pruning compression.By a large number of comparative experiments,we can select the most suitable expansion strategy of Xinjiang non-native speaker pronunciation dictionary.Experimental results show that pronunciation dictionary which is established with the relative maximum pruning strategy based on the vowels is better than others methods and can maximize the improvement of recognition rate.The same time,when the number of dictionary scale is 1.4-1.7 times of the standard dictionary,the recognition rate is in a good level.

Keywords/Search Tags:

Pronunciation dictionary, Phoneme confusion matrix, Pruning strategy, Uighur speaker, Non-native speech recognition

PDF Full Text Request

Related items

1	The National Language And Accent Pronunciation Dictionary Adaptive Mandarin Speech Recognition
2	Speaker dynamics as a source of pronunciation variability for continuous speech recognition models
3	Research On The Assessment Of Mandarin Pronunciation Of Tibetan Speakers
4	Research On Speech Phoneme Recognition Based On Deep Learning
5	Research And Implementation Of Speech Intelligibility Evaluation Method Based On Phoneme
6	Research On Speaker Recognition Algorithm Based On Dictionary Learning
7	Speech Recognition's Application In Computer-assisted Language Learning
8	Research And Application Of Pronunciation Detection For Deaf Children Rehabilitation
9	Application Research Of Spectrogram On Pronunciation Recognition Of Chinese Characters And Speaker Recognition
10	Research Of Speech Recognition And Its Application In The Speech Error Identifying System