Analysis and compensation of the Lombard effect under different types and levels of noise with application to in-set/out-of-set speaker recognition

Posted on:2008-11-03

Degree:M.S

Type:Thesis

University:University of Colorado at Boulder

Candidate:Varadarajan, Vaishnevi S

Full Text:PDF

GTID:2448390005474825

Subject:Speech communication

Abstract/Summary:

Speech production in noise results in the Lombard effect, which is known to have a serious impact on speech system performance. In this study, Lombard speech produced under different types and levels of noise is analyzed in terms of duration, energy histogram and spectral tilt. Acoustic-phonetic differences are shown to exist between different "flavors" of Lombard speech using a Gaussian Mixture Model (GMM) based Lombard speech type classifier. For the first time, the dependence of Lombard speech on noise type and noise level is established. Also, the impact of the different flavors of Lombard effect on speech system performance is shown with respect to an inset/out-of-set speaker recognition task. System performance is shown to degrade from 7.0% Equal Error Rate (EER) under matched neutral training and testing conditions to an average EER of 26.92% when trained with neutral and tested with Lombard speech. Furthermore, improvement in the performance for in-set/out-of-set speaker recognition is demonstrated by adapting neutral speaker models with Lombard speech data of limited duration. Improved average EER of 4.75% and 12.37% were achieved for matched and mismatched adaptation and testing conditions respectively. At highest noise levels, EER as low of 1.78% was obtained by adapting neutral speaker models with Lombard speech of limited duration.

Keywords/Search Tags:

Lombard, Speech, Noise, Speaker, Levels, EER, Different, Neutral

Related items

1	Research On Speech Intelligibility Enhancement Based On Lombard Effect
2	Research On Near-end Listening Enhancement Algorithm Based On Lombard Speech Conversion
3	Speech Segregation in Background Noise and Competing Speech
4	Research On Emotional Speaker Recognition And Its Solutions
5	Analysis Of Speaker Roles For Multi-speaker Conversational Speech
6	Robust speaker recognition in noise by coherent spectral modification
7	Evaluation of the signal-to-noise ratio required to achieve the same performance in English and Mandarin Chinese
8	Research On Anti-noise Speaker Recognition Based On Non-reconstructed Compressive Sampling
9	Research On Speaker Adaptation In Speech Recognition
10	Robust Speaker Modeling in Non-Neutral Environments with Application to Large Scale Multi-Speaker Audio Stream