Advancements in robust algorithm formulation for speaker identification of whispered speech

Posted on:2013-07-30

Degree:Ph.D

Type:Dissertation

University:The University of Texas at Dallas

Candidate:Fan, Xing

Full Text:PDF

GTID:1458390008483925

Subject:Engineering

Abstract/Summary:

Whispered speech is an alternative speech production mode from neutral speech, which is used by talkers intentionally in natural conversational scenarios to protect privacy and to avoid certain content from being overheard/made public. Due to the profound differences between whispered and neutral speech in production mechanism and the absence of whispered adaptation data, the performance of speaker identification systems trained with neutral speech degrades significantly. This dissertation therefore focuses on developing a robust closed-set speaker recognition system for whispered speech by using no or limited whispered adaptation data from non-target speakers.;This dissertation proposes the concept of "High''/"Low'' performance whispered data for the purpose of speaker identification. A variety of acoustic properties are identified that contribute to the quality of whispered data. An acoustic analysis is also conducted to compare the phoneme/speaker dependency of the differences between whispered and neutral data in the feature domain. The observations from those acoustic analysis are new in this area and also serve as a guidance for developing robust speaker identification systems for whispered speech.;This dissertation further proposes two systems for speaker identification of whispered speech. One system focuses on front-end processing. A two-dimensional feature space is proposed to search for "Low''-quality performance based whispered utterances and separate feature mapping functions are applied to vowels and consonants respectively in order to retain the speaker's information shared between whispered and neutral speech. The other system focuses on speech-mode-independent model training. The proposed method generates pseudo whispered features from neutral features by using the statistical information contained in a whispered Universal Background model (UBM) trained from extra collected whispered data from non-target speakers. Four modeling methods are proposed for the transformation estimation in order to generate the pseudo whispered features. Both of the above two systems demonstrate a significant improvement over the baseline system on the evaluation data.;This dissertation has therefore contributed to providing a scientific understanding of the differences between whispered and neutral speech as well as improved front-end processing and modeling method for speaker identification of whispered speech. Such advancements will ultimately contribute to improve the robustness of speech processing systems.

Keywords/Search Tags:

Whispered, Speech, Speaker identification, Robust, Systems, Acoustic

Related items

1	Research On Whispered Speaker Identification In Channel Mismatch Conditions
2	Speaker Identification Of Whispered Speech Based On Joint Factor Analysis
3	Speaker Identification In Chinese Whispered Speech Based On Simplified Joint Factor Analysis
4	Whispered Speaker Recognition Based On Factor Analysis And SVM
5	Acoustic modeling and speaker normalization strategies with application to robust in-vehicle speech recognition and dialect classification
6	CASA-based robust speaker identification
7	Study Of Identification For Chinese Whispered Speech Based On Probabilistic Neural Network
8	Research On Robust Speaker Identification Under The Inflextion Environment
9	Research On Whispering Speaker Recognition
10	Alternate sensor based speech systems for speaker assessment and robust human communication