Parametric model based speech enhancement

Posted on:2011-11-28

Degree:Ph.D

Type:Thesis

University:The Pennsylvania State University

Candidate:Xiao, Xiaoqiang

Full Text:PDF

GTID:2448390002959863

Subject:Engineering

Abstract/Summary:

Speech enhancement is a long standing problem in speech signal processing research [1, 2]. The distortion of speech signals with additive noise has hampered the success of numerous speech signal processing applications ranging from hearing aids [3, 4, 5], to speech coding [6] and automatic speech recognition [7, 4]. Up to now, most practical approaches to speech enhancement are waveform based, i.e. these approaches are (in principle) filtering the distorted signal to estimate the clean signal [8, 9]. In this thesis, we study how speech enhancement can be improved by restricting the class of targeted signals specifically to speech signals and employing parametric speech production models. The idea of fusing more speech specific information into the enhancement process is not entirely new [10, 11], but, so far, the performance of such methods has had only limited success.We present a new method for the enhancement of speech. The method is designed for scenarios in which targeted speaker enrollment as well as system training within the typical noise environment are feasible. The proposed procedure is fundamentally different from most conventional and state-of-the-art speech enhancement approaches. Instead of filtering a distorted signal we are re-synthesizing a new "clean" signal based on its likely characteristics. These characteristics are estimated from the distorted signal.To re-synthesize "clean" signal of high quality, a speech production model is needed. Throughout this thesis, we tried two different speech production models including autoregressive speech production model and inventory based speech production model. Successful implementations of the proposed methods are presented. Experiments show that the proposed method compares very favorably to other state-of-the-art systems in both objective and subjective speech quality assessments. Potential applications for the proposed method include vehicular speaker-phone communication systems and jet pilot communication systems.

Keywords/Search Tags:

Speech, Signal, Model, Method, Proposed

Related items

1	Model-driven Time-varying Signal Analysis and its Application to Speech Processing
2	VQ Combined With HMM Model Experimental Study On Speech Signal
3	Research On Emotion Recognition Of Speech Signal Based On HMM
4	The Voice Source in Speech Production: From Models to Applications
5	Signal processing for a visual aid to improve speech intelligibility of the hearing impaired
6	Research On The Chinese Speech Signal Processing By Applying DIVA Model
7	Real-time speech signal analysis-synthesis of the H(2)M model to be implemented using networked computers
8	Research On Signal Processing Of Individual Characteristics Based On Speech Recognition
9	Endpoint Detection Algorithm For Speech Signal In Low SNR Environment
10	AR and ARMA system identification techniques under heavy noisy conditions and their applications to speech analysis