A framework for low bit-rate speech coding in noisy environment

Posted on:2006-09-12

Degree:Ph.D

Type:Thesis

University:Georgia Institute of Technology

Candidate:Krishnan, Venkatesh

Full Text:PDF

GTID:2458390008469059

Subject:Engineering

Abstract/Summary:

State of the art model based coders offer a perceptually acceptable reconstructed speech quality at bit-rates as low as 2000 bits per second. However, the performance of these coders rapidly deteriorates below this rate, primarily since very few bits are available to encode the model parameters with high fidelity. This thesis aims to meet the challenge of designing speech coders that operate at lower bit-rates while reconstructing the speech at the receiver at the same or even better quality than state of the art low bit-rate speech coders. In one of the contributions, we develop a plethora of techniques for efficient coding of the parameters obtained by the HELP algorithm, under the assumption that the classification of the frames of the MELP coder is available. Also, a simple and elegant procedure called dynamic codebook reordering is presented for use in the encoders and decoders of a vector quantization system that effectively exploits the correlation between vectors of parameters obtained from consecutive speech frames without introducing any delay, distortion or suboptimality. The potential of this technique in significantly reducing the bit-rates of speech coders is illustrated. Additionally, the thesis also attempts to address the issues of designing such very low bit-rate speech coders so that they are robust to environmental noise. To impart robustness, a speech enhancement framework employing Kalman filters is presented. Kalman filters designed for speech enhancement in the presence of noise assume an autoregressive model for the speech signal. We improve the performance of Kalman filters in speech enhancement by constraining the parameters of the autoregressive model to belong to a codebook trained on clean speech. We then extend this formulation to the design of a novel framework, called the multiple input Kalman filter, that optimally combines the outputs from several speech enhancement systems. Since the low bit-rate speech coders compress the parameters significantly, it is very important to protect the transmitted information from errors in the communication channel. In this thesis, a novel channel-optimized multi-stage vector quantization codec is presented, in which the stage codebooks are jointly designed.

Keywords/Search Tags:

Speech, Framework, Model

Related items

1	Research On Speech Recognition System Based On The Improved Hybrid HMM/SVM Framework
2	Research On Continuous Speech Recognition Based On A Hybrid HMM/SVM Framework
3	A computational framework for exploring the role of speech production in speech processing from a communication system perspective
4	Research On Low-bit-rate Wideband Speech Coding Algorithms Based On The Sinusoidal Speech Model
5	Emotional Speech Conversion And Recognition Based On The Three-dimensional PAD Model
6	Research On Algorithms Of Single Channel Speech Watermarking And Speech Enhancement
7	Research On Anti-Noise Of Speech Recognition Based On Continuous Hidden Markov Model
8	A Study On The Extraction Of Speech Depth In Tibetan Language And Its Speech Recognition
9	Design And Implementation Of Intelligent Speech Interaction
10	Design Of Robot-Oriented Speech Interaction System