Research On The Key Issues Of Compressed Speech Sensing

Posted on:2013-01-11

Degree:Doctor

Type:Dissertation

Country:China

Candidate:L H Sun

Full Text:PDF

GTID:1228330395984066

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Speech is the most convenient way for human to communicate with each other. In thedigital era, how to model the speech signal in order to obtain fewer samples without affectingthe quality of the speech, is the research focus in the field of speech signal processing.Compressed sensing (CS) based on the sparsity of the signal is a new theory of linear andnon-adaptive sampling. Compressed sensing shows that the sparser of the signal, the better thecompression performance is. The sparsity or compressibility of the signal is a necessarycondition for achieving good performance of compression reconstruction. As the speech signalis compressible, it can be sampled at the lower rate than the Nyquist sampling rate withoutdistortion, which has brought great convenience to the signal sampling, storage, transmissionand processing. The combination of CS theory and speech signal processing means thesubversion to traditional speech analysis methods based on the Nyquist sampling theory. Withobservations in the CS theory to replace traditional speech sample value is bound to the signalcharacteristics fundamental change, thereby affecting the entire speech signal processing theoryand technology systems of a variety of applications. The CS is used in the field of speech signalto explore new ways of the speech signal processing, which has a good practical significance.Research on the key issues of compressed speech sensing is the foundation to practicalapplication. In this thesis, the main work and innovation are as follows:(1) According to the sparsity of speech signals in the discrete cosine transform basis (DCT),the CS framework of speech signal is described and CS random observation matrix andreconstruction algorithm are selected firstly, and then Via basis pursuit (BP) and orthogonalmatching pursuit (OMP), it is demonstrated that the performance of reconstruction is correlatedwith the dimensions of measurement matrix, the length of frames, the shift of frames and theshape of the window.(2) Using the correlation of speech signals and the sparsity on the residual, we propose anew algorithm of compressed sensing of speech signals based on overcomplete linear prediction(OLP) dictionary from codebook constructed by the linear prediction coefficients of trainingsignals. The proposed method not only improves the performance of reconstructed speechsignals based on Gaussian measurement matrix and basis pursuit, but also does not need to know prediction coefficients of testing signals. We compare OLP with discrete cosine transform(DCT) by applying both methods to compressed speech sensing, and the reconstructed speechsignal is evaluated by the objective and subjective evaluations. The experimental results showthat the performance of compressed speech signal sensing based on OLP is3~8dB higher thanDCT with the same number of measurements.(3) To deal with the problem that the reconstruction performance of compressed sensing ofspeech reconstructed by basis pursuit (BP) is degraded by white Gaussian noise interference, anadaptive basis pursuit de-noising (ABPDN) method is proposed for compressed sensing of noisyspeech. This method adaptively selects the prime regularization parameter according to the SNRof original speech signal and enhancement is achieved as it is applied to compressed sensing ofnoisy speech. And then an adaptive basis pursuit denoising tracked post-denoising method is putforward. Finally, pre-denoised tracked reconstruction method is proposed, which employs anadaptive speech enhancement method based on overcompletely sparse representation in adata-driven dictionary and basis pursuit. The performances of the three methods by the objectiveand subjective evaluations are compared. For compressed speech sensing of the measurementwith noise, an adaptive basis pursuit de-noising method is proposed. The reconstructionperformance of speech signal based on ABPDN is superior to BP.(4) For compressed sensing of the speech signal in the wavelet domain, a matrix form ofSym wavelet decomposition and synthesis is deduced, keeping the length of the coefficient nomore than the length of original speech signals, and then we propose a framework of speechmultiscale compressed sensing (MCS) and an adaptive multiscale compressed sensing (AMCS)method by analyzing sparsity of different wavelet levels of speech signals.We compare AMCSwith MCS by applying both methods to speech compression and reconstruction, and thereconstructed speech signal evaluated by the objective and subjective evaluation is applied tospeaker recognition. The experimental results show that the reconstruction performance ofspeech signal based on AMCS is superior to MCS.

Keywords/Search Tags:

Speech Signals, Compressed Sensing, Overcomplete Dictionary, MultiscaleCompressed Sensing, Basis Pursuit

PDF Full Text Request

Related items

1	Theory And Methods Of Non-convex Compressed Sensing Based On Overcomplete Dictionaries
2	Research On Projection Matrix And De-noising Of Speech Compressed Sensing Based On Overcomplete Dictionary
3	Research On Speech Enhancement Method Based On Compressed Sensing Theory
4	Application Study Of Compressed Sensing In Image Processing
5	The Research On PPG Signals Reconstruction Based On Compressed Sensing
6	Research On Recovery Algorithms Of Compressed Sensing For Multiband Signals
7	Research On Speech Sparse Basis And Projection Matrix Based On Compressed Sensing
8	Structure-Constrained Basis Pursuit for Compressively Sensing Speech
9	Research On The Optimized Projection And Reconstruction Technology Of Speech Signal Based On Sparse Representation
10	Research On The Application Of Compressed Sensing In Super-srsolution Image Reconstruction