Font Size: a A A

Research On Low Bit Rate Speech Coding Algorithm

Posted on:2012-10-11Degree:DoctorType:Dissertation
Country:ChinaCandidate:Z JiFull Text:PDF
GTID:1118330362967962Subject:Information and Communication Engineering
Abstract/Summary:PDF Full Text Request
The low bit rate speech coding algorithm is widely used in modern communicationsystem, and the ultra low bit rate speech compression coding is one of the mostsignificant research topics in speech signal processing area at present. Sinusoidalexcitation linear prediction (SELP) algorithm uses linear-prediction based sinusoidalmixed excitation technique, and has very outstanding performance among the speechcompression coding algorithms at the bit rate of2.4kbps or less. The research purposeof this dissertation is to analyze and research the essential techniques in speech coding,and design the150bps ultra low bit rate speech compression coding algorithm based onSELP model.The high-efficiency quantization methods of characteristic parameters areresearched first. In the scalar quantization of line spectral frequency (LSF), the globaloptimal difference quantization of LSF based on dynamic programming is proposed. Ituses multi-codebook to further improve the parameter's quantization performance, andcan attain the transparent quantization of LSF at the rate of28bits/frame. In the vectorquantization of pitch parameter, the perceptual weighting distortion measure whichutilizes the auditory characteristics of human ears is proposed to improve thequantization performance of pitch, and the integer changed optimization technique isdeveloped to further reduce the search error rate of the optimal codeword for pitchparameter.In the ultra low bit rate speech coding, the bits assigned to each frame is severelyinadequate to quantize the characteristic parameters. In order to solve this problem, therecovery algorithm of characteristic parameters in the decoder is proposed based on thecorrelation between different parameters. First the energy is recovered based on thehidden Markov model (HMM). It utilizes the LSF and the sub-band unvoiced andvoiced (U/V) parameters to estimate the change of energy parameters. Then the U/Vrecovery algorithm is proposed based on the Gaussian mixed model (GMM), whichutilizes the LSF and the normalized energy to estimate the probability distribution ofU/V parameter, so as to save the bits assigned to quantizing it.From the consideration of the decoding end, the interpolation algorithm for the characteristic parameters is developed to improve the naturalness of synthesized speechin the transition period from unvoiced speech to voiced speech. In order to improvevocoder's resistance to packet loss, mode-based linear prediction packet lossconcealment algorithm is propose, which can improve the synthesized speech qualityunder the existence of consecutive packet loss.Finally, integrating the research achievements mentioned above, the150bps SELPspeech coding algorithm is designed and realized. The vocoder's mean opinion score(MOS) is2.424, the accurate rate of the diagnostic rhyme test (DRT) is82.9%, thecodebook size is120Kword, and the algorithm delay is325ms. To sum up, the entireperformance index of the150bps SELP vocoder exceeds the requirement of the nationalEleventh Five-Year major project.
Keywords/Search Tags:speech coding, ultra low bit rate, quantization of characteristicparameter, parameter recovery in the decoder
PDF Full Text Request
Related items