Font Size: a A A

Study Of Speech Pitch Period Detection Algorithm And The Application In Speech Synthesis System

Posted on:2009-09-14Degree:MasterType:Thesis
Country:ChinaCandidate:J LiFull Text:PDF
GTID:2178360245965567Subject:Signal and Information Processing
Abstract/Summary:PDF Full Text Request
Pitch period of speech signal is a very important character parameter to describe the excitation source. Detecting the pitch period of speech signal accurately has very important significance for speech analysis and synthesis, speech compression and coding, speech recognition. The paper discusses several common methods for pitch period detection and wavelet transform, Hilbert-Huang transform, this paper proposes the algorithm of AutoCorrection Energy Function (ACEF) combined with Magnitude Difference Energy Function (MDEF) which has good performance in anti-noise, meanwhile applies the Hilbert-Huang transform to pitch synchronous mark of TD-PSOLA speech synthesis system.This paper first introduces some kinds of commonly used speech pitch period detection. For example AutoCorrection Function (ACF), Average Magnitude Difference Function (AMDF), cepstrum etc. ACF is suitable for noise environment, but it is possible to produce the situation that period estimating results is double or half times of the actual results, AMDF and cepstrum can receive good detection results under silence environment or less noisy environment, but the decline of the result is fast under bad environment or low SNR environment and the result is difficult to be satisfacted. Therefore, we proposed a method which has good anti-noise performance--AutoCorrection Energy Function (ACEF) combined with Magnitude Difference Energy Function (MDEF), It improves the anti-noise performance, compensates the shortcomings of traditional pitch period detection method effectively.Next, The paper introduces the wavelet transform theory, including continuous wavelet transform, discrete wavelet transform, multi-resolution analysis, Mallat algorithm, Etc. This paper proposed a method of pitch period detection based on Mallat algorithm—wavelet decomposition and reconstruction algorithm (high frequency set 0) and trous algorithm which is derivated from Mallat algorithm. Mallat algorithm decompose speech signal directly, it needs to drop sampling, the length of each level of decomposition component is half of the length of decomposition component of the last level, but the trous algorithm interpolates to the filter coefficients directly, the length of each level of decomposition component is equal to the length of the original signal, it is conducive to pitch period extraction.This paper introduces Hilbert-Huang transform and applies it in pitch period detection, Comparing with traditional methods, Hilbert-Huang transform doesn't need to do assumption of short-term stationary for speech signal and has highly detection accuracy, widely application scope, The length of frame greatly increases. Comparing with wavelet transform, Hilbert-Huang transform decomposes signal according to signal' own information, changes with signal itself, it reflect the real physical information of the signal and has a better adaptability and superiority.In paper. Hilbert-Huang transform is applied in pitch mark of TD-PSOLA speech synthesis system, it expands the application scope of Hilbert-Huang transform. The experiment shows: The commonly used methods only can achieve an average pitch period of each frame, and then mark the pitch period by interpolation technology, the accuracy is not high. Marking pitch period by Hilbert-Huang transform can detect almost all the pitch peaks, reflect small changes in the frame, it has highly accuracy than ACF.
Keywords/Search Tags:pitch period detection, wavelet transform, Hilbert-Huang transform, speech synthesis
PDF Full Text Request
Related items