Research On Reconstruction Of Chinese Normal Speech From Whispered Speech Based On RBF Neural Network

Posted on:2010-11-13

Degree:Master

Type:Thesis

Country:China

Candidate:T Han

Full Text:PDF

GTID:2178360275459591

Subject:Detection Technology and Automation

Abstract/Summary:

PDF Full Text Request

Whispered speech is different from the normal speech in the pronunciation pattern. The previous researches on whispered speech are mainly about voice basic study or the needs of medical work. As the mobile communication products are used widely at present, whispers are used to avoid influencing others or being overheard. Because the phonation energy of whispers is low and its intelligibility is bad, it's difficult for people to communicate with each other clearly. The problem can be resolved if the mobile communication products have the ability of converting the whispered speech into the normal speech. Besides, this technology can also help aphonic individuals and the patients with pathological voice to communicate normally.The research of this paper is mainly about reconstruction of Chinese normal speech from whispered speech based on RBF neural network. Main work includes:Based on analyzing acoustic characteristics of whispered speech, study about the presented conversion method such as the linear prediction coding (LPC) and the homomorphic signal processing (HSP).But the intelligibility and quality of the converted speech are bad.As the characteristic parameters of the whispered speech are different from the normal speech, this paper this paper establishes the relationship of spectral mapping between whispered and normal speech by using RBF neural network, which will modify the Linear spectral pair(LSP) parameters.Firstly, detect the endpoint and the initial/final of the whispers, pretreat the LSP parameters of the whispered and the normal speech; secondly, generate the nonlinear mapping of spectral envelope from whispered speech to the normal speech by RBF neural network, where the LSP characteristic parameters of the whispered speech are the inputs of the RBF neural network, and the LSP characteristic parameters of normal speech are targets of the output; thirdly, transform the spectral of the whispered speech by using the trained RBF neural network, add the average pitch, and convert the speech; finally, connect the initial consonant of the whispers, export the normal speech.Both subjective and objective ones are conducted on the converted speech quality by the methods. Simulation results show that the score of the mean opinion score (MOS) is 3.5; the distorted distance of bark spectrum is decreased. Both intelligibility and quality of the converted speech are satisfied.At last, this paper raises the shortcomings of this method and the problems that haven't been solved, and gives the direction of further study and improving.

Keywords/Search Tags:

Reconstruction

PDF Full Text Request

Related items

1	The Research Of Audiovisual Reconstruction Under The Convergent News
2	Three Dimensional Vessel Tree Reconstruction By Multi-view Stereo Reconstruction
3	Research On Algorithm Of 3-Dimension Reconstruction Based On Multi-View Image Obtained From Monocular Camera
4	Rapid Non-Cartesian Reconstruction Algorithm For Magnetic Resonance Imaging
5	Scene Reconstruction From Internet Image Collections Base On Structure From Motion
6	Sequence Reconstruction Of Flexible Deformable Objects Based On RGBD Images
7	Studies On 3D Reconstruction From Image Sequences
8	Study On 3D Reconstruction Key Technology Of Unordered Multi-view Images
9	Toward Unsupervised 3D Reconstruction From Unorganized Multi-view Images
10	Research On Iterative Reconstruction Algorithms Of Computed Tomogoraphy