Research On Text-driven Visual Speech Synthesis Technology

Posted on:2011-09-10

Degree:Master

Type:Thesis

Country:China

Candidate:B Liu

Full Text:PDF

GTID:2178330332960643

Subject:Computer software and theory

Abstract/Summary:

With the rapid development of computer graphics and multimedia technology in recent years, highly realistic visual speech animation has become a research hotspot in computer science. It is widely used in deaf education, electronic affairs, film special effects, human-computer interfaces, medical surgery and other areas. This thesis focuses on text-driven visual speech synthesis. Its aim is to analyze the characteristics of the input text and to extract speech, facial expression and timing control information from it. Natural, speech-synchronized facial animation is then simulated using an improved 3D human face model.

Firstly, based on an in-depth analysis of existing 3D face modeling methods, the specialized facial model is improved to reduce the number of points and editable meshes, thereby decreasing computational complexity and easing the load on the system.

Secondly, by analyzing the physiological features of the human face, the facial muscles involved in speech are summarized and a muscle abstraction method is proposed. The method simulates the key parts of the face with a mesh model, in order to overcome two weaknesses of existing methods: defects in the deformation of the 3D face model, and mesh vertices that cannot be controlled.

Thirdly, based on a study of the characteristics of the input text, a method of embedding expression tags is put forward to provide expression and timing information for the subsequent synthesis of facial animation. In addition, the pronunciation duration of each word in the speech flow is estimated according to Mandarin pronunciation rules and used as the synchronization control condition for the visual speech animation. Moreover, the degree to which vowels and consonants influence each other's mouth shapes in continuous speech is analyzed and graded. The Chinese pronunciation model is improved accordingly and, supported by inter-frame blending for transitions, speech-synchronized facial animation is synthesized, realizing a text-driven visual speech synthesis system.

Finally, text-driven visual speech facial animation is realized through experiments on the fusion and transition of face and mouth-shape frames. The experiments show that the model has high practical value and can simulate changes in facial expression realistically and naturally.
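The text-analysis and transition steps described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis implementation. The tag syntax (`<happy>...</happy>`), the fixed per-character duration, and the function names `parse_tagged_text`, `schedule` and `blend_keyframes` are all assumptions made for illustration. The thesis itself estimates durations from Mandarin pronunciation rules and grades co-articulation effects, neither of which is reproduced here.

```python
import re
from typing import List, Tuple

# Hypothetical tag syntax: <name>text</name>. The thesis does not state its format.
TAG_RE = re.compile(r"<(\w+)>(.*?)</\1>|([^<]+)", re.S)


def parse_tagged_text(text: str, default: str = "neutral") -> List[Tuple[str, str]]:
    """Split input text into (expression, segment) pairs."""
    segments = []
    for match in TAG_RE.finditer(text):
        if match.group(1):
            # Tagged span: the tag name is taken as the expression label.
            segments.append((match.group(1), match.group(2)))
        elif match.group(3).strip():
            # Untagged text falls back to the default expression.
            segments.append((default, match.group(3).strip()))
    return segments


def schedule(segments: List[Tuple[str, str]],
             sec_per_syllable: float = 0.25) -> List[Tuple[str, str, float, float]]:
    """Assign start and end times, assuming one syllable per character.

    The constant duration is a placeholder assumption, not the rule-based
    duration estimate described in the abstract.
    """
    timeline = []
    t = 0.0
    for expression, segment in segments:
        syllables = sum(1 for ch in segment if not ch.isspace())
        end = t + syllables * sec_per_syllable
        timeline.append((expression, segment, t, end))
        t = end
    return timeline


def blend_keyframes(a: List[float], b: List[float], steps: int) -> List[List[float]]:
    """Linearly interpolate between two mouth-shape parameter vectors.

    Returns steps + 1 frames, including both endpoints.
    """
    if steps < 1:
        raise ValueError("steps must be at least 1")
    return [[x + (y - x) * i / steps for x, y in zip(a, b)]
            for i in range(steps + 1)]


if __name__ == "__main__":
    parsed = parse_tagged_text("你好<happy>世界</happy>")
    print(schedule(parsed))
    print(blend_keyframes([0.0, 1.0], [1.0, 0.0], 2))
```

Linear interpolation is used here only as the simplest possible transition between mouth-shape keyframes. A co-articulation model such as the one the abstract describes would weight neighbouring vowels and consonants differently.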

Keywords/Search Tags:

3D facial modeling, Text analysis, Co-articulation, Facial animation

Related items

1	An Automatic MPEG-4 Based Realistic 3D Facial Animation Method
2	Study Of Face Modeling And Animation Based On MPEG-4
3	Research And Implementation Of The Realistic Facial Animation
4	Research On Data-driven 3D Facial Animation
5	MPEG-4 Compatible Facial Speech Animation System And Its Applications In Network Communications
6	Realistic 3D Facial Synthesis
7	The Game Engine, 3D Face Modeling And Facial Animation And Realization
8	Realistic 3D Facial Expression Animation Design And Realization
9	Nonrigid motion modeling and analysis in video sequences for realistic facial animation
10	Cartoon Facial Animation Based On Video-driven Study