Chinese Sign Language Synthesis Driven By Speech And Text

Posted on:2014-11-15

Degree:Master

Type:Thesis

Country:China

Candidate:S Wang

Full Text:PDF

GTID:2268330392973593

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

To help deaf people integrate into normal community better and improve theirweak position, a growing number of researchers are involved in the research of signlanguage. In recent years, a method by synthesizing sign language animation has beenproposed to facilitate the deaf understand and receive information through watchinganimations. To get more realistic and intelligibility sign language animation, prosodicinformation in sign language is necessary. As the same important role in theexpression of speech, prosody information can improve the expression ability of signlanguage and can provide additional function to help understand the content which thegesture maker wants to convey. When synthesizing of sign language animation,semantic text is not the only information which provides the necessary information forthe expression of sign language, but also it’s necessary to provide the prosodyinformation used for expressing the desired information in sign language. This articlestarts research from prosodic information in voice, and considers using the richprosodic information in the phonetic representation, and then maps the prosodyinformation to sign language prosodic parameters, at last sign language animationwith prosodic information can be got.In this paper, in order to extract the prosodic information in voice, basic acousticparameters such as syllable duration, sound intensity and pitch are chosen based onthe characteristics of Chinese prosody. Syllable duration is got by a method usingendpoint detection based on the existence of vowel in Chinese, then each syllableboundary can be divided; sound intensity is the short-time energy of every syllable;for the calculation of the pitch, it is estimated by using the cepstrum parameters. Inthis paper, based on these basic acoustic parameters, a feature vector is constructedout for identifying the emphasis pattern of prosody, and hence the hidden Markovmodels (HMM) can be trained. In order to improve the robustness of the models, aseries of relative values are used for forming the vector, it’s due to the process of thehuman ear perceiving stress. Prosody information can be got through the HMMs, andafter mapping the phonetic prosody to sign language prosody parameters it can bedirectly used to synthesize Chinese sign language animation. The mapping of prosodyparameters can be achieved by Chinese Sign Language Markup Language (CSLML)used for describing signs and prosody. Finally, the paper implements a system at theset-top box platform to achieve an accompanying television programs while playingChinese sign language animation that can effectively help deaf people watch TVshows and receive social information.

Keywords/Search Tags:

Chinese sign language, animation, speech, prosody

PDF Full Text Request

Related items

1	Research On 3D Visible Speech Animation Driven By Prosody Text
2	Synthesis Of Sign Language Animation Based On Uighur Text
3	Synthesis Of Chinese Sign Language Coarticulation Based On Key Frame Animation
4	Chinese Sign Language Synthesis Based On Multi-Clues
5	Research On Sign Language Recognition In Sign Language To Speech Conversion
6	Speech Based Gesture Generated System Study And Develop
7	Vietnam Chinese Language Conversion Technology Research
8	Prosody Extraction And Description Of Chinese Mandarin Continuous Speech
9	Perception of prosody in American Sign Language
10	Research On Virtual Human Sign Language Translation Technology Driven By Chinese Voice