
Hidden Markov models for visual speech synthesis in limited data environments

Posted on: 2002-05-19
Degree: Ph.D
Type: Thesis
University: Air Force Institute of Technology
Candidate: Arb, Harold Allan
Full Text: PDF
GTID: 2468390011992420
Subject: Engineering
Abstract/Summary:
This research presents a new approach for estimating the control points used in visual speech synthesis. First, Hidden Markov Models (HMMs) are estimated for each viseme present in stored video data. Second, models are generated for each triseme (a viseme together with the preceding and following visemes) in the training set. Next, a decision tree clusters and relates HMM states that are similar in a contextual and statistical sense; the tree also estimates HMMs for trisemes not present in the stored video data. Finally, the HMMs are used to generate sequences of visual speech control points for trisemes not occurring in the stored data. Statistical analysis indicates that the mean squared error between the desired and estimated control point locations is lowest when the process uses HMMs trained with short-duration dynamic features, a high log-likelihood threshold, and a low outlier threshold. Comparisons between mouth shapes generated from the synthesized control points and those built from control points estimated on video withheld from HMM training indicate that the process estimates accurate control points. The research presented here thus establishes a practical method for improving the quality of audio-driven visual speech synthesis.
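The pipeline in the abstract, training one HMM per viseme on control-point trajectories and then generating new trajectories from the trained models, can be illustrated with a short sketch. This is not the thesis code: it assumes the hmmlearn library, synthetic stand-in trajectories, and hypothetical names (make_dummy_trajectories, viseme_data); the decision-tree clustering and triseme-context steps are omitted.

```python
# Minimal sketch of per-viseme HMM training and control-point synthesis,
# assuming hmmlearn and synthetic data in place of the stored video features.
import numpy as np
from hmmlearn import hmm

rng = np.random.default_rng(0)

# Hypothetical training data: for each viseme, a list of control-point
# trajectories, each an array of shape (frames, 2 * n_control_points).
def make_dummy_trajectories(n_seqs=20, frames=30, dim=8):
    return [rng.normal(size=(frames, dim)).cumsum(axis=0) for _ in range(n_seqs)]

viseme_data = {"AA": make_dummy_trajectories(), "M": make_dummy_trajectories()}

# Step 1: estimate one Gaussian HMM per viseme from its trajectories.
viseme_hmms = {}
for viseme, seqs in viseme_data.items():
    X = np.vstack(seqs)                # stack the frames of every sequence
    lengths = [len(s) for s in seqs]   # per-sequence frame counts for fit()
    model = hmm.GaussianHMM(n_components=3, covariance_type="diag", n_iter=50)
    model.fit(X, lengths)
    viseme_hmms[viseme] = model

# Generation step (sketch): sample a control-point sequence from a model.
synth, _ = viseme_hmms["AA"].sample(30)

# Evaluation as described in the abstract: mean squared error between the
# desired and the estimated control-point locations.
desired = viseme_data["AA"][0][:30]
mse = np.mean((synth - desired) ** 2)
print(f"MSE between desired and synthesized control points: {mse:.3f}")
```

In the thesis, the triseme models and the decision tree would sit between training and generation, so that unseen trisemes map to clustered states rather than being sampled from a context-free viseme model as above.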
Keywords/Search Tags: Visual speech synthesis, Control points, Hidden Markov models, Stored video data