Notation Of Speaking Face Based On Video And Text Infomation

Posted on:2011-10-23

Degree:Master

Type:Thesis

Country:China

Candidate:G Z Liu

Full Text:PDF

GTID:2178330338489571

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The Speaking face detection based on video information, which means through lip-moving to judge who is speaking without audio information. The correlation technique is: the shot division of the video, the face detection and tracking, the lip location as well as the decision of lip-moving. Regarding name labeling, simultaneously needs the text information, and considered the characters of subtitles and the script, need to fuse the text information.About he fusion of the subtitles and script, the paper introduces a dynamic time warping algorithm, the algorithm uses the idea of fusion and is characterized by the word, and achieved good results. For the human face detection and tracking, taking into account the accuracy of detection, efficiency, and code reuse, this paper uses AdaBoost algorithm and MeanShift algorithm in OpenCV vision library, this combination of methods through the verification of the experiment, and achieved good results. And in this paper, we use this method in face sequence extraction.The detection of mouth has always been the content of lip-reading research field, this article will introduce it to the lips extraction in speaker detection process. Consider the methods proposed in the literature, we use lip color to extract the lip region, and do some improve in this paper, then extract more accuracy lip area. After tested and achieved good results.In previous studies, the use of the speaker's lip detection method, less complex than the lip-reading field, mostly just compute the difference between two image on the mouth region, and set a threshold to determine whether the lip is moving. This paper introduces a machine learning method, by extracting a variety of features in lip region, and training classifier to determine whether the lip is moving. The experiments prove the accuracy and robustness of this method.The method used in the literatures to detect speaking face is based on single frame, that is, people judge whether a face in a frame is speaking. But for the situation, in a face sequence just some of image in the sequence is lip-moving and this sequence isn't speaking, this method can not distinguish the difference. Therefore we propose this method which is based on the selected image sequence to determine if the sequence is speaking in a period of time. The proposed method is more realistic, and also achieved good results.

Keywords/Search Tags:

face detection and tracking, mouth location, lip-moving, dynamic time warping, speaking face detectio

PDF Full Text Request

Related items

1	Color-based Face Detection And Tracking Method
2	Research On Face Detection Based On Skin Color
3	Human Face Detection And Tracking Algorithm Based On Video Sequences
4	The Design Of Face Dynamic Detection And Real-Time Tracking System
5	Algorithms Research And Implementation For Face Detection System
6	Human Face Location And Track In Video Image Sequences
7	Research On Face Pose Estimation Based On Near-Infrared Images
8	Research On Multi-frame Based Deep Face Recognition In Videos
9	Real-time Face Detection And Tracking Based On Video Stream
10	Real-time Face Detection And Face Tracking In Embedded Environment