Research And Implementation Of Speaker Lip Feature Extraction Algorithm

Posted on:2019-05-16

Degree:Master

Type:Thesis

Country:China

Candidate:S S Cui

Full Text:PDF

GTID:2428330566464628

Subject:Engineering·Computer Technology

Abstract/Summary:

The detection and extraction of lip features plays an important role in many applications based on intelligent information processing, such as audio-visual speech recognition, facial expression recognition and emotion detection. Existing lip feature extraction algorithms have several problems: a large amount of computation, too much manual intervention during processing, and low practicability. After analyzing and comparing current lip feature extraction schemes, and based on the way lip movement changes during pronunciation, a new scheme for speaker lip feature extraction was designed and constructed. The scheme was then studied in depth. The main work covers the following aspects:

(1) On the basis of analyzing and comparing existing lip feature extraction schemes, a new scheme for speaker lip feature extraction is designed and a processing model for it is built. The model consists of three sections: video information preprocessing, lip contour sequence generation and lip feature information extraction.

(2) For video information preprocessing, a preprocessing model is designed, and its function modules are designed and implemented, including video segmentation, face detection and lip region detection.

(3) For lip contour sequence generation, an automatic lip contour extraction method based on the YIQ chroma space is designed, and the lip contour sequence generation model is built on it. The function modules of the generation model are designed and implemented, including denoising and brightness filtering.

(4) For lip feature information extraction, the change of lip shape during pronunciation can be regarded as a sequence over the set of pronunciation video frames. The value of the speaker lip feature can therefore be obtained from the boundary distribution characteristics and the change sequence of the lip contour. A model for lip feature information extraction is designed on this basis, and its function modules are designed and implemented, including key frame selection and feature data acquisition.

(5) The proposed lip feature extraction scheme is tested in a monosyllable recognition application, and the results show that the scheme is usable and practical.
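The abstract does not give the details of the YIQ-based contour extraction or of the feature computation, so the following is only a minimal sketch under stated assumptions, not the thesis's method. It assumes that lip pixels can be separated from skin by thresholding the Q component, and that the width and height of the resulting lip region per frame serve as a simple stand-in for the contour's boundary features. The function names, the threshold value `0.05` and the toy frames are all hypothetical.

```python
import colorsys


def lip_mask(frame, q_threshold=0.05):
    """Mark pixels whose Q chroma exceeds q_threshold (assumed lip criterion).

    frame is a list of rows, each row a list of (r, g, b) tuples in 0..255.
    """
    mask = []
    for row in frame:
        mask_row = []
        for r, g, b in row:
            # colorsys expects floats in 0..1 and returns (y, i, q).
            _, _, q = colorsys.rgb_to_yiq(r / 255.0, g / 255.0, b / 255.0)
            mask_row.append(q > q_threshold)
        mask.append(mask_row)
    return mask


def mask_extent(mask):
    """Return (width, height) of the bounding box of marked pixels."""
    coords = [(x, y)
              for y, row in enumerate(mask)
              for x, marked in enumerate(row) if marked]
    if not coords:
        return (0, 0)
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return (max(xs) - min(xs) + 1, max(ys) - min(ys) + 1)


def lip_feature_sequence(frames, q_threshold=0.05):
    """Compute one (width, height) pair per frame, in frame order."""
    return [mask_extent(lip_mask(f, q_threshold)) for f in frames]


# Hypothetical toy data: a skin-toned background with a reddish lip patch.
SKIN = (220, 180, 150)
LIP = (200, 80, 100)

closed_mouth = [
    [SKIN, SKIN, SKIN, SKIN],
    [SKIN, LIP, LIP, SKIN],
    [SKIN, SKIN, SKIN, SKIN],
]
open_mouth = [
    [SKIN, SKIN, SKIN, SKIN],
    [SKIN, LIP, LIP, SKIN],
    [SKIN, LIP, LIP, SKIN],
]

features = lip_feature_sequence([closed_mouth, open_mouth])
print(features)
```

On real video, the threshold would need tuning, and the denoising and brightness filtering steps named in the abstract would run before any measurement is taken from the mask.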

Keywords/Search Tags:

Video, Speaker, Lip Feature, YIQ Chroma Space, Frame Screening

Related items

1	Research On Neural Network-Based Chroma Coding Methods
2	The Design Of Key Frame Extraction System Of Video Information
3	Research And Application Of Video Super-resolution Algorithm Based On Deep Learnin
4	Feature Screening of Ultrahigh Dimensional Feature Spaces With Applications in Interaction Screenin
5	The Research Of Key Frame Extraction Algorithm In Video Retrieval Technology
6	The Research Of The Key Frame Extraction Algorithm Of Content-based Video Retrieval
7	Research On Speaker Confirmation Based On SMFCC Feature And Factor Analysis
8	Projection On Speech Features Space Improves The Performance Of Speaker Identification
9	Research On The Video Tampering Detection Methods Based On The Inter-frame Correlation
10	Speaker Adaptation Techniques Research For Traffic Broadcast Audio Information Retrieval