Caption Text Detection And Extraction In Video

Posted on:2009-07-14

Degree:Doctor

Type:Dissertation

Country:China

Candidate:J C Zhou

Full Text:PDF

GTID:1118360242991175

Subject:Pattern Recognition and Intelligent Systems

Abstract/Summary:

PDF Full Text Request

Nowadays the amount of digital videos increases explosively, and consequentlyit's a valuable research for computer to comprehend these multimedia documents,extract semantic information and boost some applications, such as video manage-ment, information retrieval and data mining. Caption texts embedded in videosare highly related to the video content and easier to extract than other semanticfeatures. They serve as a important clue for video content comprehension.We aim to design a prototype system and extract video text information.According to the proceeding sequence, it includes pretreatment, text detection,extraction and recognition. Our research focuses on the former three parts.1. Pretreatment includes video codec, image quality assessment and systeminitialization. In real application the system need to process videos with di?erentimage quality, and the proceeding method should be changed accordingly. In thispaper we propose a no-reference image quality assessment method. Firstly, ex-tract features from amplitude fall-o? curves and positional similarity on the imagefollowing natural scene statistics, and build feature vector accordingly. Secondly,train general regression neural network to predict image quality.2. Text detection is conducted to locate text boxes in frame images. Wepropose a fast and e?ective method. Firstly, calculate the edge image of a frameand revise it due to the phenomena of broken and conglutination. Secondly,label all CCs(connected components) on the edge image, filter out those frombackground partly, sort remained CCs by their position, search correspondingCCs and build text boxes depending on the geometric constraint. At last, conducttext box amalgamation to eliminate reduplicate detection results and text boxverification to eliminate false alarms.3. Text extraction is conducted to extract text strokes from text boxes.In text boxes, the color of characters can not be determined in advance andmany disturbances form background exist, so the procedure of text extraction isneeded. We propose a robust extraction method. Firstly, binarize the candidatetext box and conduct polarity estimation to determine on which polarity the textoccurs. Secondly, perform multi-frame verification and enhancement on text boxesemploying their temporal redundancy in video. At last, binarize candidate textboxes, execute CC filtering to remove disturbance from background, and generateclear binary image for recognition.We describe the experiment data set and results. They confirm the e?ective-ness and e?ciency of our methods.

Keywords/Search Tags:

video text detection, video text extraction, image quality assessment

PDF Full Text Request

Related items

1	Research On Video Text Extraction And The Application In Virtual Karaoke
2	Reasearch On Video Text Information Extraction Based On Features Integration
3	Research On The Technology Of Video Text Information Extraction
4	Research On Text And Specific Object Detection Algorithm In Images And Videos
5	Research On Video OCR
6	Text Extraction In Video
7	The Research On The Method Of High-Definition Video Text Extraction And Recognition
8	Image/Video Text Extraction And Its Application
9	Text Extraction On Video And Its Application
10	Research On Text Detection In Images And Video Frames