Research On The Technology Of Video Caption Detection, Localization And Segmentation

Posted on:2013-02-01

Degree:Master

Type:Thesis

Country:China

Candidate:T Ren

Full Text:PDF

GTID:2268330422474234

Subject:Electronic Science and Technology

Abstract/Summary:

PDF Full Text Request

Video captions contain large amounts of key objective information, and theyâ€™re themost direct description and explanation of the content of a video clip. So extractingcaptions from videos is the foundation of the video retrieval and comprehension.Generally, there are four stages in the process of video caption extracting: captiondetection, caption localization, image segmentation and OCR. The technology of OCRis highly developed, so this paper mainly studied the key algorithms about video captiondetection, localization and segmentation. The main works are as follow:In the video caption detection and localization stages, the main contributions are:(1)In order to detect the low-contrastive video captions from complicated background,we studied a caption detection algorithm of self-adapted thresholdâ€™s filtering based onlocal backgroundâ€™s complexity and of text area recovery based on the edges density. Itcan enforce the caption pixels and restrain the background pixels at the same time, sowe can locate the caption area more easily;(2)In order to overcome the shortcomings ofthe projecting and cutting algorithm in the stage of video caption localiztion, weproposed a new localizing algorithm based on template-matching in multi-scales. Ittakes full advantage of the texture density, texture difference and texture uniformity tosearch the candidate captions. By using integral image the calculation speed of thisalgorithm can be highly accelerated;(3)After video caption detecting and localizing, weadopted the muti-frame verification algorithm to obtain the precise caption based on theimage correlation peak judgment. The experiments show the good performance that wecan detect and localize the low contrastive video captions from complicated backgroundin multi-scales.In caption image segmentation stage, the main contributions are:(1)A OTSU andGaussian local adapted thresholding method was proposed to segment the video captionimge with low contrast. The experimental results confirm the good performance of theproposed method;(2)In order to overcome the shortcomings of the over-segmentatingand under-segmentating problems of the traditional segmentation methods, this paperproposed a method based on â€œwhite pixelsâ€ increment ratio. This method finds theoptimal threshold using a feed-back model by gradually changing the segmentingthreshold. In order to verify the performance of the proposed segmentation method,several experiments under different complex conditions were designed. The experiment results show that the proposed method can resolve the over-segmenting andunder-segmenting problems of the traditional segmentation methods such as the OTSUalgorithm and the K-means clustering algorithm.

Keywords/Search Tags:

video caption, extraction, detection, localization, segmentation

PDF Full Text Request

Related items

1	Study Of Video Caption Extraction Algorithm Based On Spatial-Temporal Information
2	Analysis, Based On The Detection And Extraction Of The News Video Subtitles
3	Video Caption Localization And Recognition
4	Research On Text Detection And Localization In News Video Frames
5	Research On News Video Caption Extraction Based On Corner Density Detection And Two Times Binary
6	Digital Video Subtitles In The Detection And Extraction
7	Research Of Video Text Location And Segmentation Method For News Caption Recognition
8	Study Of Methods For Video Caption Detection And Extraction
9	Overlaid Caption Extraction In News Video Based On SVM
10	News Video Story Unit Segmentation