Techniques For Text Extraction In Videos

Posted on:2015-08-16

Degree:Master

Type:Thesis

Country:China

Candidate:Y Dong

Full Text:PDF

GTID:2298330452958997

Subject:Information and Communication Engineering

Abstract/Summary:

To some extent, the high-level semantic information reflects the content of the video,and the text embed in videos contains a wealth of high-level semantic information. Iftext can be detected, segmented and recognized automatically, there will be a bigimprovement in high-level semantic information understanding, indexing andretrieval. Text extraction system is normally composed by four parts, namely, textevent detection, text area localization, text segmentation and character recognition.This paper concentrates on the algorithm research of text area localization andtext segmentation. For text localization, two methods are proposed. The first one isbased on wavelet transformation, which applies corner response image and combinedhigh-frequency sub band. And the statistical characteristic is extracted as the featurevector for K-means cluster. Then, some heuristics are employed to remove the falsepositives. This approach utilizes the classification of unsupervised learning method,therefore, it is able to avoid sample training, which is time-cosuming. The secondalgorithm is based on Gabor transformation, and it aims at Chinese text detection andlocalization. The strokes of most Chinese characters are oriented in four directions, soGabor with various scales and directions can describe it well. The cluster result fordifferent scales of Gabor transformation is combined to obtain the location of text.Experimental results show that the proposed method is robust even in low-contrastcondition.For text segmentation, in order to obtain a better recognition result from OCRsoftware, text segmentation part differentiates the text pixels from the backgroundpixels. A color space based segmentation approach is proposed, which utilizes theclassical OTSU algorithm and RGB space to find the initial text pixels. And thenK-means cluster is applied to find the final text pixels from the initial one. The binaryImage can describe the text pixels well. Experimental results show that our textsegmentation technique outperform other classical Threshold segmentation algorithm.

Keywords/Search Tags:

Text Localization, Text segmentation, Wavelet Transform, GaborTransform

Related items

1	The Research Of Video Text Extraction Under Complex Background
2	Research On Text Location And Segmentation Technique In Image
3	The Complex Background Of Video Text Localization And Segmentation
4	Learning-Based Text Extraction In Natural Background
5	Research On Text Extraction Technology In Video
6	Research On The Technology Of Video Text Information Extraction
7	Researches On Text Localization Based On Multiscale Geometric Analysis
8	Research On Text Recognition In Natural Scene And Realization
9	Research On Text Localization Algorithm In The Image
10	Research On Text Extraction From Image With Complex Background