Research On Text Location And Recognition In Natural Scenes Of Image

Posted on:2017-07-06

Degree:Master

Type:Thesis

Country:China

Candidate:Z D Chao

Full Text:PDF

GTID:2348330491459931

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Detecting and recognizing text in natural scenes, such as sign boards on streets and buildings, traffic signs, movie marques and so on, is a core part of computer vision applications, roboticsand text reading programs for visually impaired person. Recently, with the rapid development of computer vision and robot vision and artificial intelligence, how to extract text information from scene images with high efficiencyhas become one of the hot research topics. Detecting and recognizing text includes three steps:character detection, text line detection and text recognition. On the basis of studying and summarizing existing excellent algorithms and according to the characteristics of text in natural scenes, in this thesis, text detection, text line detection and text recognition under complex scenes are deeply studied. The main contents are as follows:(1)Two kinds of character region detection methods are designed for different application scenarios.In the condition of image blur, unven illumination and low light, etc., the edge portion of text will have a serious distortion, so that it is difficult for existing text detection algorithms to achieve desired results. As to solve this problem,a character detection method byusing binarization is proposed, which usesmultistage binarizationandtwo-pass connected-component labeling algorithms to detect candidate character region sin three channelsof image and gray image.Considering that compared to background texts always have a clear outline in most natural scenes.In order to improve the detection speed of this condition, a candidate character regions locating method based on image layeringis proposed. The image is viewed as a tree model which is stacked by a series of components, and character areasare detected by component detection. And part of non character areasare removed based on the structure characteristic of the image tree.(2) In order to find candidate text lines in character regions set and to solve the problem of detecting text lines with random directions, a text line detection method based on graph model is proposed. The graph model is constructed by the relation of regional spatial distribution information and geometric information, and the search path is prunedbased on graph search strategy. In order to remove non text lines, the joint characteristics of regions in the candidate text lines are extracted, andoverall features with adaptive adjustment according to the inclination direction are extracted. The result proves that the method performs well for text lines with random directions.(3) The texts are recognized by the method combining Tesscract OCR and Google spell checking. Due to the complexity of the text in the natural scene, it is difficult for Tesseract OCR to obtainideal result. In order to solve this problem, firstly, textimages are corrected before using Tesseract OCR, and obstacles in the background are removed by combining with the color distribution offoregroundand boundingbox of text region. After that tilt correction and shear correction are operated on these images. Then the recognization results are modified by Google spell checking. The result shows that the method improves the recognition accuracy.

Keywords/Search Tags:

Character Detection, Multistage Binarization, Image Layering, Text-line Detection, Text Recognition

PDF Full Text Request

Related items

1	Research On Character Level Chinese Scene Text Detection And Recognition Based On Deep Learning
2	Character Recognition Of Complex Background Color Images
3	Video Text Extraction Technology Research And Application
4	Research On Text Detection And Recognition Algorithm For Shopping Ticket Image
5	Research And Implementation Of Key Technologies For Image-based Book Spine Detection And Recognition
6	Research On The Text Detection And Extraction From Complex Images
7	Text Detection And Extraction Technique In Video Frame
8	Research On Text Extraction From Image With Complex Background
9	Color Image And Video Text Extraction Algorithm
10	Research On Text Detection Technology Of Natural Scene Based On Link Line