Research On Scene Text Detection And Recognition Based On Deep Learning

Posted on:2022-03-28

Degree:Master

Type:Thesis

Country:China

Candidate:S Chen

Full Text:PDF

GTID:2518306314981339

Subject:Control Engineering

Abstract/Summary:

PDF Full Text Request

Text in natural scene images contains important high-level semantic information,which is helpful to analyze and understand the corresponding environment.In recent years,Scene Text Recognition(STR)has become an active research topic in computer vision.Although the traditional Optical Character Recognition(OCR)method has a high recognition rate for characters in fixed documents,it cannot be applied to the actual scene image text recognition.The rapid development of Deep Learning(DL)provides more research schemes for scene text detection and recognition.However,due to the characteristics of different text shapes,changeable angles and complex background information,the research on text detection and recognition in natural scenes still faces enormous challenges.Based on deep learning technology,this paper proposes a novel algorithm for text detection and recognition in natural scenes.The specific work are as follow:1.To solve the problem of complex scene text detection,an end-to-end text detection method STDP(Scene Text Detection Perception)is designed,and the text detection method based on semantic segmentation is used to learn the reading order and boundary information of potential text.The regression strategy is adopted to adjust the wrong detection region to solve the problem of regional deviation in text detection.A new Text Shape Transformation Module(TSTM)is designed for irregular texts with feature regions,which can transform the detected feature regions into regular shapes without additional parameters,and can better solve the incompatibility between text detection and recognition,and improve the recognition efficiency and accuracy.2.To solve the problem of natural scene text recognition,a new scene text recognition method is designed on the basis of Convolutional Recurrent Neural Network(CRNN),which combines attention mechanism with connectionist temporal classification,(CTC)can solve the problems of text information loss and slow training speed caused by single decoding,and realize the rapid convergence of the network through joint decoding training.3.In this paper,the TSTM module is used to connect the detection method with the recognition method,forming an end-to-end detection and recognition network,which is verified and analyzed on different data sets.Experiments show that the accuracy of the text recognition algorithm ACTR proposed in this paper reaches98.89%,which greatly surpasses the single decoding recognition algorithm.Compared with the latest end-to-end detection algorithm,the proposed end-to-end text detection method has a significant improvement in detection index.

Keywords/Search Tags:

deep learning, sequential semantic segmentation, Text Shape Transformation, code and decode, end to end approach

PDF Full Text Request

Related items

1	Deep Learning Based Image Semantic Segmentation And Its Application
2	Research On 3D Model Semantic Extraction Technology Based On Deep Learning
3	Research On 3D Point Cloud Shape Classification And Semantic Segmentation Based On Attention And Deep Networks
4	Research And Implementation Of Satellite Image Semantic Segmentation Technology Based On Deep Learning
5	3D Shape Analysis Based Supervised Learning
6	Research On Image Semantic Segmentation Based On Deep Learning
7	Research On Text Semantic Matching Based On Deep Learning
8	Research And Realization On Technology Of Image Semantic Segmentation Based On Deep Learning
9	Algorithm Research Of Image Semantic Segmentation Based On Deep Learning
10	Shape Modeling And Image Segmentation Method Based On Convolutional Deep Learning Model