A Research Of Natural English Character Recognition Based On Deep Learning

Posted on:2020-08-27

Degree:Master

Type:Thesis

Country:China

Candidate:R J Zou

Full Text:PDF

GTID:2428330596476074

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

As a common form of information in natural scenes,text often contains rich semantic feature information.This thesis studies the problem of English text recognition in natural scenes.Different from the document image used in the traditional OCR method,the text images collected in the natural scene often have difficulties such as variable fonts,irregular layout and complex environmental background,which introduces many uncertain factors for the recognition process.How to make computers more accurately extract features and recognize words has always been a hot issue for many scholars at home and abroad.With the continuous development and advancement of deep learning theory,its application fields have gradually expanded,and achieved good results in many directions such as image processing and forecast classification.Based on the deep learning theory,this thesis aims at the recognition of English characters in natural scenes and uses the accuracy and efficiency of text recognition as the standard.The main contents are as follows:(1)Combined with the idea of end-to-end recognition model,a set of English text recognition algorithm model based on CNN-BiGRU-Attention is designed in the framework of encoder-decoder.The recognition task is decomposed into two processes of coding and decoding.And then this thesis carries out research and design on each part of the algorithm.(2)In the coding process,on the basis of image preprocessing,combined with Jaderberg's convolutional neural network,a CNN model for image static feature extraction is presented.In order to further utilize the contextual information,a bidirectional recurrent neural network is introduced to extract the associated information.At the same time,in order to further reduce the computational complexity,a gated recurrent unit is adopted as its basic structure.Based on the extraction of static feature and associated feature,the context feature vector generation method is proposed,which realizes the feature coding of the original image.(3)In the decoding process,the recurrent neural network is used to decode according to the context feature vector.The attention mechanism is introduced here to enhance the application of context feature information,thereby improving the overall accuracy.Combined with the characteristics of the research data,the thesis optimizes the attention mechanism,proposes the local attention mechanism used in the decoding process,enhances the application of local correlation features,and reduces the overall calculation.In the post-processing stage,the cluster search algorithm is optimized by two methods based on the lexicon model and the n-gram language model,which further improves the accuracy of the decoding result.(4)In the process of model training,the methods of exponential decay,regularization and moving average model are introduced,and the algorithm model is further optimized from two aspects of efficiency and recognition accuracy.After finishing the model training,the efficiency,accuracy and generalization ability of the proposed algorithm are tested from several aspects.

Keywords/Search Tags:

natural scene image, English text recognition, convolutional neural network, recurrent neural network, attention mechanism

PDF Full Text Request

Related items

1	Research On Text Detection And Recognition In Natural Scene Based On Deep Neural Network
2	Research On Chinese And English Text Recognition Algorithms In Natural Scenes Based On Attention Mechanism
3	Convolutional Recurrent Network For Offline Handwritten Text And Scene Text Recognition
4	Text Classification Research Based On Deep Neural Network And Attention Mechanism
5	Design And Implementation Of Chinese Character Recognition Model Based On Deep Neural Network
6	Research On Image Description Method Based On Multimodal Recurrent Neural Networks
7	Research Of Text Sentiment Analysis Methods Based On Neural Network
8	Research On Scene-based Image Semantic Description Generation Technology
9	Research On Speech Emotion Recognition Based On Convolutional Recurrent Neural Network
10	Research On Text Detection And Recognition In Natural Scenes Based On Deep Learning