Research On Text Detection And Recognition Technology In Natural Scene Image

Posted on:2020-10-05

Degree:Master

Type:Thesis

Country:China

Candidate:P F Zhou

Full Text:PDF

GTID:2428330596479696

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The text in the natural scene image contains rich and accurate high-level semantic information,which has guiding significance for blind obstacle navigation system,intelligent urban traffic management system,automobile driverless system and instant translation system.Therefore,it is of great research value to locate and identify the text in the natural scene image.This paper has conducted in-depth research on text localization and recognition in natural scenes.The specific work is as follows:This paper has conducted in-depth research on text localization and recognition in natural scenes.The specific work is as follows:1.The traditional MSER algorithm is sensitive to illumination and text missed detection occurs when text candidate regions are extracted on a single grayscale channel.To solve this problem,this paper proposes an MSER algorithm based on multi-channel illumination equalization.Firstly,the images are illuminated and equalized under R,G and B channels respectively;Then,the MSER region of the text character is extracted by the MSER detection operator under the corresponding channel;Finally,the MSER region of each channel is merged as the character candidate region.After experimental verification,the improved algorithm can detect relatively complete character regions for images with uneven illumination or different complex backgrounds,which improves the recall rate of the algorithm.2.The traditional MSER algorithm will cause false detection in scene text detection with complex background.To solve this problem,this paper proposes a pseudo-character region filtering algorithm based on multi-feature fusion.Firstly,the HOG feature,the LBP feature,and the CNN feature are respectively extracted for the character candidate region;and then the three features are serially fused;Finally,a character discriminator is trained by the SVM to filter the pseudo-character region.It is verified by experiments that this algorithm can eliminate more pseudo-character regions and improve the accuracy of the algorithm.3.The sliding convolution character model is based on the recognition of character classification.It only focuses on the depth characteristics of characters,but ignores the context relationship between characters in the text line,which greatly reduces the recognition accuracy of the algorithm.To solve this problem,this paper deeply studies the sliding convolution character model,and introduces the bidirectional LSTM network to carry out scene text recognition.First,remove the classification of the sliding convolution character model layer;then use the CNN sliding window to extract the characteristics of the input image sequences,the output sequence characteristics of the input to the design good two-way LSTM network to extract context features of each character;and finally,using the CTC transcriptional mechanisms transcribes LSTM output prediction for the actual string.Compared with other algorithms,the proposed algorithm can improve the recognition accuracy significantly.

Keywords/Search Tags:

MSER algorithm, SVM classifier, Sliding convolution character model, Bidirectional LSTM

PDF Full Text Request

Related items

1	Research And Application Of Natural Scene Text Detection Algorithm
2	Research And Application Of Bidirectional LSTM Based On Attention In Text Title Generation
3	Research On Image Chinese Title Generation Method Based On Bidirectional LSTM
4	Bidirectional Decision Support System For Public Events In Professional Domains Based On Improved RNN-LSTM
5	Research On Text Classification Method Based On Bidirectional LSTM
6	Research On Tibetan Word Spelling Check Technology Based On Bidirectional LSTM
7	Chinese Named Entity Recognition Based On Bidirectional LSTM-CRF Model
8	The theoretical foundation of the MSER algorithm
9	Chinese Character Recognition And Its Application In LPR System
10	Research Of Review Aspect Level Sentiment Analysis Based On Hierarchical Bi-LSTM Model