
Study On Pre-processing Of Printed Chinese Character Recognition

Posted on: 2009-06-14
Degree: Master
Type: Thesis
Country: China
Candidate: H Wang
Full Text: PDF
GTID: 2178360272970677
Subject: Signal and Information Processing
Abstract/Summary:
Printed Chinese character recognition is an important subject within Chinese character recognition, drawing on both image processing and pattern recognition. In the era of intelligent information and the internet, it will be widely applied to identity recognition, Chinese information processing, office automation (OA) and so on, so it has both practical value and theoretical significance. Pre-processing is one of the important parts of a printed Chinese character recognition system, and its result directly influences the recognition rate of the system. Research on pre-processing for Chinese character recognition is therefore valuable for applications and meaningful for theory.

The main work of this thesis is a study of pre-processing for printed Chinese character recognition, together with an implementation that pre-processes text images automatically. The work is as follows.

First, the quality of document images captured with a digital camera is very poor because of uneven illumination, noise and other factors, so the pre-processing procedure is modified. After the entire document is binarized, each single character is binarized once again. Because a single character covers a smaller area and its gray levels are more even, this second pass gives a better result. Experimental results show that noise on the character strokes is reduced and the character information is fully preserved.

Second, an improved Chinese character thinning method based on basic mathematical morphology is presented. With a newly proposed group of structuring element sequences, the skeleton of the original Chinese character is preserved very well, the strokes are smoother, and fork points are handled better. Based on this thinning result, a new stroke extraction algorithm for Chinese characters is defined. The extracted strokes retain good connectivity, especially right-diagonal and left-diagonal strokes.

Third, a font identification technique based on multi-scale non-redundant wavelet texture analysis is discussed, in which the wavelet energy and the proportion of the wavelet energy are extracted as features. Experiments were performed on the recognition of six fonts: Song, Hei, Kai, FangSong, LiShu and YouYuan.

In addition, the main steps of pre-processing for printed Chinese character recognition are presented, for example skew angle detection and correction, layout analysis, and Chinese character segmentation. These steps gave effective results.
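The two-pass binarization described in the first point can be illustrated with a minimal sketch. The abstract does not say which thresholding method the thesis uses, so Otsu's method is assumed here purely for illustration. The function names and the `char_boxes` parameter (character bounding boxes as `(top, left, bottom, right)`, taken as already known from a segmentation step) are hypothetical and not from the thesis.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0-255) maximizing between-class variance.

    Pixels <= t are treated as ink (dark), pixels > t as background.
    Otsu's method is an assumption; the thesis does not name its method.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the dark class so far
    sum0 = 0    # sum of gray levels in the dark class so far
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2   # unnormalized between-class variance
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def two_pass_binarize(gray, char_boxes):
    """Binarize the whole page, then re-binarize each character region.

    gray       -- list of rows of gray levels in 0..255
    char_boxes -- hypothetical list of (top, left, bottom, right) boxes,
                  end-exclusive, assumed to come from character segmentation
    Returns a same-sized grid with 1 for ink and 0 for background.
    """
    # Pass 1: one global threshold for the entire document image.
    t_global = otsu_threshold([p for row in gray for p in row])
    ink = [[1 if p <= t_global else 0 for p in row] for row in gray]

    # Pass 2: a separate threshold inside each character box, where the
    # area is small and the illumination is closer to uniform.
    for top, left, bottom, right in char_boxes:
        region = [gray[r][c] for r in range(top, bottom)
                  for c in range(left, right)]
        if min(region) == max(region):
            continue  # flat region: nothing to separate, keep pass-1 result
        t_local = otsu_threshold(region)
        for r in range(top, bottom):
            for c in range(left, right):
                ink[r][c] = 1 if gray[r][c] <= t_local else 0
    return ink
```

On a page whose left half is brightly lit and whose right half is dim, a single global threshold tends to mark the dim background as ink. The per-character pass corrects this because each box only has to separate its own stroke from its own background.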
Keywords/Search Tags: Printed Chinese Character Recognition, Pre-processing, Binarization, Chinese Character Thinning, Font Identification