| During the last few decades, automatic mail sorting systems have been widely deployed around China, in which the technology has made progress from postcode recognition to address recognition. However, Recognizing Chinese address is still error-prone, especially when the images are low-quality and contain lots of noise. The errors can be categorized to three categories:1) Character segmentation is right, while character recognition result is wrong; 2) Wrong character segmentations result in false recognition; 3) Confusion of Chinese characters and numeral characters result in wrong output.Different recognizers may result in different mistakes when they are used to recognize a Chinese address. In this paper, we present a method of combining multiple Chinese address recognition outputs to improve Chinese address recognition accuracy. The proposed method consists of three steps:1) Align the recognizer outputs based on minimum edit distance and retrieve optimal pairwise alignment, then extend the pairwise alignment to multiple alignment.2) Build the lattice of candidate hypotheses based on the optimal multiple alignment.3) Use the Viterbi algorithm to select the maximum likelihood candidate sequence based on statistical language model.We evaluate our method on two real data sets which are Chinese printed envelope images captured from sorting machine. The first data set (SRI 1) consists of 1651 images (composed of 30,014 characters) which are segmented address images from envelop images. For the SRI1 data set, we test the performance of character recognition. The second data set (SRI2) consists of 3071 images which are Chinese mail images and the address area are not pre-segmented. For the SRI2 data set, we test the performance of the whole address on automatic mail sorting system. The results of the experiments show that the performance of our method is superior, compared to the single recognizers and Miyao’s method. |