In today’s information world, the Chinese information processing in various largefields has been widely used. This paper focuses on the maximum matching algorithmresearch and discussion based on the dictionary mechanism on the Chinese wordsegmentation algorithm. Because of the Chinese semantic complexity, a large number ofambiguity will appear after Chinese segmentation. In this paper the maximum matchingimproved algorithm is based on the analysis of the maximum matching algorithm so asto avoid ambiguous phrases of overlap type in the segmentation error and improved, inthe guarantee rate based on improved Chinesse segmentation accuracy.The Chinese word segmentation algorithm in this paper is based on avoidingoverlapping ambiguity string of the maximum matching algorithm. Firstly, this paperintroduces the current Chinese word segmentation algorithm, it include commonly usedin the word segmentation algorithm, commonly used the dictionary mechanism as wellas the theory about ambiguity to explain basic theories of the Chinese wordsegmentation algorithm; Secondly this paper describe the reverse dictionary mechanismbased on the Hash table according to the existing dictionary mechanism, the maximummatching improved algorithm,and the maximum matching improved algorithm foravoid ambiguous phrases of overlap type.This algorithm’s accuracy is improved basedon a range of segmentation rate.Finally, this paper achieved the maximun matching improved algorithm andexperiment, the results this algorithm has better performance improvement andpracticality.Because of the experiment dose not involve the problem of identification ofword, so the experimental results did not reach the standards of accuracy. |