There has been a rapid development in Chinese automatic abstracting in last 20 years. However, limitations still exist in automatic abstracting techniques, which represent as the non-completeness and high redundancy of the automatic abstraction.Specified study has been made in this paper for the correction of the limitations. At the beginning of the paper, a reverse maximum matching method based on the universal segmentation dictionary for the longest word-combination is proposed to modify the fine grained segmentation, followed with calculations and filter of term words. Then the weighting function of the sentence is summarized with the combination of other researchers' study and the text feature characters, which is applied in the sentence segmentation algorithm. An MMR equation has also designed based on the maximal marginal relevance theory. It is used in a new abstraction summarizing method in order to reduce the redundancy. In the end of the paper, a Chinese document automatic abstracting system is designed and implement. Experiments indicate that the automatic abstraction made by the system has a fine quality with completeness and low redundancy. |