Based On The Theme By The Chinese Single-document Summarization System

Posted on:2010-03-14

Degree:Master

Type:Thesis

Country:China

Candidate:Y H Zhang

Full Text:PDF

GTID:2208330332478204

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Automatic abstracting is an important application in natural language processing area, which is also a difficult and challenging job. It has been widely used in information retrieval, information management, and digital library fields. So it will be of great theoretic value and practical significance for the research of automatic abstracting.Automatic abstracting based on statistics is an early researched and widely used method. The most advantage of this method is non-restrict of fields, and texts of different areas can use this method to get abstract. But the method has the shortcomings of incomprehensiveness, not conciseness and incoherence, which made the result of abstract can not perfectly.Based on the statistics automatic abstracting, this paper makes the two technologies of topic partition and abstract optimization into statistics automatic abstracting, which made the abstract more comprehensiveness, conciseness and coherence. The specific research contents as follows:1. Improved k-means algorithm is raised to divide the topics of text, which made the abstract more completely.2. Optimize the abstract based on the rough abstract, which made the abstract more concise and coherent.3. Design a Chinese single-document automatic abstracting system prototype based on the two steps above.Use intrinsic method to valuate the system, which include compare the system with the ideal abstract and compare the system with statistical automatic abstracting system and word2003 automatic abstracting system, and the result shows that the system is better than the other two systems.

Keywords/Search Tags:

Automatic Abstracting, Topic Partition, K-means Algorithm, Abstract Generation, Abstract optimization

PDF Full Text Request

Related items

1	Research Of Some Problems On Chinese Automatic Abstract Technology
2	Research And Implementation Of Abstract Automatic Generation Algorithm Based On Gensim
3	Research And Implementation Of Automatic Generation Method Of Internet News Abstract
4	Research And Implementation Of Automatic Abstract Generation System Based On Deep Learning
5	Research On Generation Of Invariant Based On Abstract Interpretation
6	Automatic Abstract Algorithm Research And Implementation Of The Post Processing
7	The Application Of Automatic Extraction Technology Of Text Abstract In Digital Printing
8	Research And Application Of Abstract Technology And Query Behavior Analysis In Search Engine Of Universities
9	Research On The Scope Of Program Variables Based On Abstract Interpretation
10	Research And Implementation Of News Web Abstract Algorithm