
Research And Implementation On Automatic Construction Of Complex Network Based On The Technology Of Information Extraction

Posted on: 2010-12-23
Degree: Master
Type: Thesis
Country: China
Candidate: F Zhou
GTID: 2178360278966010
Subject: Computer Science and Technology
Abstract/Summary:
Complex networks offer a new perspective on the study of complexity, and comparing, analyzing, and summarizing different kinds of complex networks has become a hotspot of scientific research. As the Internet develops, the amount of unstructured and semi-structured information keeps growing, so complex network analysis based on this information is an inevitable trend, and Information Extraction technology plays an increasingly important role. By combining Information Extraction with complex networks, we can extract the vertex and edge information that provides the basic data for constructing a complex network, which greatly expands the range of complex network applications. The integration of Information Extraction and complex networks is likely to become a new focus of research and application.

In this thesis, working with Chinese text data, we carry out an in-depth analysis of Information Extraction and related technologies and, driven by the practical needs of complex network construction, achieve the following results:

1. Automatic extraction of vertexes. We study in depth several commonly used statistical models for entity recognition and apply them to the core algorithm for vertex extraction. This includes using an HMM for Chinese word segmentation and part-of-speech tagging and using an MEMM for entity recognition, all of which achieved good results.

2. Automatic extraction of edges. After an extensive review of the domestic and international literature, we summarize the most commonly used entity relationship extraction methods and then, according to the specific needs of this work, propose a new entity relationship extraction method that is accurate, flexible, and practical.

3. Automatic construction of complex networks. Based on the implementation of the vertex and edge extraction algorithms, we build a prototype system for the automatic construction of complex networks. The system has a friendly interface, good natural language processing capability, a flexible software architecture, and a vivid display of results.

Finally, the research and implementation work on complex network construction is summarized, and prospects and future directions are discussed.
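
The construction step described above, turning extracted vertexes and edges into a network, can be illustrated with a minimal Python sketch. It is not the thesis's method: the abstract does not describe the proposed relation extraction algorithm, so sentence-level co-occurrence is swapped in as a simple stand-in for edge extraction. The function names `build_network` and `degrees` and the sample entities are hypothetical, and the entity lists are assumed to come from an upstream recognizer.

```python
from collections import defaultdict
from itertools import combinations


def build_network(sentences):
    """Build an undirected weighted graph from per-sentence entity lists.

    `sentences` is a list of lists of entity strings (assumed output of an
    upstream entity recognizer). Two entities are linked if they appear in
    the same sentence; the edge weight counts how many sentences they share.
    Co-occurrence is an illustrative assumption, not the thesis's method.
    """
    graph = defaultdict(lambda: defaultdict(int))
    for entities in sentences:
        unique = sorted(set(entities))
        for name in unique:
            graph[name]  # touch the key so isolated vertexes are kept
        for a, b in combinations(unique, 2):
            graph[a][b] += 1
            graph[b][a] += 1
    # Convert nested defaultdicts to plain dicts for predictable behavior.
    return {vertex: dict(neighbors) for vertex, neighbors in graph.items()}


def degrees(graph):
    """Return the number of distinct neighbors of each vertex."""
    return {vertex: len(neighbors) for vertex, neighbors in graph.items()}


# Hypothetical sample input: one entity list per sentence.
sample = [
    ["Alice", "Acme Corp"],
    ["Alice", "Bob", "Acme Corp"],
    ["Carol"],
]
network = build_network(sample)
```

A plain adjacency dictionary keeps the sketch dependency-free; a real system would more likely hand the vertex and edge lists to a graph library for analysis and display.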
Keywords/Search Tags:Complex Network, Information Extraction, Entity Extraction, Entity Relation Extraction