Design And Implementation Of Document Information Analysis System

Posted on: 2014-01-06
Degree: Master
Type: Thesis
Country: China
Candidate: J L Man
Full Text: PDF
GTID: 2248330398950573
Subject: Software engineering
Abstract/Summary:
With the advent of the information age, information has become increasingly diverse. Researchers now have access to a wide variety of data, and academia has entered an era in which a hundred schools of thought contend. This abundance also creates difficulties: how to extract useful information from the wealth of data available on the network, how to predict development trends in one's own field, and how to expand one's area of research effectively.

To address these problems, major companies and research institutions in China and abroad have released bibliometric and literature information analysis tools, such as SPSS and Bibexcel. These tools have their own problems, however. Some are so complex to operate that learning them takes several days, which greatly reduces user productivity. Others offer too narrow a range of functions and must be combined with other software to complete a literature information analysis task. A typical analysis needs only a few functions, yet the user has to install several tools, which is inconvenient.

Based on a study of current literature information analysis software, this thesis designs a simple yet comprehensive literature information analysis system. It provides literature information management, word frequency statistics, co-word analysis, cluster analysis, and literature information query functions. Literature information can be converted into file formats recognized by the mainstream literature information analysis software. The system uses XML files as the literature information store, which saves space and is more convenient. The new Cilin is added to cluster analysis so that users can extend their analysis through keywords. Establishing the Cilin strengthens the links between disciplines and can also help stimulate the user's inspiration.

Data is transferred between systems through intermediate files. These intermediate files can also be used by other software, taking full advantage of other analysis tools, so the system can complete a literature analysis independently or work in conjunction with other analysis software. After continuous improvement and testing, the system implements all the functions specified in the requirements. It should also have some reference value for the development of similar analysis software.
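
As a rough illustration of the word frequency statistics and co-word analysis named in the abstract, the following minimal Python sketch reads keywords from an XML store and counts them. It is not the thesis's implementation: the XML layout (`records`, `record`, `keyword`), the sample data, and the function names `load_keywords`, `word_frequencies`, and `co_word_counts` are all assumptions made for this example, since the abstract does not describe the actual schema or algorithms.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from itertools import combinations

# Hypothetical XML layout and sample data; the abstract does not give the real schema.
SAMPLE_XML = """
<records>
  <record>
    <title>Paper A</title>
    <keyword>XML</keyword>
    <keyword>Cluster Analysis</keyword>
  </record>
  <record>
    <title>Paper B</title>
    <keyword>XML</keyword>
    <keyword>Co-Word Analysis</keyword>
    <keyword>Cluster Analysis</keyword>
  </record>
  <record>
    <title>Paper C</title>
    <keyword>XML</keyword>
  </record>
</records>
"""


def load_keywords(xml_text):
    """Return one list of keywords per <record> element."""
    root = ET.fromstring(xml_text)
    return [
        [kw.text.strip() for kw in record.findall("keyword") if kw.text]
        for record in root.findall("record")
    ]


def word_frequencies(keyword_lists):
    """Count the number of records in which each keyword appears."""
    counts = Counter()
    for keywords in keyword_lists:
        counts.update(set(keywords))  # count each keyword once per record
    return counts


def co_word_counts(keyword_lists):
    """Count how often each unordered keyword pair appears in the same record."""
    pairs = Counter()
    for keywords in keyword_lists:
        # Sorting makes (a, b) and (b, a) map to the same key.
        for a, b in combinations(sorted(set(keywords)), 2):
            pairs[(a, b)] += 1
    return pairs


if __name__ == "__main__":
    records = load_keywords(SAMPLE_XML)
    print(word_frequencies(records).most_common())
    print(co_word_counts(records).most_common())
```

The pair counts form a symmetric co-occurrence matrix, which is the usual input to a clustering step; how the thesis system performs clustering or uses the Cilin is not specified in the abstract, so it is left out of this sketch.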
Keywords/Search Tags: Word Frequency Statistics, Cluster Analysis, XML, Co-Word Analysis