Font Size: a A A

Study And Implementation Of Web Information Gathering And Data Statistics Technology

Posted on:2011-04-26Degree:MasterType:Thesis
Country:ChinaCandidate:Z C LinFull Text:PDF
GTID:2178360308964808Subject:Software engineering
Abstract/Summary:PDF Full Text Request
In recent years, with the rapid development of Web technology. Internet resourcesbecome more and more abundant. In order to obtain useful information on the vast Internetfor people, various types of information retrieval services on Internet came into being, and hasbeen developed rapidly. We research on data gathering and data statistics domain based on asearch system for farm produces. Analysis the main problems that information gathering anddata statistics domain facing recently. Design and realize a information gathering module anda data statistics module for the system. Improve the accuracy of information searching. Addnew functions for obtain the statistics and graphics of related information.Because of the diversity and heterogeneity of Web information. We promote needs forinformation gathering. Information gathering, including information extraction andinformation integration. This paper analyzes the main problems of information gatheringcurrently. Due to the characteristics of the system, propose a template based informationextraction scheme and a global data structure based information integration scheme. Solve theproblems of multi data source information gathering. Provide a unified interface to a globalsearch for users.Vertical search engine, it builds to provide information of an industry or a topic for users.However, most of the information collected from the Web are simply descriptive information.For the special field practitioners or researchers, this descriptive information may not be ableto meet their needs. More often, they also need some statistically data. In this paper, weanalyze the data duplication and data missing during data statistics. According to the needs ofusers, we design and implement the data statistics module for the system. The module allowsusers to access seven important statistics and two charts of agricultural information. Users canalso set three conditions of varieties, regions and date to control the scope of statistical data.
Keywords/Search Tags:information gathering, information extraction, information integration, data statistics, data missing
PDF Full Text Request
Related items