Font Size: a A A

Design And Implementation Of Data Acquisition System Based On Web Crawler

Posted on:2016-07-11Degree:MasterType:Thesis
Country:ChinaCandidate:Y S ZhaoFull Text:PDF
GTID:2348330521451052Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The great wealth of information on the Internet provides a lot of valuable information.Big data and its application is based on the value of mass information extraction,which provides a new way for the user interaction and business operations of the Internet and even a number of industry enterprises,and to a considerable extent,improve the user's network access experience and consumption patterns.The management information system not only can carry on the standardized data management to the special subject information,provides the data support for other business application,also provides the powerful support for the enterprise's business decision.However,because of the limitations of the traditional management system,it can not meet the rapid development of network applications.The data management information system is often in the closed environment of the system software,data generation and management are provided by the software system,which leads to the high cost of data management,data value is a single rigid and other issues.Web crawler provides a powerful technical support for modern Internet information retrieval,and provides a comprehensive data solution for the future management information system.This paper firstly introduces the main ways of Internet information collection technology,analyzes the basic principle and the latest progress of the web crawler technology,combined with the theme crawler and general crawler technology,design data collection system.In this paper,we propose a method of generating the authority of the site and the key words of the site.The algorithm based on VSM algorithm,which is based on the algorithm.After the completion of the system development,the function and performance test,basically to achieve the desired design objectives.In the end of this paper,we summarize the work of network design collection technology and system design,and point out the shortcomings of the system and the direction of future improvement.
Keywords/Search Tags:WebCrawler, Theme, VSM, MIS
PDF Full Text Request
Related items