Font Size: a A A

The Design And Implementation Of The Web Page Data Collection System

Posted on:2016-02-21Degree:MasterType:Thesis
Country:ChinaCandidate:W HeFull Text:PDF
GTID:2348330479954317Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the prosperous development of the Internet industry, Webpage browsing magnitude followed reached the peak, such as taobao, Tmall, Baidu, Sina a nd other domestic website inside the giant, more than ten million daily page views, each operating in Webpage, residence time of each page, each page jump between the sequence and so on information for internet company and even for countries are very valuable, based on these data we can calculate the majority of user habits, or calculate the user of the site on the preferences of each plate, or calculate the website is popular in which regional and so on. Internet company of these hidden attributes are always hungry, whether large companies or small companies, each data for them are like stars in the clear night sky, drifting, these small stars are involved with each other, far-reaching, as long as gently toggle any a little star, it will lead to another group of little stars produced tremendous change. For small companies, the user amount is not too large, the data acquisition will have many limitations, most of small companies are only concerned about the traffic.Webpage buried point system in the BS architecture and the traditional management mode based on SPM, introduced the embedded point and automatic submerged, database master-slave separation technology, the request to the server only to the aspect as a transparent image pixel as the goal, with all the collected data according to the need in the URL request, the server only needs to please jetty request log to find, do not require real-time processing request, and the use of distributed cache server as the transfer station, which makes the page increase in buried bit more simple and convenient, but also greatly improves the server's load capacity.Webpage data collection system in accurate records of the page and click on the loading of data at the same time, ensure the server stability, can accept one hundred thousand requests at the same time, data storage delay in five minutes. According to the online after the front end buried personnel responsible for feedback, buried points compared to the previous page new work is particularly simple and convenient, greatly saving manpower cost of front end. At the same time, Webpage data collection system for malicious server requests and the error of the read data are a set of custom validation mechanism, guarantee the safety and accuracy of data collection.
Keywords/Search Tags:Huge Data, High Load Capacity, Real Time, Accurate
PDF Full Text Request
Related items