Font Size: a A A

Research On The Methods For Obtaining Public Opinion Information Of Visitors Of Target Websites

Posted on:2018-09-18Degree:MasterType:Thesis
Country:ChinaCandidate:H ZhangFull Text:PDF
GTID:2348330533969615Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
With the rapid development of network technology,the Internet has become an important platform for the masses to obtain,post and disseminate information.However,the visitors with illegal purpose often disseminate vulgar culture and rumors to mislead public opinion,and even endanger the image of our country through specific foreign target websites.Therefore,it has become an important initiative to grasp the public opinion information of illegal visitors in specific target website timely.First of all,the method of acquiring public opinion information of visitors in foreign target websites is studied in this dissertation.The application scenarios,relevant definitions and integral structure are given in the course of research.A phased acquisition method for public opinion information of visitors in target websites is proposed,with analyzing the formation process of sensitive public opinion and characteristics of visitors' public opinion information.The methods involved in the process of acquisition are studied in detail.In the stage of the generation of sensitive public opinion,a method based on web crawler technology is proposed to obtain public opinion of illegal visitors from target websites.The firewall penetration method based on VPN proxy is researched to obtain information from target websites.The LDA latent topic Association algorithm is improved to detect topic information of webpages and judge sensitive public opinion pages in crawling process.A novel method of web page information extraction is proposed,which can load dynamic web page elements,to analyze and obtain the public opinion information of visitors.The results of experiment show that the proposed method can distinguish sensitive public opinion pages in target websites quickly,as well as obtain the authors' and illegal visitors' information of sensitive public opinion article stably and effectively.In the stage of comment and dissemination of sensitive public opinio n,a novel method based on technology of website cloning and browser caching is proposed.Based on the information obtained by crawler,the technology of dynamic website cloning is studied and an acquisition method of visitors' information in website background is implemented.By studying the strategy of browser caching,the cache script is designed to obtain the multi-faceted public opinion information of the visitors from webpage,which makes efforts to the richness of obtaining results.In order to solve the problem of inaccurate information obtained from illegal visitors who use proxy,an acquisition method is proposed to probe and obtain original IP information,Internet service provider information and local DNS information of proxy visitors.The research combines the advantages of website acquisition method and improves the acquisition effect,which is characterized by fast deployment speed and getting rich information.The result of experiment shows that this method can continuously obtain rich public opinion information of visitors and has fine probing result for visitors who surf the target websites through internal network and proxy technology.Finally,a prototype system is designed based on the proposed method for obtaining public opinion information of visitors of target websites.The running results of the system indicate that it can acquire,manage and display the public opinion information of visitors of target websites effectively,which verify the feasibility and effectiveness of the proposed method in the paper and has a broad prospect of application.
Keywords/Search Tags:network public opinion, target website, information acquisition, website cloning, browser caching
PDF Full Text Request
Related items