Font Size: a A A

Research Url Adverse Social Networks

Posted on:2015-03-31Degree:MasterType:Thesis
Country:ChinaCandidate:Y B ZhaoFull Text:PDF
GTID:2268330431954123Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
In recent years, the rapid development of such social networking questions and answers, fast growth in users. After years of accumulation, questions and answers to people-centred social networking is emerging as one of the main applications of the Internet. However, with the rapid development of social networks, Internet Security received a new challenge. First is the security of user information. In social networks, in order to better communicate, users tend to disclose their personal information. However due to the lack of security awareness and privacy measures, and user information are often illegal or were some unruly elements openly illegal use. Frequent users’privacy and security issues. Was followed by the spread of dirty URL. Due to the rapid dissemination of information in social networks, many hackers can use the social network dissemination of bad information. Hazard is dirty URL. Primarily a malicious URL, phishing URL, spam URL, porn URL. These users use social networking and social networking development poses a serious health hazard.This paper is one of the few articles for dirty URL in the current research questions and answers social networks. Effectively make up for the shortcomings of this area of research. In view of the current dirty URL appears in the question-and-answer type of social network conducted an in-depth study of the phenomenon. Firstly the dirty URL problems in social network analysis. Said one of hazards:spreading a malicious URL (including phishing URL), pose a threat to users, install malicious software, tapping user information, steal user passwords and other threats URL or distributing pornography, affect the network environment; or walk a great deal of advertising, making it difficult for users to find the answers they need. Misleading or if the user received a lot of bad information and harassment.Aiming at questions and answers social networking issues, this paper proposed solutions. First crawling Yahoo Answers website, extract the URL through which users answer questions and then judged urlvoid website. You can quickly identify the nature of the URL. Judging is a malicious URL. Ads URL, that is, with users not correspond to,the URL. Calculate the URL by text similarity matching values and issues. Web page keywords are used to extract text similarity computation. If the matching value is low, you think they are not relevant and thus judged to be not relevant URL.Finally through experiments and the test results are assessed. Through a large number of experiments and evaluation experiments and achieved good results.Our paper by researching dirty URL in the questions and answers type of social network, Yahoo answers by crawling, analysis, conclusions, and what results have been achieved:1, this paper is currently on the study of social networks are bad URL in one of the few articles, question and answer such social problems in the network, presented a framework to deal with, and have designed our systems through experiments;2, we improved the identification rate of dirty URL by S4platform with Yahoo. Won the ideal rate;3, we identify spam URL, ad URL by text similarity. Extends the application of text similarity principle;4, our system quickly distinguish a malicious URL, which greatly reduced the damage of malicious URL.
Keywords/Search Tags:ocial network, bad URL, extraction, identification
PDF Full Text Request
Related items