Discovering Important Bloggers Based On Content And Link Analysis

Posted on: 2007-02-19
Degree: Master
Type: Thesis
Country: China
Candidate: Y H Yang
Full Text: PDF
GTID: 2178360185485661
Subject: Computer Science and Technology
Abstract/Summary:
A blog is an RSS-based platform for publishing and exchanging information. Written in the style of a daily journal, it serves as an interactive medium between author and reader and represents a new mode of information diffusion. Compared with conventional websites, the blog space contains more links, and conversations between bloggers are more frequent. Blogs make it convenient for users to publish information and hold discussions on the Internet.

The popularity of blogs and the amount of information in the blog space are growing quickly, and several problems have become serious. Illegal advertisements and harmful information with unhealthy content appear constantly and spread rapidly through blogs. At the same time, it is difficult for Internet users to find the information they care about.

In this thesis, we focus on information filtering in the blog space and on discovering important bloggers for the convenience of users. We identify and filter harmful information with a method based on a similarity measure, and obtain good results. We propose a method of ranking bloggers based on link analysis that reflects the characteristics of blogs and reduces the influence of link spamming. The method makes it easier for users to choose which blogs to read, and it may offer a new approach to information retrieval in the blog space. To check the reliability of the ranking results, we propose several evaluation indicators of important bloggers and compare the scores that our method assigns to 233 bloggers with the scores obtained from these indicators. Correlation analysis is used to verify the consistency between our method and the evaluation indicators. In addition, we simulate various forms of link spamming and recompute the importance values with our method on the modified link relations. The correlation coefficients between the importance values of 1057 bloggers before and after link spamming are higher than 0.9, which indicates that link spamming has only a small effect on our method.
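The robustness check described in the abstract (rank bloggers by link analysis, simulate link spamming, then correlate the importance values before and after) can be illustrated with a small sketch. The abstract does not specify the ranking algorithm, so the code below substitutes a plain PageRank-style iteration as a stand-in. The function names `rank_bloggers` and `pearson`, the damping factor, the toy link graph, and the spam accounts `s1` and `s2` are all assumptions made for illustration; they are not taken from the thesis.

```python
import math


def rank_bloggers(links, damping=0.85, iterations=50):
    """PageRank-style importance scores (a stand-in, not the thesis method).

    links maps each blogger to the set of bloggers they link to.
    Assumes there are no self-links.
    """
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)
    nodes = sorted(nodes)
    n = len(nodes)
    score = {b: 1.0 / n for b in nodes}
    for _ in range(iterations):
        # Score held by bloggers with no out-links is spread uniformly.
        dangling = sum(score[b] for b in nodes if not links.get(b))
        new = {b: (1 - damping) / n + damping * dangling / n for b in nodes}
        for src, targets in links.items():
            for dst in targets:
                new[dst] += damping * score[src] / len(targets)
        score = new
    return score


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


# Toy link graph (hypothetical data).
links = {"a": {"b", "c"}, "b": {"c"}, "c": {"a"}, "d": {"c"}}
before = rank_bloggers(links)

# Simulated link spamming: two fake accounts link to "d" and to each other.
spammed = {k: set(v) for k, v in links.items()}
spammed["s1"] = {"d", "s2"}
spammed["s2"] = {"d", "s1"}
after = rank_bloggers(spammed)

# Compare importance values of the original bloggers only.
original = sorted(before)
r = pearson([before[b] for b in original], [after[b] for b in original])
print("correlation before/after spamming:", round(r, 3))
```

A correlation close to 1 on such a comparison would mean the injected links barely changed the relative importance of the original bloggers, which is the kind of evidence the abstract reports for its own method.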
Keywords/Search Tags: important blogger, link analysis, evaluation indicator, correlation analysis, information filtering