Research Of Blog Ranking Algorithm Based On Link Analysis

Posted on: 2010-01-19
Degree: Master
Type: Thesis
Country: China
Candidate: Z H Wang
Full Text: PDF
GTID: 2218330368499989
Subject: Computer software and theory
Abstract/Summary:
Blogs, as a tool for publishing personal information, are gradually becoming an essential resource of the information age. With the rapid development of online communities and the spread of social software, the internet is gradually entering a community era; people are paying more attention to it, and millions of users publish information on it. Meanwhile, the number of bloggers is increasing rapidly, which leads to an inflation of information in the blogspace. Blog search technologies, which help users find the blog information they need in a huge mass of data, are therefore becoming more and more important. The blog page ranking algorithm is one of the key techniques of a blog search engine, and it has become a research hotspot that receives widespread attention.

In this thesis, after a detailed analysis of the structure and function of blog pages and of link analysis technology, we classify the link relationships in blog pages based on the differences in link structure between blog pages and ordinary web pages, and create a ranking algorithm specific to blog pages. The concrete procedure is as follows.

First, by analyzing the characteristics of blogs, we choose the factors that affect blog ranking, such as trackbacks, tags, and comments, and then extract them. Feature extraction first filters out web page noise and then extracts the feature factors through template matching.

On this basis, we use a link-analysis-based blog ranking algorithm to rank blog pages. The links are first divided into structural links (Trackback links) and content links (the internal links in a blog page), and a ranking algorithm is proposed for each. First, the proposed TBR algorithm evaluates the score of a blog page from the blogger's reputation: for a new page that has few or no links, the score is derived from the score of its author.
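
The abstract does not give the TBR formula. As a minimal sketch of the idea only, the Python below blends a page's link-based score with its author's reputation, leaning on reputation when inbound links are scarce. The function names, the mean-of-earlier-pages reputation, and the `min_links` threshold are all assumptions made for illustration, not details taken from the thesis.

```python
from statistics import mean


def author_reputation(earlier_scores):
    """Hypothetical reputation: mean score of the author's earlier pages."""
    return mean(earlier_scores) if earlier_scores else 0.0


def blended_score(link_score, inbound_links, reputation, min_links=3):
    """Blend a link-based score with author reputation (illustrative only).

    The weight on the link-based score grows linearly with the number of
    inbound links and is capped at 1, so a page with no links is scored
    purely by its author's reputation.
    """
    weight = min(inbound_links / min_links, 1.0)
    return weight * link_score + (1.0 - weight) * reputation


# A brand-new page with no inbound links inherits the author's reputation.
new_page = blended_score(link_score=0.0, inbound_links=0,
                         reputation=author_reputation([0.5, 1.0]))
```

Under these assumptions a well-linked page keeps its own link-based score, while a new page by an established author is not ranked at zero.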
Second, the BPR algorithm carries forward the iterative Markov process of the PageRank algorithm, so that pages sharing the same tags, categories, and links receive higher scores. In this part, the algorithm also takes the publish time of a blog page into account. The proposed algorithm thus takes full account of the factors that affect blog ranking.

Finally, we design and implement the proposed algorithm. Experiments and analysis, with comparison against previous algorithms, show that the algorithm proposed in this thesis greatly improves the relevance of the ranked blog pages to the query. The test results also show that it is more sensitive to current social hot topics.
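
The abstract describes BPR only in outline, so the following is a hedged sketch of a PageRank-style power iteration in that spirit, not the thesis's actual algorithm. It assumes that a link's weight is 1 plus the number of tags the two pages share, and that publish time enters through a teleport distribution that decays exponentially with page age. The name `rank_posts` and the parameters `damping`, `half_life`, and `iters` are hypothetical.

```python
def rank_posts(links, tags, age_days, damping=0.85, half_life=30.0, iters=50):
    """PageRank-style scores with tag-weighted links and recency-biased teleport.

    links    -- iterable of (source, target) page ids
    tags     -- dict: page id -> set of tags (defines the set of pages)
    age_days -- dict: page id -> age of the page in days
    """
    posts = sorted(tags)

    # Teleport distribution favours recent pages (assumed exponential decay).
    fresh = {p: 0.5 ** (age_days[p] / half_life) for p in posts}
    total = sum(fresh.values())
    teleport = {p: fresh[p] / total for p in posts}

    # Assumed edge weight: 1 plus the number of tags shared by both pages.
    out = {p: {} for p in posts}
    for src, dst in links:
        out[src][dst] = 1.0 + len(tags[src] & tags[dst])

    score = dict(teleport)
    for _ in range(iters):
        nxt = {p: (1.0 - damping) * teleport[p] for p in posts}
        for src in posts:
            weight_sum = sum(out[src].values())
            if weight_sum == 0.0:
                # Dangling page: spread its score along the teleport vector.
                for p in posts:
                    nxt[p] += damping * score[src] * teleport[p]
            else:
                for dst, w in out[src].items():
                    nxt[dst] += damping * score[src] * w / weight_sum
        score = nxt
    return score


scores = rank_posts(
    links=[("a", "b"), ("c", "b"), ("b", "a")],
    tags={"a": {"python"}, "b": {"python", "web"}, "c": {"cooking"}},
    age_days={"a": 0, "b": 0, "c": 0},
)
```

In this toy graph, page `b` is linked from both other pages and so outranks `c`, which receives no links. With no links at all, the scores reduce to the recency-based teleport distribution, so newer pages rank higher.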
Keywords/Search Tags:link analysis, blog ranking, tag, trackback, indirect link