Font Size: a A A

Research On Financial Blog Crawling And Ranking Algorithm

Posted on:2010-07-30Degree:MasterType:Thesis
Country:ChinaCandidate:H ChenFull Text:PDF
GTID:2178360332457869Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
Blog is a media which combine personal and public nature together, it makes full use of the features of two-way interaction network, hypertext links, dynamic update and a wide range of covering. Its essence is not only to express personal ideas or record their own daily experience, but rather to select and link the Internet's most valuable information, knowledge and resources to provide shared resources for others from a personal point of view.With the rapid development of blog, it has brought a flood of blog resources. The question how to organize, retrieval, utilization the rich blog resources, and mining valuable information has caused widely public concern both in the scientific research sector and industrial sector, a variety of methods and techniques are being explored in the application. Currently, main google, baidu and other major search engine has increased the attention on the blog, but they still user the traditional sorting algorithm.This paper analysis the difference between Blog page and traditional Web page, design and implementation a blog reptiles system based on RSS structure. Contrast BlogRank, B2Rank and EigenRumor algorithms and do an in-depth study on the possible effects of the factors that ultimately sort results. And finally assess a finance blog search results sorting algorithm does not query-based.Design and implement the Haitianyuan blog system for providing a demonstrate platform for the financial blog posts clawing from the Internet. Integration it with the Haitianyuan system and provide common weblog service. Evaluate the result of the algorithm using heat of article, the result shows that it is feasible to find a non query-based blog sorting algorithm in particular area, 68% accuracy rate also verified the validity of the algorithm.
Keywords/Search Tags:finance, blog search, information retrieval, link analysis, page rank
PDF Full Text Request
Related items