Font Size: a A A

Design And Implimentation Of A Blog Retrieval System Combined With Community Structure

Posted on:2013-06-11Degree:MasterType:Thesis
Country:ChinaCandidate:B H ZhangFull Text:PDF
GTID:2268330392469325Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
In recent years, with the rapid development of web2.0era, blog is alsoexplosive growing as an important Internet service. However, it also brings theproblem of how to find the useful information from the massive amount ofresources lying both in blog users and blog papers. To solve this problem, blogretrieval system which can provide the retrieval service in blogosphere is created.Through analysis, we find the problem of the main blog retrieval systems is thatthey are still not jumping out of the web retrieval thinking, in which both thedemands of the users and some features of the blog resources are not taken intoaccount.With these deficiencies in the current blog search service, this paper combinedwith community structural characteristics to try a count of new exploration. themain content can be summed up in the following areas:Firstly, Found on the analysis of existing methods of community detect, weproposes a method to find the potential community in the blogosphere based onsome unique structural characters in blog in this paper. By extracting and filteringthe blog label information, we get some directing information to find the topicswhich attract the blog author. Using these directing information and clusteringmethods, we completed the task of finding community. Our experiment result showsthat this method can efficiently improve the accuracy of finding community in blogarea.Secondly, based on the analysis and mining of latent community characters inthe blogosphere, this paper proposes a blog author ranking method which combinesthe demands of blog retrieval system users with the structure of blogospherecommunity. This method takes into account the query relativity, the author topictendency and the query topic tendency. This method is modeled and calculated withthe result of latent community division. This paper’s method for blog authorsranking has distinct difference with the classical page ranking method as the mainfeature of this paper.Finally, this paper finished design and completion of a blog retrieval systembased on the above work. In the last chat, we gave out the organizational relationships and the implementation process of each module in the systemarchitecture, and for each sub-module we show the detailed design and dataorganization and storage; at the end of this paper, we show some illustrations for theactual effect of this blog retrieval system.
Keywords/Search Tags:blog search, label, latent community, author ranking
PDF Full Text Request
Related items