Research And Implementation Of Digital Book Search System Based On User Click-through Data

Posted on: 2009-10-07
Degree: Master
Type: Thesis
Country: China
Candidate: C Yuan
Full Text: PDF
GTID: 2178360242983112
Subject: Computer application technology
Abstract/Summary:
Since the mid-1990s, many developed and developing countries have invested heavily in the development of digital libraries, which have become an important means for people to access knowledge and information. Digital book search is a service that a digital library should continuously provide. This thesis focuses exclusively on digital book search and explores in depth the problem of ranking search results, so that visitors to a digital library can quickly find books that satisfy their needs among massive book resources.

Traditional digital book search is based on the matching techniques of relational databases. It can only retrieve book entries that contain the keywords the reader entered. Moreover, it lacks an effective mechanism for ranking the relevant books, and it ignores their popularity and quality.

The main work of this thesis is summarized as follows:

1. Extract users' book-click behavior from the access logs, construct a Correlation Graph of the books read by users, and use a random walk algorithm to rank the books by relevance.
2. Extract query words and book reading records from the access logs, and use the clustering effect of random walks to cluster the query words.
3. Crawl book score data from well-known online bookstores on the Internet, which serves as another important measure for book ranking.
4. Propose an approach for integrating multiple sources of book ranking information for each class of query. The final ranking of book search results is obtained by fusing text similarity, book score data from online bookstores, and the book ranking from the Correlation Graph.
5. Develop a digital book search system using the above algorithms and techniques and deploy it in the CADAL portal. Users have reported that the new system ranks search results more reasonably than the original book search module.
Keywords/Search Tags:Digital Library, Correlation Graph, BookRank, Query Clustering, Ensemble of Multiple Information Sources
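
As an illustration of items 1 and 4 of the abstract, the following is a minimal sketch of ranking books with a random walk over a click-based correlation graph and then fusing that ranking with text similarity and bookstore scores. The abstract does not say how the Correlation Graph is built, which random walk variant is used, or how the signals are fused, so everything specific here is an assumption: edges are weighted by co-clicks within a session, the walk is a PageRank-style walk with uniform restart (damping 0.85), and fusion is a weighted sum of min-max normalized scores. All function names, weights, and sample data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_correlation_graph(sessions):
    """Assumed construction: edge weight = number of sessions that clicked both books."""
    graph = defaultdict(lambda: defaultdict(float))
    for books in sessions:
        for a, b in combinations(sorted(set(books)), 2):
            graph[a][b] += 1.0
            graph[b][a] += 1.0
    return graph


def random_walk_rank(graph, nodes, damping=0.85, iterations=50):
    """Power iteration for a random walk with uniform restart (PageRank-style)."""
    nodes = list(nodes)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        nxt = {v: (1.0 - damping) / n for v in nodes}
        for u in nodes:
            out = graph.get(u, {})
            total = sum(out.values())
            if total == 0:
                # Book with no co-clicks: spread its mass uniformly over all books.
                for v in nodes:
                    nxt[v] += damping * rank[u] / n
            else:
                for v, w in out.items():
                    nxt[v] += damping * rank[u] * w / total
        rank = nxt
    return rank


def min_max(scores):
    """Rescale scores to [0, 1] so different signals are comparable."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {k: 0.0 for k in scores}
    return {k: (v - lo) / (hi - lo) for k, v in scores.items()}


def fuse(text_sim, store_score, graph_rank, weights=(0.5, 0.2, 0.3)):
    """Assumed fusion: weighted sum of normalized signals over the candidate books."""
    candidates = list(text_sim)
    signals = [
        min_max({b: text_sim[b] for b in candidates}),
        min_max({b: store_score.get(b, 0.0) for b in candidates}),
        min_max({b: graph_rank.get(b, 0.0) for b in candidates}),
    ]
    fused = {b: sum(w * s[b] for w, s in zip(weights, signals)) for b in candidates}
    return sorted(candidates, key=lambda b: fused[b], reverse=True)


# Hypothetical click sessions: each list holds the books one user clicked.
sessions = [["A", "B"], ["A", "C"], ["A", "B", "C"], ["D"]]
books = {b for s in sessions for b in s}
rank = random_walk_rank(build_correlation_graph(sessions), books)

# Hypothetical text-similarity and bookstore scores for the books matching a query.
text_sim = {"A": 0.2, "B": 0.9, "C": 0.9}
store_score = {"A": 4.5, "B": 3.0, "C": 4.0}
print(fuse(text_sim, store_score, rank))
```

In the thesis the fusion is described as being chosen per class of query, which in a sketch like this would correspond to using a different `weights` tuple for each query cluster.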