Font Size: a A A

Design And Implementation Of Individualized RSS News Retrieval System

Posted on:2008-12-05Degree:MasterType:Thesis
Country:ChinaCandidate:Y BiFull Text:PDF
GTID:2178360245496835Subject:Software engineering
Abstract/Summary:PDF Full Text Request
Internet has leaded us to an era with large amount and high speed information, challenging the traditional way to access Internet, that is, whether can find and receive the interested information effectively and efficiently. The emergence of RSS (Rich Site Summary or Really Simple Syndication or RDF Site Summary) reader machine can solve the problem to some extent. By using RSS reader machine, there is no need to access the websites to find information, which gets rid of the frequent accessing to many websites everyday. Moreover, RSS reader machine can update the contents of these websites at a certain interim, which provides a solution to the update problem. However, there are too many duplicated information in RSS reader machine due to the duplication contents of these websites, which wastes a lot of time of users.This article improves the function of RSS reader machine. Realize individualized RSS news retrieval. Three functions are added: filter function of the news with the same contents, that is, when the article with the same topic and similar content appears in more than one website, RSS reader machine only displays the news of the website with the highest priority; individualized subscription function, that is, the news can be subscribed according to the interests of users; series news link function, that is, it can link to the related news distributed before.Firstly, this article analyzes the XML (Extensible Markup Language) file with Digester module, and tag the title of news analyzed by using the open module of Chinese word segmentation and speech tagging system. Then, these news are differentiated according to date. All of the searched news keywords are compared, categorized and stored according to the proposed criterion in this paper. Finally, the news with the highest priority is displayed and the other news with the same content are filtered; comparing the subscription keywords and / or forbidden keywords and news key words to realize the news subscription function; realizing the series news link function by the comparison of the keyword of different date classes under the criterion. According to the operation, statistics, comparison and analysis of system, and the introduction of the value of P and R and the evaluation parameter, a good result is achieved.
Keywords/Search Tags:news retrieval, RSS, speech tagging
PDF Full Text Request
Related items