Font Size: a A A

Study And Implementation Of Internet Tourism Resource Monitoring System

Posted on:2013-05-21Degree:MasterType:Thesis
Country:ChinaCandidate:M DuFull Text:PDF
GTID:2248330371967061Subject:Computer Science and Technology
Abstract/Summary:PDF Full Text Request
Nowadys, there are a wide variety of tourism resources on the Internet and tourism management adminstrations need to monitor these resources. At the same time, visitors also need to retrieve their own personalized and interested information, and it will take a lot of time and effort. In this thesis, the Internet tourism resource monitoring system has been researched and implemented. The main work is as follows:(1) Proposed and constructed a topic-specific crawling algorithm, and established starting URLs, topic keywords and URL prediction mechanism. The algorithm consists of three phases:initial crawling phase, learning phase and consecutive crawling phase. We used open directory project (ODP) to compute similarity and evaluate the results. Experimental results showed that the algorithm can collect relevant web pages more and more with the continuous execution of crawling process.(2) Proposeed an algorithm named TTR, based on text density, to extract content from Web pages. The algorithm first computes the pages’ text-to-tag ratio line by line and then separates content and non-content by clustering. Experimental results indicated that TTR can successfully extract content from diverse web pages.(3) Proposed a modified aRocchio algorithm to compute personal characteristics matrix and compared it with original Rocchio algorithm. We proposed mixed feature matching algorithm and devised a modified I-PageRank algorithm to sort the results. Experimental results showed that the proposed algorithms can greatly improve the proformance of system’s retrieve results.(4) Implemented an Internet tourism resource monitoring system. The system consists of topics cwaling, text extraction and personalized retrieval subsystem. We described the implementation of each subsystem in detail and tested each subsystem.In this thesis, we made research on the issue of monitoring of Internet tourism resource and implemented an Internet tourism resource monitoring system. The system can provide users with customized, comprehensive, real time travel resources collection, extraction and retrieval service, so as to bring great convenience to monitor tourism resource and to users’traveling.
Keywords/Search Tags:topic collection, text extraction, personalized retrieving, tourisim resource
PDF Full Text Request
Related items