Font Size: a A A

Research And Implementation Of Mobile Web Search System Based On Nutch

Posted on:2014-02-21Degree:MasterType:Thesis
Country:ChinaCandidate:M J GaoFull Text:PDF
GTID:2248330398970792Subject:Electronic Science and Technology
Abstract/Summary:PDF Full Text Request
With the popularity of the3G technology, mobile phones, portable computers and other mobile devices are becoming more common. More and more users can able to access the internet via mobile terminals conveniently. Thus, users have a more clear demand to get an intelligent and personalized search engine. The existing mobile search engines are mostly directly transferred from the local search engine. These search engines can only be used to search text relevant result, since they just regard the position information input by user as a normal text keyword. They can’t combine themselves with user’s location and other mobile information.However, mobile usersalways need to search some location related results. When they search a query, they hope they can get both text-related and location-closed web pagesfrom the search engine. Therefore, the existing mobile search engine can hardly provide ideal search results for mobile users.This paper is aiming to resolve this problem for mobile users and mainly research the resolution to get both text-related and location-closed web pages. This paper proposes a space search method to get the location-closed web pages by geotagging all webpages according to web pages’ description location in advance. Eventually, this paper implements a mobile WEB search system based on the existing open search engine-Nutch. This paper proposes a hybrid index structure based on Lucene and R-tree, as well as a "Node Priority Traversal Algorithm"which is corresponding to the hybrid structure. The mobile WEB search system uses this hybrid index to index both location and text content of web pages, and then uses the "Node Priority Traversal Algorithm" to give out located and text-related results to mobile users.This paper firstly describes the overall framework and structural design of the mobile WEB search system. Then the paper introduces the implementation details about each module, including geo-tagging in the web page preprocessing module, cluster enhancing hybrid index in the indexing module, and "Node Priority Traversal Algorithm" in the searching module. After that, this paper evaluates the function and performance of the mobile WEB search system. Finally, this paper proves the system can provide both text-related and location-closed web pages for mobile users and have a good performance.
Keywords/Search Tags:mobile WEB, search engine, Nutch, Lucene, R-tree, hybrid index
PDF Full Text Request
Related items