
Research and Implementation of Spider Crawling Strategies for Mobile Search

Posted on: 2011-07-13
Degree: Master
Type: Thesis
Country: China
Candidate: P Qi
Full Text: PDF
GTID: 2178360308459330
Subject: Computer application technology
Abstract/Summary:
Mobile search is the activity in which a user on a mobile communication network uses a mobile terminal to obtain specific information through SMS, WAP, IVR, or other search methods. Mobile search technology combines search engines with mobile devices to deliver search results that suit mobile products and user needs. It frees users from the constraints of fixed equipment and fixed communication networks, so they can reach the information they need anytime and anywhere. As a combination of search and communications technology, it carries the features and techniques of both. Mobile search extends the search engine to the mobile terminal and is an important direction in the future development of mainstream search engines. Although it faces many problems at present, the rise of mobile search is the general trend. With the commercial rollout of 3G and 4G, mobile search will gradually enter a period of rapid development.

Starting from the current state of development and the existing types of mobile search engines, this thesis explains the basic principles by which a search engine crawls pages to gather information, the structure of the network, and the role the robot plays in crawling WAP pages, and it analyzes the algorithms on which web crawling is based. By analyzing the structure of the Web and the types of link mining, and by studying how mobile terminals are used in practice, it designs a crawl target and strategy for the robot that aims at high-quality web pages: only the most valuable first-layer pages are crawled. The web spider also adds a JavaScript parser and intercepts asynchronous Ajax requests, analyzing the returned data to obtain more page content.

The main research contents and innovations are as follows:
1. Based on an analysis of general web robot mechanisms and of crawler crawling-strategy algorithms, the thesis designs the structure of the web crawler used in the system and summarizes the basic properties of search engine crawlers.
2. Based on a study of the WAP search engine system structure and the basic principles of WAP crawlers on the mobile communication platform, it designs a method for handling the pages waiting to be crawled effectively. The crawling strategies include avoiding repeated crawls, prioritizing pages, speeding up fetching, and following the Robots protocol.
3. Building on the existing PageRank algorithm and the practical needs of mobile search, only the first 20 results are returned to the user.
4. Chapter 6 presents the design and implementation of a mobile reading software project based on the crawler system.
5. The last part gives a brief analysis of the system's operation and of the results achieved by this research, and looks ahead to the application prospects of mobile search engines in the 3G era and the 4G era that follows.
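
The crawling strategies named in item 2 (no repeated crawls, page priority, the Robots protocol) and the first-layer limit from the abstract can be illustrated with a small crawl frontier. This is a minimal sketch under stated assumptions, not the thesis's implementation: the class name `CrawlFrontier`, the `score` and `depth` parameters, and the example host `wap.example.com` are all hypothetical, and the robots rules are passed in as lines for a single host so that nothing touches the network. The fetch-speed strategy is not covered here.

```python
import heapq
from urllib.parse import urldefrag
from urllib.robotparser import RobotFileParser


class CrawlFrontier:
    """Queue of pages waiting to be crawled (hypothetical helper, not from the thesis)."""

    def __init__(self, robots_lines, user_agent="*", max_depth=1):
        self.robots = RobotFileParser()
        self.robots.parse(robots_lines)  # rules supplied directly, no network access
        self.user_agent = user_agent
        self.max_depth = max_depth       # 1 = seed pages plus their first layer of links
        self.seen = set()                # URLs already queued, for the no-repeat strategy
        self.heap = []                   # min-heap keyed on negated score
        self.counter = 0                 # tie-breaker that keeps insertion order

    def add(self, url, score, depth):
        """Queue a URL; return False if any strategy rejects it."""
        url = urldefrag(url).url         # "#fragment" variants count as the same page
        if depth > self.max_depth:
            return False                 # beyond the first layer
        if url in self.seen:
            return False                 # already queued or crawled
        if not self.robots.can_fetch(self.user_agent, url):
            return False                 # disallowed by the Robots protocol
        self.seen.add(url)
        heapq.heappush(self.heap, (-score, self.counter, url, depth))
        self.counter += 1
        return True

    def pop(self):
        """Return the (url, depth) pair with the highest score."""
        _, _, url, depth = heapq.heappop(self.heap)
        return url, depth

    def __len__(self):
        return len(self.heap)


frontier = CrawlFrontier(["User-agent: *", "Disallow: /private/"])
frontier.add("http://wap.example.com/", score=1.0, depth=0)
frontier.add("http://wap.example.com/news", score=0.9, depth=1)
frontier.add("http://wap.example.com/news#top", score=0.9, depth=1)   # rejected: repeat
frontier.add("http://wap.example.com/private/x", score=0.99, depth=1) # rejected: robots
frontier.add("http://wap.example.com/deep", score=0.5, depth=2)       # rejected: too deep
```

A heap keeps the highest-scoring page at the front at low cost per insertion, and the `seen` set makes the repeat check a constant-time lookup. How the score is computed is left open here, since the abstract does not say how the thesis assigns page priority.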
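
Item 3 combines PageRank with a cut-off of 20 results. The abstract does not say which PageRank variant the thesis uses, so the sketch below assumes the textbook power-iteration form with a damping factor of 0.85. The function names `pagerank` and `top_results`, the `links` dictionary layout, and the tiny example graph are hypothetical.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration.

    links maps each page to the list of pages it links to.
    """
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Rank held by pages with no out-links is spread evenly over all pages.
        dangling = sum(rank[page] for page in pages if not links.get(page))
        new_rank = {
            page: (1.0 - damping) / n + damping * dangling / n for page in pages
        }
        for page, targets in links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


def top_results(rank, k=20):
    """Return at most k pages, highest rank first (k=20 follows item 3)."""
    return sorted(rank, key=rank.get, reverse=True)[:k]


# Hypothetical four-page graph: every page links to "home", and "home" links to "a".
example_links = {"home": ["a"], "a": ["home"], "b": ["home"], "c": ["home"]}
example_rank = pagerank(example_links)
best = top_results(example_rank)
```

Truncating to a fixed number of results suits a small mobile screen and a slow link, which matches the motivation the abstract gives for returning only the first 20.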
Keywords/Search Tags: mobile search engine, information retrieval, crawler tree link analysis, focused crawler