
BT Forum Oriented Search Engine And Mobile Application Technology

Posted on: 2021-01-25
Degree: Master
Type: Thesis
Country: China
Candidate: C Sun
Full Text: PDF
GTID: 2428330623967783
Subject: Computer Science and Technology
Abstract/Summary:
With the rapid development of Internet-related technologies and industries in recent years, the volume of information and data on the Internet has grown exponentially. Faced with this much data, traditional general-purpose search engines can no longer meet users' needs, and it is increasingly difficult for users to find the information they want. This has driven the further development of vertical search engines. Compared with general search engines, a vertical search engine focuses only on the data and information of a specific field, so it can provide more accurate results and a better experience for users who need information in that field. Against this background, many vertical search engines have appeared.

Meanwhile, the BitTorrent protocol is popular with the large number of users who want to download large files from the Internet, especially multimedia files. The only requirements for a download task are client software that supports the BT (BitTorrent) protocol and the corresponding torrent file. However, vertical search engines for torrent files are currently scarce on the Internet, while magnet-link search engines for DHT (Distributed Hash Table) networks are more common. For this reason, this thesis designs and implements an Internet-oriented BitTorrent vertical search engine system.

On the search engine side, this thesis uses the Nutch and Solr frameworks to build a distributed vertical search engine. In this architecture, Nutch is responsible for crawling web pages, parsing them, and extracting key information. The thesis proposes and implements a filtering scheme for web pages that contain torrent file download links; only the pages that pass the filter are analyzed and saved, which turns the Nutch crawler into a BT-oriented focused (theme) crawler. Solr is mainly responsible for building the index and, as the index server, for responding to users' query requests. To fit the usage scenario of this system, the thesis modifies Solr's default sorting method: it proposes a half-life-based method for calculating the download heat (popularity) of a torrent and sorts the search results accordingly.

On the client side, this thesis designs and implements an Android app as the client of the search engine. The app encapsulates the search engine's query interface, through which users can access the Solr server, obtain search results in the app, and download torrent files. In addition, for some BT forums on the Internet, the thesis proposes a solution that lets users quickly search for and download the torrent files posted in these forums. This scheme is based mainly on the interaction mechanism between Android WebView and JavaScript, so that users can perform forum operations quickly and easily inside the app and then download the files. Finally, the functions and performance of the system are tested through experiments, and the test results are analyzed.
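
The abstract does not give the formula behind the half-life-based download heat, so the following is only a minimal sketch of one common form of such a score, not the thesis's actual method. It assumes each download event contributes a weight that halves every `half_life_days`, and that a torrent's heat is the sum of those weights. The function names, the 7-day half-life, and the use of day offsets as timestamps are all illustrative assumptions.

```python
from typing import Dict, List


def download_heat(download_times: List[float], now: float,
                  half_life_days: float = 7.0) -> float:
    """Sum of decayed download events (assumed form, not from the thesis).

    Each download at time t (in days) contributes 0.5 ** (age / half_life),
    so a download exactly one half-life old counts as 0.5.
    Events dated after `now` are ignored.
    """
    if half_life_days <= 0:
        raise ValueError("half_life_days must be positive")
    return sum(
        0.5 ** ((now - t) / half_life_days)
        for t in download_times
        if t <= now
    )


def rank_by_heat(torrents: Dict[str, List[float]], now: float,
                 half_life_days: float = 7.0) -> List[str]:
    """Return torrent names ordered from hottest to coldest."""
    return sorted(
        torrents,
        key=lambda name: download_heat(torrents[name], now, half_life_days),
        reverse=True,
    )


if __name__ == "__main__":
    # Hypothetical data: download times expressed as day offsets.
    sample = {
        "old_release": [0.0, 0.0, 0.0],   # three downloads, 28 days ago
        "new_release": [28.0, 28.0],      # two downloads today
    }
    print(rank_by_heat(sample, now=28.0))
```

With a score of this kind, a torrent with a few recent downloads can outrank one with more downloads that happened long ago, which matches the intent of sorting results by current popularity rather than by lifetime totals.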
Keywords/Search Tags:vertical search engine, Nutch, Solr, theme crawler, BitTorrent