
Research on FTP Search Engine Indexing Technology Based on Kademlia

Posted on: 2014-05-06
Degree: Master
Type: Thesis
Country: China
Candidate: X M Shi
Full Text: PDF
GTID: 2268330401982922
Subject: Computer software and theory
Abstract/Summary:
In recent years, as Internet resources have become more diverse and their storage more distributed, distributed FTP search engines based on P2P technology have become a research hotspot in FTP resource retrieval, and indexing technology is the key to improving FTP retrieval efficiency. Based on the characteristics of FTP resource retrieval and the shortcomings of the Kademlia model, this thesis proposes a distributed double-letter inverted indexing algorithm for peer-to-peer networks, built on a Kademlia model that contains geographical location information (a Distributed Double-letter Inverted Indexing Algorithm Based on a Kademlia Model Containing Geographical Location Information, referred to as DGKAD). To improve the efficiency of resource retrieval, the nodeID in DGKAD includes the physical location of the node. This reduces the mismatch between the logical structure of the Kademlia overlay network and the underlying physical structure, and so improves the efficiency of network communication. In addition, because the search object is a file name, which is short, DGKAD avoids word segmentation and improves the recall and precision of search results compared with a DHT inverted indexing algorithm based on the standard Kademlia model (a DHT Inverted Indexing Algorithm Based on a Standard Kademlia Model, abbreviated as DSKAD). Finally, DGKAD is evaluated in a simulation experiment that measures the number of logical-path hops required and the recall and precision of FTP resource retrieval. The results show that DGKAD consumes less network bandwidth, locates resources faster, and achieves higher recall and precision.
Keywords/Search Tags: P2P, FTP Search Engine, Kademlia, Double-letter Inverted Indexing, DGKAD, DSKAD
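
The abstract does not give the details of DGKAD, so the following is only a minimal sketch of the two ideas it names, not the thesis's actual algorithm. It assumes that the region code occupies the high bits of the node ID, that IDs are 32 bits wide (standard Kademlia uses 160), and that a "double-letter" key is an overlapping two-character substring of the lowercased file name. All names here (`make_node_id`, `responsible_node`, `DoubleLetterIndex`) are hypothetical.

```python
import hashlib
from collections import defaultdict

ID_BITS = 32      # assumption: shortened ID width for readability
REGION_BITS = 8   # assumption: high bits of the ID hold a region code


def make_node_id(region_code, address):
    """Build a node ID whose high bits carry a region code (assumed layout)."""
    low_bits = ID_BITS - REGION_BITS
    digest = int.from_bytes(hashlib.sha1(address.encode()).digest(), "big")
    return (region_code << low_bits) | (digest & ((1 << low_bits) - 1))


def xor_distance(a, b):
    """Standard Kademlia distance metric: bitwise XOR of two IDs."""
    return a ^ b


def key_id(key):
    """Hash an index key into the same ID space as the nodes."""
    digest = hashlib.sha1(key.encode()).digest()
    return int.from_bytes(digest[: ID_BITS // 8], "big")


def responsible_node(key, node_ids):
    """Pick the node whose ID is XOR-closest to the key's ID."""
    target = key_id(key)
    return min(node_ids, key=lambda n: xor_distance(n, target))


def double_letters(name):
    """Overlapping two-character keys of a lowercased file name."""
    s = name.lower()
    return {s[i:i + 2] for i in range(len(s) - 1)}


class DoubleLetterIndex:
    """Local inverted index: two-character key -> set of file names."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, file_name):
        for key in double_letters(file_name):
            self.postings[key].add(file_name)

    def search(self, query):
        keys = double_letters(query)
        if not keys:
            return set()
        # Intersect the posting lists, then confirm the substring match,
        # since sharing every pair does not guarantee a contiguous match.
        candidates = set.intersection(
            *(self.postings.get(k, set()) for k in keys)
        )
        q = query.lower()
        return {name for name in candidates if q in name.lower()}
```

Under these assumptions, two nodes with the same region code always differ only in the low bits, so their XOR distance is smaller than the distance to any node in another region. The index needs no word segmentation because every file name is split mechanically into character pairs. In a distributed setting, each key's posting list would be stored on the node returned by `responsible_node`.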