Font Size: a A A

Research And Implementation Of Website Search Technology Based On Ajax/lucene

Posted on:2009-02-05Degree:MasterType:Thesis
Country:ChinaCandidate:S M DingFull Text:PDF
GTID:2198330332988671Subject:Computer technology
Abstract/Summary:PDF Full Text Request
A website search engine is the essential tool to discover the important information of the website; an efficient website search engine will help enhance the value of the websites. Although some network giants have started to take this road, but in the entire internet industry, restrained to the threshold of technology, the real website search engine technology has not been widely popular. So it has vital practical significance to study and develop website search engine.This thesis mainly studies some related technologies about the website search engine, includes Full-text Retrieval, Lucene, Ajax, Spider, Chinese Segment and so on. In this foundation, a website search engine has been realized. The test result indicated that this engine has the use value.The main work of this thesis:analyzes and designs a website search engine system, designs the overall architecture of the structure and sub-modules. Then, we study and improve several key problems of the search engine, which includes:Spider for the website,HTML parser, Chinese segment algorithm,improves Lucene's sorting algorithm; to better reflect the proportion of relations about different parts of the contents in the website, use frequency position weighted algorithm in the system.At last, using eclipse development platform combining several open sources API, a website search engine systems has achieved. then the system has been tested. The test result shows that the search engine system fully meets the requirements of small and medium-sized website.
Keywords/Search Tags:Full-text Retrieval, Lucene, Ajax, Spider, Chinese Segment
PDF Full Text Request
Related items