Font Size: a A A

Research And Implementation Of Subject-oriented Vertical Search Engine On Basic Educational Resources

Posted on:2010-04-26Degree:MasterType:Thesis
Country:ChinaCandidate:X L DiFull Text:PDF
GTID:2178360275489314Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
With the rapid development of Internet, the electronic information on Internet is also increasing rapidly. Internet has brought great convenience for people. But when people are enjoying the convenience, they have gradually found an issue which is needed to solve eagerly. The issue is how to find some resources which are needed and related to some specific field in the huge and a variety of information. When we use the general search engine to search some information related to one specific field , the results that have been searched usually cover all fields, existing much duplicate and useless information, so the general search engine can't provide very accurate retrieval service well for users. But the vertical search engine is different from the general search engine on searching some information related to one specific field. The vertical search engine is a professional search engine which focuses on a certain business, a specific group of people or some specific need. It is the subdivision and extension of search engine. It can provide more accurate and higher quality information for users.This paper takes resources of disciplines in the basis education field as the background , it has compeleted initially a vertical search engine which can search more accurate results by using and expanding lucene and heritrix. This paper includes the following contents.First,the paper uses and expands heritrix which is an open source web spider developed by Java to crawl information about disciplines in basis education field from Internet. Second, by using some kinds of api tools,the paper has completed the function that can extract and process the information which is crawled.Third, by researching lucene and related technology deeply, the paper expands and applies lucene to the system successfully which is designed in the paper, and makes lucene provide better full text retrieval service. Forth, the paper subjoins the function of collecting some query and the function of feedback to improve the interaction of the system.
Keywords/Search Tags:Vertical Search, Heritrix, Lucene, Information Extraction, Results Sort
PDF Full Text Request
Related items