Font Size: a A A

The Research And Implementation Of Vertical Search Engine Based On The Educational Field

Posted on:2012-02-26Degree:MasterType:Thesis
Country:ChinaCandidate:P XueFull Text:PDF
GTID:2178330335476657Subject:Educational technology
Abstract/Summary:PDF Full Text Request
Along with the rapid development of Internet, the resources on the internet grow explosively, including in the educational field. The web offers us rich and various educational resources. However facing with huge and unorderly resources, people do not own an effective tool to query the information. At present, people search information on Internet through Google, Baidu and other general search engine primarily. Generally speaking, the general search engine can meet the demands of user. But when users just want to query the related information quickly and accurately from a specific field or in a certain theme, this kind of search engine will be a little insufficient. The emergence of vertical search engine meets those specific demands well. Vertical search engine, it builds for a particular field and a particular group or a particular need, it has gradually become the popular and important tool to get the professional network information.This paper takes resources of disciplines in educational field as the background; it has be completed initially a vertical search engine which can search more accurate results by using and expanding Heritrix,Lucene and MVC. This paper includes the following contents.First, Researches the Heritrix. web crawler, expand its functions to obtain the related source only about the education.Second, by researching Lucene and related technology deeply, the paper expands and applies Lucene to the system successfully in order to make Lucene provide better full text retrieval service.Third, the paper has completed the function that can extract and process the HTML information, build the educational dictionary, make the JE-analysis embed in the system, build a good index database.The most but not the last one, improving the ranking algorithm "classic algorithm-PageRank" The new ranking algorithm, Non average PageRank support to solve the PageRank's disadvantage, which focuses authority of web and relation between the webs on the Internet. Search results shows that the educational vertical search engine efficiency is improved better.
Keywords/Search Tags:educational resources, vertical search engine, web crawlers, Ranking algorithm
PDF Full Text Request
Related items