
Research And Implementation Of Nutch-Based Search Engine For Agricultural Information

Posted on: 2014-05-24
Degree: Master
Type: Thesis
Country: China
Candidate: Y Zhang
GTID: 2268330401963274
Subject: Software engineering
Abstract/Summary:
Following Google's great success, the whole world has turned its attention to search engines. Almost overnight, a wide variety of search services appeared. From Google, Yahoo, and AltaVista to Baidu, Sogou, and Soso, there are more and more search engine brands and services. In the enterprise application market, demand for full-text information retrieval has also grown: many document processing and content management products need a full-text search function. At the same time, demand for specialized vertical search engines in individual industries is rising sharply.

In this context, search engine technology is developing rapidly. It is discussed widely in articles, magazines, and papers, and has quickly become one of the hottest technologies. Since its emergence, however, it has been a technology with a high barrier to entry. It draws on many advanced ideas and designs from academia, and the disciplines involved include natural language processing, artificial intelligence, discrete mathematics, combinatorics, and compiler theory. Designing a search engine that performs well and is genuinely practical is not easy.

China is a large agricultural country with about 800 million farmers. Agriculture is the backbone of the national economy, yet the construction of agricultural information systems lags far behind. To promote agricultural development, agricultural information resources need to be brought together and integrated so that agricultural practitioners can obtain professional, accurate information. It is therefore important to develop a search engine for agricultural information.

This thesis covers the basic concepts of search engines, their development history, and the current state of the field, as well as each stage of building a complete search engine: document crawling, document analysis, document indexing, and document ranking. On this basis, a search engine for agricultural information is developed using the open-source Apache project Nutch.
Keywords/Search Tags: Search Engine, Full-text Search, Vertical Search, Lucene, Nutch