Font Size: a A A

Research Of Semantic Search Engine Based On Nutch

Posted on:2015-01-04Degree:MasterType:Thesis
Country:ChinaCandidate:Y M HuangFull Text:PDF
GTID:2298330467450425Subject:Systems analysis and integration
Abstract/Summary:PDF Full Text Request
In recent years, the Internet technology has developed rapidly and the Internet information resources are getting richer and richer. Faced with vast network of information, how to extract the potential and valuable information rapidly and accurately is more and more important. Search engine is one of the main tools for getting Internet information. In the process of responding to users’requests, it just does some simple matches with keywords appeared in a document and the number of occurrences feed back to the users which is difficult to pinpoint the correlation of key words and search results. What’s more, many of the results are not the users really want, so it has not improved the low precision of research engine. In this study, we mentioned the concept of semantic search, through building Semantic repository and list, making the computer understand and identify human language. The technology of semantic research was used to serve the user retrieval better and the artificial intelligence helps users to realize the personalized search. Of course, It avoid occurring the zone area of information retrieval in a high limit.The specific research contents and results are as follows:firstly, there was a detailed introduction about Semantic level, semantic search and the search engine’s architecture. Secondly, we put forward our own opinions towards the current search engines which could not accurately "understand" keyword semantic information, just depending on the simple key word matching to search information. And then being on the basis of previous studies, semantic repository about semantic search was built in our study. Thirdly, In this paper, a small semantic search system was designed to imitate the concrete process of semantic search by building a relational database to simulate the semantic repository and realize information research by using the open source tool Nutch. The thought of this system design was that users search something by using key words. First, it was the semantic match in the semantic repository, which made the computer understand the results of human language indirectly, analyzing and getting the results. Then it output retrieve information according to the correlation. Considering the integrity and accuracy of semantic repository information, In this system, it detected key words which was inputted by users and click test results.Then, it maintained and updated semantic repository information at regular time.By analyzing the semantic database systems design and related results, we found that semantic database system had a higher precision than the traditional search engine. It would have a high research value in the semantic information search field.
Keywords/Search Tags:Semantic web, Semantic search, nutch, relevance
PDF Full Text Request
Related items