Font Size: a A A

Research And Realization Of Search Enginee On Topic-Specific Based On Ontology

Posted on:2010-06-09Degree:MasterType:Thesis
Country:ChinaCandidate:Z L JiangFull Text:PDF
GTID:2178360275951475Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Internet has been become the largest information sources base,but the sources are not ordered,But users gain information rapidly,inerrably,across-the-board is becoming a big problem.The invention of Search Engineer brings much convenience to search useful information for users.Due to the disadvantages of universal Search Engineers of lowed covering,untimely and imprecise results, they can not satisfy with the needs of engineers in special fields.Under the circumstances,search engineer on Topic-Specific came forth.But current many search engineers on Topic-Specific are based on syntax, they only can match web pages mechanically,and are lowed efficiency.The paper—Research and Realization of Search Engineer on Topic-Specific Based on Ontology which makes use of Ontology,analysis and deals with query of users.It makes search engineer can work intelligently,and enhances the efficiency.In gathering information,the paper gives the method of net Spider on Topic-Specific;in processing html documents,it carries through Chinese split words with MIK_CAnalyzer;it makes use of Ontology to execute search among concepts for overcoming the accuracy of results,and filtrates and order by PageRank technology and related concepts.The paper does the following work:(1) It expatiates concepts and current circs,analysis the basic principle, structure,and introduces the advances of search engineer on topic-specific.(2) It designs a net spider on topic-specific.It gives the strategy and model of net spider program design,and realizes the visit to related web page sources of internet,and save the web pages into data base.(3) It uses IF-IDF theory to index web pages.In processing index;first,it makes prime analysis to web pages in web data base;second,filtrates invalid contents,and deals with body contents by MIK_CAnalyzer,last it picks-up documents characters information and makes IF-IDF index files. (4) It designs ontology base on shoe.First,it fixes core concepts on shoe, develops ontology base by prot(?)g(?) with certain methods and rules.(5) It designs search component on ontology.After users input keywords,it carries out split words with MIK_CAnalyzer,gains prime keywords,and then match the prime keywords and concepts in ontology base.Finally,searches files in index data base with the standard keywords.(6) It realizes the system,tests some instances,and analysis results.The paper has the following traits on above work:(1) It combines with features of shoes,develop the ontology with prot(?)g(?), make some useful development.(2) It develops a specific spider program with related web pages.Aim at the current situation of search engineer on topic-specific in syntax,the system carry out a exploration in semantic,it can produce consulted meaning.
Keywords/Search Tags:ontology, search engineer on topic-specific, net spider, Chinese split words, search
PDF Full Text Request
Related items