Font Size: a A A

Research On Techniques Of Domain-Specific Intelligent Searching

Posted on:2008-01-13Degree:MasterType:Thesis
Country:ChinaCandidate:L SunFull Text:PDF
GTID:2178360215958224Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
The information in the Internet is increasing everyday, thus the Internet has become a largest information repository. But the information in the Internet is numerous and complicated, thus how to acquire the information in time and with high accuracy and completeness becomes an urgent task for people. Searching engines can resolve this problem in a certain extent, and can help users find their needed information conveniently. However, domain-specific websites and pages increase faster, which are professional, precise and in-depth, and then general searching engines can not satisfy the information need of special users. According to this case, domain-specific intelligent search technology has become an active focus in information processing recent years.In this thesis, domain-specific topic search engines are first compared with the general search engines in architecture, principle, key techniques, and the research status and development direction of topic search technology is also analyzed.And then, based on the research of the ontology technology's application in websites classification, an ontology-based sites information model is presented, including two key aspects: the website topic ontology structure and the components of the website information model, and the significance of the model applied in the topic search is also illustrated.In the following, the searching heuristic strategy of topic crawler is researched, several typical search algorithms are analyzed comparatively; Based on this, using an improved HITS algorithm, a multi-topic collaborative crawler is designed and the implement of software programming interfaces is introduced briefly.Finally, according to the above research, an intelligent search engine prototype system is designed, in which the ontology-based sites information model and the multi-topic collaborative crawler are applied. Then several typical experiments are done to testify the precision ratio, the recall ratio, and the satisfaction ratio of the prototype system, and these evaluation indicators have reached a higher level.
Keywords/Search Tags:intelligent search, domain, ontology, topic crawler
PDF Full Text Request
Related items