The Design And Implementation Of Cross-language Navigational Search Engine

Posted on: 2011-03-30 | Degree: Master | Type: Thesis
Country: China | Candidate: P Y Zhu | Full Text: PDF
GTID: 2198330332988023 | Subject: Computer technology
Abstract/Summary:
In recent years, with the rapid development of information technology, the internet has become one of the most important channels through which people publish and retrieve information. Because internet users are spread worldwide and web content is written in many languages, research on cross-language information retrieval (CLIR) for the internet is becoming increasingly significant. Statistics show that English is the largest language on the internet by number of users, followed by Chinese. This thesis is motivated by a specific need: a Chinese-speaking user who wants to find a foreign website (especially an English-language site) by typing Chinese keywords into the search box. The thesis designs a special-purpose IR system called CLNSE (Cross Language Navigational Search Engine), which consists of two sub-systems. The first sub-system is a meta search engine that uses the query translation technique from CLIR together with a website URL finding algorithm. The second sub-system is a full-text database search system: it first employs a spider to crawl website directories and extract information about navigational websites, then uses Google Translate to translate the extracted information (document translation) and stores it in a MySQL database, and finally uses Ferret to run full-text search over that information. The thesis first introduces related theories and key techniques, including IR, CLIR, meta search engines, spiders, and full-text search, and then describes the design and implementation of CLNSE. Finally, the thesis evaluates the system; the preliminary evaluation shows that the design is feasible and that CLNSE is superior to several target systems in the area of cross-language navigational search.
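The abstract does not give the details of the first sub-system, so the following is only a minimal sketch of the general idea, under stated assumptions: Chinese keywords are translated term by term, the translated query would be sent to several search engines, and the website root that appears in the most result lists is taken as the navigational target. The toy dictionary `ZH_EN`, the function names, and the voting rule are all hypothetical illustrations, not the thesis's actual query translation method or URL finding algorithm.

```python
from collections import Counter
from urllib.parse import urlsplit

# Hypothetical toy bilingual dictionary (assumption); a real system
# would rely on a dictionary resource or a translation service.
ZH_EN = {"微软": "Microsoft", "白宫": "White House"}


def translate_query(query):
    """Translate each whitespace-separated keyword; keep unknown terms as-is."""
    return " ".join(ZH_EN.get(term, term) for term in query.split())


def site_root(url):
    """Reduce a result URL to the root of its website."""
    parts = urlsplit(url)
    return f"{parts.scheme}://{parts.netloc}/"


def find_site(result_lists):
    """Return the site root found in the most engines' result lists.

    Each engine votes at most once per site. This voting rule is an
    assumption made for illustration only.
    """
    votes = Counter()
    for results in result_lists:
        for root in {site_root(url) for url in results}:
            votes[root] += 1
    return votes.most_common(1)[0][0] if votes else None


# Stubbed result lists standing in for two underlying search engines.
engine_a = [
    "https://www.microsoft.com/en-us/",
    "https://en.wikipedia.org/wiki/Microsoft",
]
engine_b = [
    "https://www.microsoft.com/windows",
    "https://news.example.com/microsoft",
]

english_query = translate_query("微软")
best_site = find_site([engine_a, engine_b])
```

Here the result lists are hard-coded stand-ins; in a working meta search engine they would come from submitting `english_query` to the underlying engines. Voting on site roots rather than full URLs reflects the navigational goal of finding a website's home, not a particular page.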
Keywords/Search Tags: IR, CLIR, search engine, meta search engine, full-text database search