
The Application Of Suffix Array In Uyghur, Kazak, Kyrgyz Search Engine

Posted on: 2013-02-27
Degree: Master
Type: Thesis
Country: China
Candidate: L H Zhao
Full Text: PDF
GTID: 2248330374966819
Subject: Computer application technology
Abstract/Summary:
With the rapid development of the Internet and the World Wide Web, information retrieval has moved from manual lookup to computers and, today, to the network. Finding the required information quickly and accurately in the vast volume of online resources has become both harder and more important, and search engines emerged to meet this need. The main purpose of a search engine is to return the results of a user's query quickly and accurately, ranked so that the most relevant results come first. So far, the total number of web pages in Uyghur, Kazak, and Kyrgyz is about 50,000, a very small number compared with widely used languages. The Uyghur, Kazak, Kyrgyz search engine currently uses the inverted index, the prevailing indexing technology; a review of available material and an analysis of its use show that the inverted index has certain limitations. This thesis introduces three indexing technologies and analyzes their advantages and disadvantages. Based on the characteristics of the Uyghur, Kazak, Kyrgyz multilingual search engine, it proposes improving index construction with a newer indexing technology, the suffix array, builds an index with this method, and exploits the suffix array's main strength, phrase queries, to return query results quickly and accurately. Suffix arrays can be constructed in several ways, each with different characteristics. The thesis runs a comparative experiment between the quicksort-based algorithm and the DC3 algorithm, two well-regarded methods for suffix array construction. The experiment identifies the construction algorithm that suits the characteristics of the Uyghur, Kazak, and Kyrgyz languages, and a desktop search experiment demonstrates the advantages of the suffix array for phrase queries. The experiments show that suffix arrays built for Uyghur, Kazak, and Kyrgyz text can fully exploit their advantage in phrase queries and improve precision.
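To make the idea of phrase queries on a suffix array concrete, the following is a minimal sketch, not the thesis's implementation. It builds the array with a plain comparison sort of suffix start positions (standing in for the quicksort-style construction; DC3 is not shown) and answers a phrase query with two binary searches. The function names `build_suffix_array` and `find_phrase` and the sample text are assumptions made for illustration. Python strings are sequences of Unicode code points, so the same code applies to Arabic-script or Cyrillic text.

```python
from typing import List


def build_suffix_array(text: str) -> List[int]:
    """Return the start positions of all suffixes in lexicographic order.

    Sort-based construction: simple, but each comparison may scan up to
    n characters, so it is slower than linear-time methods such as DC3.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_phrase(text: str, sa: List[int], phrase: str) -> List[int]:
    """Return every position where `phrase` occurs, in ascending order."""
    m = len(phrase)

    # Lower bound: first suffix whose m-character prefix is >= phrase.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < phrase:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > phrase.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= phrase:
            lo = mid + 1
        else:
            hi = mid

    # All suffixes in sa[start:lo] begin with the phrase.
    return sorted(sa[start:lo])


if __name__ == "__main__":
    sample = "banana"  # hypothetical sample text
    sa = build_suffix_array(sample)
    print(sa)                              # [5, 3, 1, 0, 4, 2]
    print(find_phrase(sample, sa, "ana"))  # [1, 3]
```

Because all suffixes that start with the same phrase sit next to each other in the sorted array, a multi-word phrase is located directly as one contiguous range, without intersecting per-word posting lists as an inverted index would.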
Keywords/Search Tags: Search Engine, Index Technology, Suffix Arrays, Desktop Search