Study And Implementation Of Chinese Information Extraction From Chinese URL

Posted on: 2010-10-17  Degree: Master  Type: Thesis
Country: China  Candidate: X Chen  Full Text: PDF
GTID: 2178360278966363  Subject: Computer Science and Technology
Abstract/Summary:
As the Internet grows more popular worldwide, the amount of information on it is exploding and has exceeded what people can search by hand. Search engines emerged to solve this problem by returning the results that most closely match a user's search keywords. Sorting is one of the most important steps when a search engine returns results, and it depends on how closely each webpage matches the keywords. As part of a webpage, the URL contains useful information about that page. Extracting this information would help sort search results better. This paper analyzes the construction rules of URLs, especially Chinese URLs. Based on this analysis, we propose and implement a high-performance algorithm for extracting Chinese information from Chinese URLs.
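The abstract does not describe the algorithm itself, so the following is only a minimal sketch of what extracting Chinese text from a URL can involve, not the author's method. It assumes that Chinese text appears either percent-encoded (in UTF-8 or a GB-family encoding) or as a Punycode host name, and that a run of characters in the basic CJK Unified Ideographs range counts as one piece of "Chinese information". The names `decode_component` and `extract_chinese` are hypothetical.

```python
import re
from urllib.parse import urlsplit, unquote_to_bytes

# Assumption: "Chinese information" means runs of basic CJK ideographs.
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+")


def decode_component(text):
    """Percent-decode one URL component, trying UTF-8 first, then GB18030."""
    raw = unquote_to_bytes(text)
    for encoding in ("utf-8", "gb18030"):
        try:
            return raw.decode(encoding)
        except UnicodeDecodeError:
            continue
    # Neither encoding fits: keep what can be read, mark the rest.
    return raw.decode("utf-8", errors="replace")


def extract_chinese(url):
    """Return the Chinese character runs found in the host, path, and query."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    try:
        # Turn a Punycode (xn--) host back into Unicode when possible.
        host = host.encode("ascii").decode("idna")
    except UnicodeError:
        pass  # host is already Unicode or is not valid IDNA; use it as-is

    found = []
    for component in (host, parts.path, parts.query):
        found.extend(CJK_RUN.findall(decode_component(component)))
    return found


if __name__ == "__main__":
    url = "http://example.com/%E6%90%9C%E7%B4%A2/page?q=%E4%B8%AD%E6%96%87"
    print(extract_chinese(url))  # ['搜索', '中文']
```

Trying UTF-8 before GB18030 is a design choice in this sketch: GB-encoded Chinese text is rarely valid UTF-8, so a failed UTF-8 decode is a reasonable signal to fall back. A production system would likely need segmentation of the extracted runs into words before they could be matched against search keywords.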
Keywords/Search Tags: Search engine, Sorting, Chinese URL, Information extraction