
Research Based On Chinese Word Segmentation Technology In The Duplicate-inquiry System Of Corporate Name

Posted on: 2012-12-13
Degree: Master
Type: Thesis
Country: China
Candidate: K Chen
Full Text: PDF
GTID: 2248330395455243
Subject: Computer technology
Abstract/Summary:
When an enterprise applies to register its name, the name must be examined and approved by the competent registration authority. An enterprise name may be used only after it has been approved and registered, and the enterprise then enjoys the exclusive right to use it within a prescribed scope. With the rapid development of Guizhou, the number of enterprises is growing constantly. Faced with such a large volume of corporate name information, it makes sense to take full advantage of the computer system's powerful query and search functions. Analyzing names and determining whether they are duplicates also has important theoretical and practical significance. This paper focuses on the characteristics of enterprise names and implements a new duplicate-checking module for corporate names. The key points of the research are as follows:

1. The use of Chinese word segmentation technology in the duplicate-inquiry system of corporate names. Chinese word segmentation is the process of applying a word segmentation algorithm to split text into words so that a computer can process and understand the information more easily. It has a wide range of applications, mainly in information retrieval, information extraction, machine translation, and natural language processing. This paper uses a typical dictionary-based Chinese word segmentation algorithm, the Forward Maximum Matching algorithm. Its idea is simple and easy to implement, but its segmentation accuracy and speed are not ideal. To address this problem, keyword segmentation is added, which improves both the speed and the accuracy of segmentation.

2. Based on the statutory rules for enterprise names, a processing module for homophones and polyphones is introduced, and a conversion of the shop name within the corporate name into Pinyin is proposed. An exhaustive method enumerates all pronunciation sequences, which are then used to check whether the corporate name is a duplicate, completing the duplicate-inquiry process.
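The two ideas summarized above can be illustrated with a minimal Python sketch. It is not the thesis implementation: the toy dictionary, the reading table, and the function names forward_max_match, pinyin_sequences, and may_conflict are all assumptions made for illustration. The first function is plain dictionary-based forward maximum matching (the "Largest Forward Matching" idea in point 1, without the keyword-segmentation improvement); the other two enumerate every toneless Pinyin sequence of a shop name exhaustively and treat two shop names as possible duplicates when they share a sequence, as in point 2.

```python
from itertools import product

# Hypothetical toy dictionary; a real system would load a full lexicon.
DICTIONARY = {"贵州", "贵阳", "科技", "有限", "公司", "有限公司"}
MAX_WORD_LEN = max(len(word) for word in DICTIONARY)

# Hypothetical toneless reading table; polyphones list several readings.
READINGS = {
    "长": ["chang", "zhang"],
    "乐": ["le", "yue"],
    "行": ["hang", "xing"],
    "兴": ["xing"],
    "华": ["hua"],
}


def forward_max_match(text, dictionary=DICTIONARY, max_len=MAX_WORD_LEN):
    """Segment text left to right, always taking the longest dictionary word."""
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink it one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


def pinyin_sequences(shop_name, readings=READINGS):
    """Exhaustively enumerate every pronunciation sequence of a shop name."""
    # Characters missing from the table are kept as-is (an assumption).
    options = [readings.get(ch, [ch]) for ch in shop_name]
    return {" ".join(seq) for seq in product(*options)}


def may_conflict(name_a, name_b):
    """Two shop names may be duplicates if any pronunciation sequence matches."""
    return not pinyin_sequences(name_a).isdisjoint(pinyin_sequences(name_b))


if __name__ == "__main__":
    print(forward_max_match("贵州兴华科技有限公司"))
    print(sorted(pinyin_sequences("长乐")))
    print(may_conflict("兴华", "行华"))
```

The number of sequences grows as the product of the reading counts per character, which stays small for short shop names; that is the assumption that makes exhaustive enumeration practical in this sketch.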
Keywords/Search Tags: enterprise name, Chinese segmentation, segmentation algorithm