Studies On Automatic Recognition Of Common Chinese Adverb's Usages Based On Statistics

Posted on: 2011-01-04    Degree: Master    Type: Thesis
Country: China    Candidate: J H Zhang
GTID: 2178330332458785    Subject: Computer software and theory
Abstract/Summary:
Automatic recognition of the usages of modern Chinese adverbs is an important component of the NLP-oriented Chinese Adverb Knowledge Base. To address the problems of the existing rule-based method for recognizing adverb usages, this thesis builds on previous work and studies the automatic recognition of Chinese adverb usages with statistical methods. Three statistical models, namely Conditional Random Fields (CRF), Maximum Entropy (ME), and Support Vector Machines (SVM), are used to label the usages of several common Chinese adverbs in the tagged People's Daily corpus (January 1998). The experiments show that the statistical approach is effective for automatically recognizing adverb usages and has good application prospects.

Following the idea of building a "Trinity" knowledge base of function words, this thesis focuses on an important part of the adverb knowledge base, the automatic recognition of adverb usages, and implements it with statistical methods.

The work mainly includes:
(1) Using the example data in the Chinese Adverb Knowledge Base as a corpus, we examine the adverb usage rules, analyze their problems, and complete the adverb knowledge base.
(2) We apply the rule-based method to recognize adverb usages in our corpus and manually check the tagging results several times, which yields a standard corpus that serves as the experimental corpus. At the same time, we further refine the information dictionary and the rule base of adverb usages.
(3) To address the shortcomings of the rule-based method, we implement automatic recognition of adverb usages with statistical methods and further improve recognition precision.

Finally, the thesis summarizes the research, outlines future work, and points out the feasibility of combining the rule-based and statistical methods for automatically recognizing adverb usages.
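The abstract does not state which features the three models use. As a rough illustration only, usage labeling with CRF, ME, or SVM is commonly framed as classifying each adverb occurrence from its surrounding words and part-of-speech tags. The sketch below assumes a `word/tag` corpus line format and a two-token context window; the function names `parse_tagged` and `window_features`, the padding symbol, and the sample sentence are hypothetical and are not taken from the thesis.

```python
from typing import Dict, List, Tuple

Token = Tuple[str, str]  # (word, POS tag)


def parse_tagged(line: str) -> List[Token]:
    """Split a 'word/tag word/tag ...' line into (word, tag) pairs.

    Assumption: tokens are whitespace-separated and the tag follows the
    last '/' in each token.
    """
    tokens = []
    for item in line.split():
        word, _, tag = item.rpartition("/")
        tokens.append((word, tag))
    return tokens


def window_features(tokens: List[Token], i: int, size: int = 2) -> Dict[str, str]:
    """Collect the words and POS tags within `size` positions of token i.

    Positions outside the sentence are filled with a '<PAD>' placeholder.
    The resulting dict could be fed to any of the three classifier types
    after vectorization.
    """
    feats = {"w0": tokens[i][0], "p0": tokens[i][1]}
    for off in range(-size, size + 1):
        if off == 0:
            continue
        j = i + off
        word, tag = tokens[j] if 0 <= j < len(tokens) else ("<PAD>", "<PAD>")
        feats[f"w{off:+d}"] = word  # keys look like 'w-1', 'w+1'
        feats[f"p{off:+d}"] = tag
    return feats


# Hypothetical example sentence; the adverb of interest is at index 1.
tokens = parse_tagged("他/r 就/d 来/v 了/y")
features = window_features(tokens, 1)
```

Each adverb occurrence in a manually checked corpus would then be paired with its usage label, and the feature dictionaries would serve as training input; the actual feature templates in the thesis may differ.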
Keywords/Search Tags: Natural Language Processing, Usage Recognition of Adverbs, Conditional Random Fields, Maximum Entropy, Support Vector Machine