
Automatic Identification of Chinese Organization Names

Posted on: 2004-12-05
Degree: Master
Type: Thesis
Country: China
Candidate: Y L Zhang
Full Text: PDF
GTID: 2208360092980727
Subject: Computer application technology
Abstract/Summary:
Automatic, accurate identification of Chinese organization names is important for improving the accuracy of automatic word segmentation, and it lays a foundation for natural language understanding, machine translation, information extraction, and information retrieval. This paper presents a model that identifies Chinese organization names by combining statistics and rules: it uses the concept of word-segment reliability together with a set of rules. Statistics are used to build dictionaries of organization special words, organization foreside words, and similar word classes, while the rules confirm the left boundary of an organization name to improve identification precision. In a preliminary experiment, precision and recall reach 94.17% and 91.50% respectively on the closed test, and 92.40% and 86.48% on the open test.
Keywords/Search Tags:Natural Language Processing (NLP), Proper Noun, Chinese Organization Names, Uni-gram, Bi-gram
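
The combination of dictionaries, segment reliability, and a left-boundary rule described in the abstract could be sketched roughly as follows. This is only an illustrative sketch, not the thesis's actual model: the dictionary contents, the reliability scores, the threshold, and the function name find_org_names are all hypothetical, and the abstract does not say how reliability is computed or which rules are used.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy dictionaries. The thesis builds its own from corpus
# statistics; these entries are placeholders for illustration only.
ORG_SPECIAL_WORDS: Set[str] = {"公司", "大学", "银行"}    # words that typically end an organization name
ORG_FORESIDE_WORDS: Set[str] = {"在", "由", "和", "到"}  # words that typically precede an organization name

# Hypothetical reliability scores: an assumed estimate of how likely a
# segmented word is to occur inside an organization name.
RELIABILITY: Dict[str, float] = {"北京": 0.9, "科技": 0.8, "中国": 0.9, "人民": 0.7}


def find_org_names(
    words: List[str], threshold: float = 0.5, max_len: int = 6
) -> List[Tuple[int, int, str]]:
    """Return (start, end, name) spans for candidate organization names.

    `words` is an already segmented sentence. Each organization special
    word is treated as a right boundary; the span is then extended to the
    left until a stopping rule fires.
    """
    results: List[Tuple[int, int, str]] = []
    for end, word in enumerate(words):
        if word not in ORG_SPECIAL_WORDS:
            continue
        start = end
        while start > 0 and end - start + 1 < max_len:
            prev = words[start - 1]
            # Rule: a foreside word marks the left edge of the name.
            if prev in ORG_FORESIDE_WORDS:
                break
            # Statistic: stop when the previous word is not reliable enough.
            if RELIABILITY.get(prev, 0.0) < threshold:
                break
            start -= 1
        # A special word with nothing attached is not reported as a name.
        if start < end:
            results.append((start, end, "".join(words[start:end + 1])))
    return results


if __name__ == "__main__":
    sentence = ["他", "在", "北京", "科技", "大学", "工作"]
    print(find_org_names(sentence))
```

In this sketch the special word fixes the right edge, and the left edge is decided by whichever fires first: a foreside word, a low-reliability word, or the length cap. A real system would estimate the dictionaries and scores from an annotated corpus rather than hard-coding them.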