Font Size: a A A

Automatic Segmentation System Based On Ontology And Fuzzy Math

Posted on:2009-08-13Degree:MasterType:Thesis
Country:ChinaCandidate:H WuFull Text:PDF
GTID:2208360242991110Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Chinese automatic word segmentation is the fundamental task of the Chinese Information Processing. It becomes one of the bottlenecks in Chinese Information Processing. The Chinese word is-ambiguity is the key factor of precision of Chinese word segmentation. Many researchers have studied in this field. But come to the present condition, it can't satisfy the demand of application.This article has introduced the present situation and basic methods of Chinese automatic word segmentation, and has analyzed the difficulties of it. Deep study and research in HowNet semantics network as well as fuzzy mathematics have been done, the combined points between Chinese automatic word segmentation and them are found, and word segmentation plan based on HowNet and fuzzy mathematics is proposed: The input Chinese sentences are scanned twice. Firstly the words without ambiguity and words sequences with ambiguity are found out, secondly the right segmentation for ambiguity word sequences are found, so are the segmentation results for the entire sentences.HowNet semantics are used as knowledge source to build HowNet sememes network and words network, which record sememes and words so as the relationships of them. HowNet knowledge dictionary is the basic dictionary; making fuzzy mathematics rules for ambiguity. After the first segmentation based on"building word", HowNet words network and fuzzy rules are used to do the second segmentation.As the dis-ambiguity aspect, we presented the goal of designing a word segmentation system based on HowNet semantics networks and fuzzy reasoning mechanism, discussed the principle and schemes combined with"building words"thoughts, and developed a prototype system.
Keywords/Search Tags:Chinese automatic word segmentation, ambiguity, HowNet, fuzzy mathematics, build word II
PDF Full Text Request
Related items