Font Size: a A A

Research Of Chinese Chunks On The Basis Of Field Dictionary

Posted on:2012-06-21Degree:MasterType:Thesis
Country:ChinaCandidate:K H ShiFull Text:PDF
GTID:2218330338463485Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Syntactic parsing is very difficult because there are a lot of natural language ambiguity. With the Method to Syntactic parsing, the task is divided into three sub-task:1) Chunk Identification,2) The analysis of Chunk,3) The relationship between Chunks.The purpose of this paper is to carry out the analysis of Chunk. The structure of Chunk is simple, mostly in the form of a simple phrase, and can be transformed into the relationship between words. In order to analyze the internal structure of chunks, this paper constructs a Field Dictionary. The Field Dictionary is based on the relationship between words. Making the upper and lower Relationship as the main relationship, The Field Dictionary is builded into a tree topology. Each word in the Field Dictionary corresponds to one of the nodes in the tree, and each node represents a word. Each word contains the code, part of speech and other properties. These properties are stored with XML.The aim to build the Field Dictionary is to carry out the analysis of Chunk. To this end, this paper carefully studied the internal structure of chunk and found out the relationship between internal structure and part of speech, thus forming a Chunk type decision strategies. Based on the Field Dictionary and decision strategies, it can rapidly determine the structure of the chunk. In the final stage of this article, the paper verifies the feasibility of the idea through running the program.
Keywords/Search Tags:shallow parsing, chunk, Field Dictionary, XML
PDF Full Text Request
Related items