Research and Implementation of a Management System for Constructing a Kazakh Phrase Library

Posted on: 2015-04-12 | Degree: Master | Type: Thesis
Country: China | Candidate: L Z T B H D W L T E | Full Text: PDF
GTID: 2298330431991846 | Subject: Computer technology
Abstract/Summary:
Establishing a Kazakh phrase corpus is a significant part of building the Kazakh language corpus and a multi-level treebank. Its significance is twofold. On the one hand, it has practical value for the construction of a Kazakh treebank and can promote the study of Kazakh phrase structure. On the other hand, once established, the hand-annotated Kazakh phrase library provides raw material for treebank construction and reduces its difficulty. This thesis studies the construction of the phrase library and the identification of complex phrases (mostly noun phrases and verb phrases), and obtains the following results:
1) It studies the types of Kazakh phrases and designs a phrase classification standard. The standard takes the phrase types commonly used in syntactic description as its basic types and adds tags such as phrase fragments, so that most of the phrases encountered when analyzing sentences from real corpora can be annotated properly.
2) It develops annotation guidelines for complex phrases. The principles of fine-grained phrase labeling are described first; the next step gathers as many words as possible into one phrase. For example, if an adjective phrase and the noun that follows it can still form a phrase, they are marked as a larger noun phrase.
3) It divides the task of creating and building the phrase library into three parts: the phrase library, the phrase composition rule base, and the identification of Kazakh complex phrases.
4) It designs an algorithm for recognizing complex phrases and a method for extracting phrase composition rules. Phrase boundary information is simplified to deal with the problem of boundary matching, so that as many complex phrases as possible can be identified.
5) It presents an evaluation scheme for complex phrase recognition. Because recognized complex phrases reflect a hierarchy among words and the recognized units are large, an evaluation based only on word counts does not truly reflect the performance of the recognition system, so the evaluation counts identified phrases rather than words.
6) It implements a management system for building the Kazakh phrase library. Phrase recognition is rule-based, and a method for automatically acquiring rules is designed.
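The abstract does not give the rules or the algorithm, so the following is only a minimal sketch of how rule-based recognition of nested phrases and phrase-level scoring might look. The tag names (`ADV`, `ADJ`, `N`, `V`), the contents of `RULES`, and the functions `recognize` and `phrase_prf` are all hypothetical and are not taken from the thesis. The sketch repeatedly merges adjacent labels that match a composition rule, so an adjective phrase followed by a noun grows into a larger noun phrase, and it scores whole phrases (label plus span) instead of individual words.

```python
from typing import Dict, List, Tuple

# A phrase or word is (label, start, end), with end exclusive.
Node = Tuple[str, int, int]

# Hypothetical composition rules: a pair of adjacent labels -> parent label.
# Illustrative only; this is NOT the rule base described in the thesis.
RULES: Dict[Tuple[str, str], str] = {
    ("ADV", "ADJ"): "AP",
    ("ADJ", "N"): "NP",
    ("AP", "N"): "NP",
    ("N", "V"): "VP",
    ("NP", "V"): "VP",
}


def recognize(tags: List[str]) -> List[Node]:
    """Greedily reduce adjacent labels left to right until no rule applies.

    Returns every phrase built along the way, so nested phrases are kept.
    """
    nodes: List[Node] = [(t, i, i + 1) for i, t in enumerate(tags)]
    phrases: List[Node] = []
    changed = True
    while changed:
        changed = False
        i = 0
        while i < len(nodes) - 1:
            key = (nodes[i][0], nodes[i + 1][0])
            if key in RULES:
                merged = (RULES[key], nodes[i][1], nodes[i + 1][2])
                nodes[i:i + 2] = [merged]  # replace the pair by the parent
                phrases.append(merged)
                changed = True
                # Stay at i so the new phrase can combine with what follows.
            else:
                i += 1
    return phrases


def phrase_prf(pred: List[Node], gold: List[Node]) -> Tuple[float, float, float]:
    """Phrase-level precision, recall and F1: a phrase is correct only if
    its label and both boundaries match a gold phrase."""
    pred_set, gold_set = set(pred), set(gold)
    correct = len(pred_set & gold_set)
    p = correct / len(pred_set) if pred_set else 0.0
    r = correct / len(gold_set) if gold_set else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f


if __name__ == "__main__":
    found = recognize(["ADV", "ADJ", "N", "V"])
    print(found)
    print(phrase_prf(found, [("AP", 0, 2), ("NP", 0, 3)]))
```

Keeping every intermediate merge, not only the outermost phrase, is what lets the output reflect the hierarchy among words. A real system would also need rules longer than two labels and a way to resolve competing matches, which this sketch leaves out.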
Keywords/Search Tags: phrase composition rule extraction, complex phrase identification, natural language processing, Kazakh phrase