Error-Driven Chinese Part-of-Speech Annotaion Rearch

Posted on:2008-01-07

Degree:Master

Type:Thesis

Country:China

Candidate:Y Wang

Full Text:PDF

GTID:2178360215983607

Subject:Control theory and control engineering

Abstract/Summary:

PDF Full Text Request

In the recent years, with the rapid development and enlargement of the Chinese Corpus and annotation technologies, a large scale of language block based at nationality language and different types of tagging feature musters appeared. The researches of the deep-processing methods and relevant algorithms are in need for the advancement of Nature Language Processing. Just like the other language, the first step to approach Chinese corpus knowledge is part-of-speech tagging. Annotation systems which can run on the computers supports the computational linguistics which have attracted wide concerns from the related fields such as Artificial Intellegence.There are several annotating solutions which mostly base statistical algorithm and rules which was writted manually. Such as the Maxent Entropy model and Hidden Markov ModelRule, which integrated different rules-templates can provide tagging tools for Natual Laguage. But the tagging results are not good enough to apply to the deep level annotation in the real text.According to the statiscal examples which are collected from multiwords annotation error results in system, this essay will introduce three parts of appending models for Part-of-Speech task based at Maxent Entropy model. A new error-based method composed of events with feature probability which was calculated in advanced was held out to choose features templates for multi-word.

Keywords/Search Tags:

error-driven, part-of-speech, annotation, maxent entropy

PDF Full Text Request

Related items

1	Chinese POS Tagging Employing Maxent And Word Clustering
2	Chinese Text Zero-watermarking Technique Based On Statistics Of Part-of-speech
3	Research On High-efficiency Speech Perceptual Hashing Authentication Algorithm Based On Instant Speech Communication
4	Research On Text Document Information Hiding
5	The Study Of3D Annotation System On Full3D Part Model
6	Study Of Chinese POS Tagging Based On Maximum Entropy
7	Network Resources Annotation Based On Chinese FrameNet Ontology
8	Chinese POS Tagging Based On Maximum Entropy
9	Research On Part-of-speech Tagging For Chinese Electronic Medical Records
10	Research And Implementation For Part-of-speech Taggingapply Inautomaticenglish Essay Scoring