Statistics-based Chinese Pos Tagging Method

Posted on:2005-10-28

Degree:Master

Type:Thesis

Country:China

Candidate:Y M Liang

Full Text:PDF

GTID:2208360122997456

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the development of computers, it is the inevitable trend that nature languages are used as Human-Computer interactive languages, which demands deeper and broader nature language processing. Part-of-speech tagging is a fundamental theme in nature language processing. It is signification to the tagging of Chinese corpus-based, machine translation and information retrieval of large scale text.In this paper, we study the method of the Chinese Part-of-Speech tagging and analyze the rule method and the statistic method. The amount of contextual information and the degree of data smoothing are two important parameters to evaluate performance of statistical model of Chinese Part-of-Speech tagging. This paper describes an extension to the hidden Markov model for Chinese Part-of-Speech tagging using Second-Order approximations for both contextual and lexical probabilities, as well as the traditional Viterbi algorithm is extended. The model makes use of more contextual information than standard statistical models. A smoothing algorithm based on the linear interpolation algorithm is introduced to solve the sparse data problem of the model. The new full Second-Order HMM has been proved to improve Chinese part-of-speech tagging accuracies and disambiguation accuracies over current models.

Keywords/Search Tags:

Full Second-Order Hidden Markov Model, Viterbi algorithm, Chinese Part-of-Speech tagging, Smoothing algorithm

PDF Full Text Request

Related items

1	HMM-based Chinese Part-of-Speech Tagging And Improvement
2	Chinese Part-of-Speech Tagging Based On Ameliorated Hidden Makov Model
3	Research On Kirghiz Basic Part-of-Speech Tagging Based On HMM
4	Statistical Based Mongolian Part-of-Speech Tagging Study And Realization
5	The Research Of Part-of-speech Tagging Based On Hidden Markov Model
6	Research On Chinese Part-of-speech Tagging Based On Semi Hidden Markov Model
7	Hidden Markov Model Parameters Estimation For Part-of-Speech Tagging
8	Application Of Hidden Markov Model In Part-of-Speech Tagging
9	Research On Laodian Participle And Part-of-speech Tagging Method
10	Study Of Kazak Part-of-Speech Tagging Based Upon HMM