Research On Chinese Hyponymy Relation Automatic Extraction

Posted on:2016-12-26

Degree:Master

Type:Thesis

Country:China

Candidate:S Y Chen

Full Text:PDF

GTID:2308330470976863

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

There are a variety of relations between words, such as Hyponymy, synonymy, antonymy, whole-Part relation etc. As one of the most important part, Hyponymy describes the base classification methods between objects. In the field of NLP(Natural Language Processing), the hypernymy is a kind of subordinate relation in semantic. For instance, those words which satisfy the conditions of Hyponymy can be described as "Word B is a kind of word A". We can say A is the hypernym of B, or B is the hyponym of A. Besides, A is the class of B, and B is one of instances of A.One of basic tasks of semantic extraction on words is how to extract the relation of hypernymy correctly and efficiently. For supporting the advanced knowledge extraction, this task try to convert those Non-formatting information into a hierarchy. In the fields of Machine Translation, textual Entailment and Information retrieval, Hyponymy plays a important part in supporting the task like ontology knowledge database extension, correct detection and improvement.This paper attempts to combine a series of methods to solve the problem of hyponymy acquisition and validation.For the acquisition task, by combined with the algorithm of pattern self-extension and Chinese word definition of Wikipedia, we propose a hyponymy extraction method based on Latent Dirichlet Allocation (LDA) modal.For the validation task, by calculating the Contextual Feature Similarity (SimCF) and Brown Clustering Similarity (SimBrown), we present a novel approach of hyponymy validation based on combination of Contextual Feature and Brown Clustering. Evaluation on CCF NLP&CC2012 Word Semantic Relation corpus shows that the approach achieved a good result.

Keywords/Search Tags:

Hyponymy Relation, Contextual Similarity, Brown Clustering Similarity, Pointwise Mutual Information, Pattern Matching, Clustering Validation

PDF Full Text Request

Related items

1	Research On Semantic Similarity Computation And Applications
2	Research On Algorithms Of Subspace Clustering Based On Pattern Similarity
3	The Method Of Chinese Synonym Extraction Based On Large-scale Corpus
4	The Research On Database Schema Matching System
5	Research Of Sequence Clustering Algorithm Based On Weighted Similarity
6	Similarity Measures In Cluster Analysis And Its Applications
7	Study On Similarity-based Text Clustering Algorithm And It's Application
8	Research And Application Of Spectral Clustering Based On Density Adaptive Neighborhood
9	Research On Clustering Algorithm Based On High Order Information
10	Extraction Of Entity Hyponymy And Synonymy Relations From Open Domain Texts