Construction Method Of Sense Embedding Based On Semantic Graph Clustering

Posted on:2021-09-22

Degree:Master

Type:Thesis

Country:China

Candidate:Z M Zhong

Full Text:PDF

GTID:2518306107968729

Subject:Computer technology

Abstract/Summary:

To help computers understand human language, word embedding methods represent the semantics of each word as a low-dimensional dense vector, a breakthrough in natural language processing. However, natural language is polysemous, and a word embedding collapses all senses of a word into a single vector. To address this, sense embedding, which represents each sense of a word as its own vector, has been the subject of several studies in recent years. Constructing a sense embedding requires first obtaining the sense inventory of a word and then generating a vector representation for each sense. In existing models, the sense inventory is not accurate enough and the process of generating vector representations is too simple. To solve these problems, this thesis proposes a sense embedding construction model based on semantic graph clustering. The model builds the sense distribution of the target word as a graph and induces the senses of the word dynamically using improved graph clustering algorithms. Moreover, the model does not directly specify the function that generates a sense vector; instead, it first sets an optimization objective for the sense vector and then iteratively solves for the function that maps a sense cluster to its sense vector. Furthermore, to demonstrate the use of sense vectors in downstream tasks, the constructed sense vectors are integrated into a word sense disambiguation task and an entity disambiguation task to improve on traditional schemes. The word sense disambiguation task is studied in the general domain: to overcome the traditional schemes' need for a large amount of labeled data, a new scheme is proposed that combines local credibility, calculated from the sense vectors, with global popularity. The entity disambiguation task is studied in the manufacturing domain: to eliminate the ambiguity that arises when traditional schemes use word vectors to express semantics, a new scheme is designed that trains classifiers on features such as the semantic similarity computed from the sense vectors. The experiments use three datasets to evaluate the quality of the constructed sense vectors and their performance in the two disambiguation tasks, confirming that the sense embedding constructed by graph clustering improves performance by about 3% to 4% over the state-of-the-art GloVe word embedding and CWMS sense embedding.
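The general idea of inducing senses by clustering a graph around a target word can be illustrated with a minimal sketch. This is not the thesis's algorithm: the abstract does not specify its improved clustering method or its optimization objective, so the sketch substitutes plain connected components for the clustering step and the cluster centroid (the point minimizing summed squared distance to the cluster members) for the sense vector. The function names, the similarity threshold, and the toy two-dimensional vectors are all hypothetical.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def induce_senses(neighbors, vectors, threshold=0.8):
    """Group a target word's neighbors into sense clusters.

    Assumption: two neighbors are linked when their cosine similarity
    reaches `threshold`, and each connected component is one sense.
    """
    adjacency = defaultdict(set)
    for i, a in enumerate(neighbors):
        for b in neighbors[i + 1:]:
            if cosine(vectors[a], vectors[b]) >= threshold:
                adjacency[a].add(b)
                adjacency[b].add(a)

    # Depth-first search to collect connected components.
    seen, clusters = set(), []
    for word in neighbors:
        if word in seen:
            continue
        seen.add(word)
        stack, component = [word], []
        while stack:
            node = stack.pop()
            component.append(node)
            for nxt in adjacency[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        clusters.append(sorted(component))
    return clusters


def sense_vector(cluster, vectors):
    """Centroid of the cluster members (a stand-in for a learned mapping)."""
    dim = len(vectors[cluster[0]])
    return [sum(vectors[w][d] for w in cluster) / len(cluster) for d in range(dim)]


# Hypothetical neighbors of the ambiguous word "bank" with toy 2-d vectors.
vectors = {
    "river": [1.0, 0.1],
    "shore": [0.9, 0.2],
    "money": [0.1, 1.0],
    "loan": [0.2, 0.9],
}
neighbors = ["river", "shore", "money", "loan"]

clusters = induce_senses(neighbors, vectors)
sense_vectors = [sense_vector(c, vectors) for c in clusters]
```

On this toy input the neighbors split into a "river/shore" cluster and a "loan/money" cluster, each yielding one sense vector. A real system would replace both the clustering step and the centroid with the learned components the abstract describes.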

Keywords/Search Tags:

sense embedding, graph-based clustering, disambiguation task

Related items

1	Author Name Disambiguation Based Rule And Graph Model
2	Research On Word Sense Disambiguation Based On GCN Model
3	Research On Word Sense Disambiguation Method Based On Word Embedding
4	Research On Word Sense Disambiguation Based On The Strategy Of Field Priority Selection
5	Research On Chinese Word Sense Disambiguation Method Based On Graph Model
6	Research On Graph Neural Network-Based Name Disambiguation Algorithm
7	An Unsupervised Approach To Word Sense Disambiguation Based On Second-order Context
8	The Research On Knowledge-based And Graph-based Word Sense Disambiguation Algorithms
9	Research And Implementation Of Key Technologies In Information Extraction And Analysis Of Police Intelligence On The Internet
10	Research On Word Sense Disambiguation Based On Semi-supervised Model