A Research On Deep Learning Based Relation Extration With Improved Hypothesis

Posted on:2019-04-02

Degree:Master

Type:Thesis

Country:China

Candidate:S X Yang

Full Text:PDF

GTID:2428330545499759

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Relation extraction is one of the important techniques in information extraction,which aims to extract the relations of labeled entity pairs in sentences.We can not only build knowledge graph on the basic of relation extraction,but also use this technique in text comprehension and machine translation.However,conventional supervised relation extraction methods need massive training dataset to ensure the model can be completely trained,then a lot of experienced experts are required to label the dataset,which is time and labor consuming.Besides,although conventional distant supervision relation extraction can expand the training dataset automatically,these methods will introduce some wrong labelling problems into dataset and decline its quality.The development of deep learning brings new probabilities to natural language processing.Nowadays,techniques like word embedding and neural network in deep learning have great performances on text semantic representation.Therefore,many researchers begin to extract relations at the aspect of deep leaning methods.Howto solve the noise problems in distant supervision data is a burning issue.In this thesis,we improve the basic hypothesis in distant supervision relation extraction methods and design a corresponding deep learning method to fulfill relation extraction task.The whole work can be divided into the following three parts:Firstly,to solve the noise data in distant supervision dataset,we improve the basic hypothesis at the aspect of semantics.In our hypothesis,sentence semantics will be helpful for generating the relation labels of entity pairs.Secondly,in order to increase the quality of whole dataset,we propose a cluster-based distant supervision relation extraction method Clustered DS to relabel the dataset on the basis of improved hypothesis.During the clustering period,the semantics information is conducive to redress the wrong labels in dataset.Finally,we incorporate deep learning into improved hypothesis and propose a Bi-GRU+Clustered DS(Bi-directional Gated Recurrent Unit + Clustered Distant Supervision)model.In this model,the Bi-GRU network is used to extract the hidden semantics of every sentences and the semantics approaching mechanism is designed to achieve better sentence semantic encoding.Experimental results on aligning Freebase knowledge base and the New York Times corpus(NYT)demonstrate that the proposed Clustered DS can settle some wrong labelling and miss labelling problems in distant supervision dataset.Further experiments also show that the proposed Bi-GRU+Clustered DS outperform some of the state-of-art relation extraction methods in precision and recall.

Keywords/Search Tags:

neural relation extraction, deep learning, bi-directional GRU, semantics-improved distant supervision hypothesis

PDF Full Text Request

Related items

1	Neural Relation Extraction Based On Distant Supervision Approaches
2	Study On Denoising Algorithm For Distant Supervision Relation Extraction
3	Research On Relation Extraction Based On Distant Supervision Labeled Data
4	Using Distant Supervision And Representation Learning For Entity Relation Extraction
5	Distant Supervision For Relation Extraction Based On Deep Learning
6	Sample Denoising In Distant Supervision For Relation Extraction
7	Research On Distant Supervision Relationship Extraction Based On Deep Reinforcement Learning
8	Reasearch On Relation Extraction Based On Distant Supervision
9	Research And Application Of Relation Extraction Based On Distant Supervision
10	A Chinese Entity Relation Extraction Method Based On Distant Supervision