Neural Network-based Open Information Extraction And Its Application

Posted on:2022-01-15

Degree:Master

Type:Thesis

Country:China

Candidate:Y C Li

Full Text:PDF

GTID:2518306482489434

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Open Information Extraction(Open IE)aims to extract structured informational tuples from unstructured text,which can find out valuable information automatically and is one of the most promising method in solving the information overload problem.Currently,this technique is widely used in many industrial applications and plays an important role in many downstream NLP tasks.Previous methods of Open IE mainly rely on neural-network-based sequence labeling model.However,these kinds of approaches cannot identify confused entity mentions accurately,and the extraction speed is relative slow leading to problems when the systems are applied on super large-scale datasets and real-time applications.On the other hand,the usage of Open IE technique on Query-Based Text Summarization task is not well studied.Previous summarization methods mainly base on neural-network-based generative model,which cannot ensure the factual correctness and completeness of generated summaries.To solve the above problems,this research proposes two novel Open IE algorithms and one Open IE-based text summarization model.1.To enable the model to identify confused entity mentions,we propose a multi-level features based Open IE model.First,we propose predicate-specific embedding layer to involve sentence-level features in the argument boundary identification process.Then,we introduce coattention mechanism to build span-level features to classify the category of entity mentions.The proposed method significantly outperforms previous Open IE systems on OIE2016 and ReOIE2016 benchmarks.2.To accelerate the information extraction process,we propose an Open IE model with a sliding window-based grouping algorithm.The proposed method first uses a simplified SPO tagging schema to identify all information phrase within the inputs,and then introduce the sliding window-based grouping algorithm to group the information into tuples.The experimental results on SAOKE benchmark show that this approach shorten the overall extraction time significantly with a comparable extraction precision.3.To solve the unfaithful and unmeaningful generation problem of query-based text summarization systems,we proposed a knowledge-augmented query-oriented text summarization model.First,the model constructs knowledge graph via Open IE technique,and propose the Graph Self Attention(GSA)module to select useful part of the knowledge graph.Then,we introduce the Conditional Self Attention(CSA)module to generate summaries according to the topic in which the query is concerned.

Keywords/Search Tags:

Open Information Extraction, Information Extraction, Text Summarization, Knowledge Graph

PDF Full Text Request

Related items

1	Research On Open Text Information Extraction In Chinese Knowledge Graph
2	Study On Methods And Their Applications Of Text Automatic Summarization And Information Extraction
3	Research And Application Of Information Extraction Technology Oriented To Operator Tariff Knowledge Map
4	Research On The Key Techniques Of Web Information Intelligent Acquisition
5	Research On Information Extraction And Automatic Classification Method For Open Access Journal Websites
6	The Design And Implementation Of Building Knowledge Graph System Based On Information Extraction
7	Research On Information Extraction And Fusion Of Knowledge Graph For Unstructured Data
8	The Design And Implementation Of Knowledge Extraction Service For Constructing The Knowledge Graph Of The Financial Domain
9	Design And Implementation Of E-commerce Information Extraction System Based On Knowledge Graph
10	Construction And Analytics Of An Chinese Enterprise Knowledge Graph