Research And Application Of Regression Network Method For Nested Named Entity Recognitio

Posted on:2024-08-31

Degree:Master

Type:Thesis

Country:China

Candidate:J H Yu

Full Text:PDF

GTID:2568307130958409

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

Named entity recognition is one of the basic tasks in natural language processing.In natural language texts,there are a large number of named entities with nested structures.Traditional methods using sequence annotation models for entity recognition cannot effectively identify named entities with nested structures.In order to effectively identify nested named entities,this paper designs a regression network model that can end-to-end identify entities with nested structures.The research work of this article can be divided into the following two parts for introduction:(1)Propose a nested named entity recognition model based on regression network(RN).The model first maps sentences to a saturated higher-order abstract semantic representation through a deep neural network.Then generate candidate text bounding boxes on this abstract semantic representation.Finally,drawing inspiration from the idea of object detection algorithms in the image field,a text based regression algorithm was designed to construct an end-to-end multi-objective learning framework.This framework can simultaneously predict the classification scores of entity candidates and refine their spatial positions in sentences,and can optimize the model globally by sharing parameters.Through experiments,it has been shown that the RN model outperforms the relevant work by about 5% and 2% in F1 values on the ACE2005 Chinese and English datasets,respectively,and has also achieved competitive performance on the GENIA dataset.(2)Propose a full length nested named entity recognition model based on regression networks.The previous model required enumerating a large number of text borders,resulting in high computational complexity and inability to identify long entities through regression.However,there are a large number of long entities in real text.If the method of the previous model is used to identify long entities,the semantic weakening is severe,and generating candidate entities can easily lead to high computational complexity and data imbalance.To address these two issues,this paper proposes a multi granularity full length nested named entity recognition model based on regression networks,which does not need to exhaustively list all possible candidate entity spans,Only a small portion of the span needs to be verified and all possible entities in the sentence can be identified through regression operations.In addition,for long entities,multi granularity sentence representations are used to encode semantic features of different semantic granularity.In this experiment,even if a small number of entity spans were enumerated,the model still remained competitive,achieving 87.35% and 80.85% F1 scores on the ACE2005 and GENIA datasets.

Keywords/Search Tags:

Nested named entity recognition, Regression network, Multi granularity, Full length named entity recognition

PDF Full Text Request

Related items

1	Research On Nested Named Entity Recognition Based On Deep Learning
2	Chinese Nested Named Entity Recognition Research
3	Research On Chinese Nested Named Entity Recognition Based On Graph Neural Networ
4	The Method Of Nested Named Entity Recognition In Microblog
5	Research On Complex Entity Recognition And Class Increment Problem In Named Entity Recognition
6	Research On Boundary-based Nested Named Entity Recognition Method
7	Design And Implementation Of Financial Nested Named Entity Recognition System
8	The Study Of A Method For Nested Named Entity Recognition Assisted By Wiktionary
9	Research On Nested Named Entity Recognition Method Combining Boundary Detection And Span Classificatio
10	Multi-level And Multi-modal Named Entity Recognition