Research On Biomedical Event Extraction Based On Pre-trained Language Model

Posted on:2021-05-13

Degree:Master

Type:Thesis

Country:China

Candidate:Y Wei

Full Text:PDF

GTID:2404330605452786

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Studying the complex network of relationships between biological entities plays an important role in revealing the mysteries of organisms,and this knowledge information often exists in biomedical literature.How to automatically mine useful structured information from a large number of biomedical literatures has become an important research topic.Biomedical events focus on the specific behavior of biomedical molecules,and define more fine-grained complex relationships than coarse-grained binary relationships,which are of great significance for drug development and disease prevention.Biomedical event extraction has gradually become a research hotspot.The purpose of biomedical event extraction is to identify event trigger and related arguments.In recent years,biomedical event extraction has received more and more attention,methods based on rules,traditional machine learning and neural networks have been proposed successively.However,due to the phenomenon of nested medical events and the fact that the same trigger word may have multiple event types,the event extraction performance of the existing methods is not very high.To this end,this thesis divides event extraction into three stages: trigger recognition,arguments detection and post-processing,and proposes a new event extraction method.Considering that the pre-trained language model can effectively solve the problem of polysemy,and can generate a variety of semantic representations for the same trigger word according to different contexts,two different strategies are used for applying the pre-trained language model to trigger recognition.First,use a pre-trained language model to extract deep contextualized word representations,and then combine the deep contextualized word representations with traditional pre-trained word embeddings into a bi-directional long short-term memory network,and then label them with conditional random fields;the second strategy is to use a pre-trained language model for fine-tuning,the classification results can be directly processed with softmax,can also be labeled by CRF or BiLSTM-CRF.After the trigger recognition has been completed,and then arguments detection is performed.This thesis proposes twomulti-classification models for arguments detection.The first model is based on self-attention and entity attention.Self-attention focuses on different important parts of a sentence,the output of the self-attention is encoded with Bi-LSTM,and combined with entity attention,the semantic information of the sentence can be fully obtained;the second model utilizes the pre-trained model to encode the context information and candidate pair information,and the deep semantic information obtained is more abundant;these sufficient or rich semantic information facilitates the classification of complex relationships.After the trigger recognition and arguments detection are completed,this thesis uses rule-based post-processing method to generate events.Experiments on the MLEE dataset show that the method proposed in this thesis effectively improves the performance of event extraction,especially the extraction of complex nested events.

Keywords/Search Tags:

Biomedical Event extraction, Trigger Recognition, Arguments Detection, Self-Attention, Pre-trained Language Model

PDF Full Text Request

Related items

1	Research On Detecting Biomedical Event And Its Trigger
2	Research On Medical Entity Recognition And Event Extraction For Chinese Electronic Medical Records
3	Research On Depression Tendency Detection Method Based On Text Pre-Trained Model
4	Research And Application Of Multiple Relation Extraction Algorithm For COVID-19 Based On Deep Learning
5	Research Of Antibiotic Resistance Event Extraction Based On Attention Mechanism And Contrastive Learning
6	Extraction Of Chemical-protein Interaction Relationships From Biomedical Texts Based On Deep Learning
7	Research On The Deep Learning Methods Of Biomedical Text Mining
8	Research On Medical Named Entity Recognition Based On XLNet-CRF
9	The Research Of Biomedical Event Extraction Based On Combinational Learning And Self-training
10	Research On The Key Techniques Of Biomedical Text Mining