Abstract,Rationale,Stance:a Joint Learning For Scientific Claim Verification

Posted on:2023-01-19

Degree:Master

Type:Thesis

Country:China

Candidate:Z W Zhang

Full Text:PDF

GTID:2568306623952259

Subject:Computer Science and Technology

Abstract/Summary:

Scientific claim verification is a type of fact verification that aims to verify the authenticity of a proposed claim.Due to the rapid growth of the scientific literature,it is difficult for scientists to stay up-to-date with the latest findings.This challenge is especially acute during pandemics,such as the current Covid-19 pandemic,due to the risk of making decisions based on outdated or incomplete information.Therefore,the study of scientific claim verification systems has strong research significance and practical application value.Most existing work is dominated by pipeline models,which have the problem of error propagation.Therefore,this thesis proposes an approach called ARSJoint,which jointly learns modules for three tasks in a machine reading comprehension framework by including claim information.Meanwhile,to address the lack of information exchange between modules in the pipeline model,this thesis introduces a regularization term between the sentence attention score of abstract retrieval and the estimated output of rational selection to enhance the information exchange and constraints among tasks.Experimental results on the Sci Fact dataset show that the proposed joint learning approach has good performance and can effectively curb the error propagation problem.The performance of candidate abstract retrieval before claim verification determines the performance of subsequent claim verification models.Most existing candidate abstract retrieval methods are based on traditional machine learning methods,which have significant limitations.Based on this,this thesis trained an additional ARSJoint model for candidate abstract retrieval.However,neural network-based abstract retrieval methods often take too long,so the ARSJoint model for abstract retrieval was distilled into a smaller model to improve the retrieval speed.Experimental results demonstrate that the neural network-based abstract retrieval method achieves better performance than traditional machine learning methods.As the Sci Fact dataset is a small fact verification dataset,it has the problems of overfitting and poor generalization.To address this issue,this thesis proposes a method to pre-train the model on a similar large fact verification dataset called FEVER.The model is first pre-trained on the FEVER dataset to obtain generic knowledge,and then it is fine-tuned using the Sci Fact dataset for better handling of this specific dataset.Experimental results show that the pre-trained and then fine-tuned model achieves better performance on the SciFact dataset.

Keywords/Search Tags:

Fact verification, Joint learning, Pipeline model, Pre-training, Model distillation

Related items

1	Research And Implementation Of Fact Verification For Tabular Data
2	Research And Implementation Of Fact Verification Technology On Social Media
3	Research On Financial Text Generation Method Based On Knowledge Distillation And Pre-training Model
4	Ontology-Based Dialogue State Tracking And Its Knowledge Distillation Method
5	Research And Implementation Of Compression Algorithm For Multilevel Knowledge Distillation Model Based On Feature Map
6	Research On Key Methods Of Parallel Machine Learning Model Training
7	Research On Model Distillation Via Diversity Knowledge Transfer
8	A Study Of Model Compression Approaches To Deep Learning-based Sequence Models
9	Research On Construction Of Chinese Enterprises External Image Based On "Fact-value" Model
10	Research On Cross-Language Sentiment Analysis Method Based On Pre-training Model