
Research On Text Similarity Based On Bert

Posted on: 2021-02-22
Degree: Master
Type: Thesis
Country: China
Candidate: J S Xie
Full Text: PDF
GTID: 2428330611965581
Subject: Computer technology
Abstract/Summary:
The study of semantic textual similarity is an important task in natural language processing. By accurately recognizing semantic textual similarity, we can obtain more accurate encodings and representations of texts such as short phrases and sentences, which in turn advances other natural language processing tasks.

Deep network models are currently the mainstream approach to natural language processing tasks. For semantic textual similarity, deep learning work falls broadly into two categories: the first modifies the network structure so that the model better fits the semantic textual similarity problem; the second learns a universal or task-specific sentence representation and then predicts semantic similarity through cosine similarity or fully connected layers. Methods of the first type often work well on a particular kind of dataset, but their performance hits a bottleneck and does not generalize to other learning tasks. Methods of the second type are convenient to apply to different learning tasks and generally perform well, but they must be trained on a large-scale corpus; moreover, a generic sentence representation may overlook small changes in a sentence's word order or vocabulary, so the intended meaning is not well captured.

To address these shortcomings, this thesis proposes TSBert, a new model based on the Bert pre-trained language model that is better suited to semantic textual similarity tasks. The model uses a layered structure: an input layer, an adaptation layer, a coding layer, a fusion layer, and an output layer. The input layer processes input in the same way as the pre-trained Bert model. The adaptation layer uses adapters to reduce the number of trainable parameters while making it easier to update the parameters of the new network layers. The coding layer extracts the CLS vector from the adapted Bert model and, in parallel, adds a new feature-processing network that combines the feature-extraction capabilities of RNNs and CNNs to train new feature vectors. The fusion layer merges the two feature vectors from the coding layer into a single representation for the semantic textual similarity task. The output layer applies three fully connected layers with corresponding activation functions to perform the final feature selection on the fused vector, and the model parameters are updated using mean squared error as the loss function.

TSBert thus considers both a general sentence representation and a local feature-extraction network, R-CNN; the combination allows the model to perform well on semantic textual similarity tasks. The R-CNN network combines the characteristics of convolutional and recurrent neural networks to better identify subtle semantic changes in text. To verify the performance of TSBert, semantic textual similarity experiments were conducted on the SICK dataset and the STS benchmark dataset. On both datasets, the Pearson and Spearman correlation coefficients used for performance evaluation reach state-of-the-art accuracy among currently published results.
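To make the layered design concrete, the following is a minimal PyTorch sketch of the architecture described above, assuming the HuggingFace transformers library. The adapter bottleneck size, the RNN/CNN hyperparameters, fusion by concatenation, and the head dimensions are illustrative assumptions not specified in the abstract; real adapters are also typically inserted inside every transformer layer, rather than applied once to the final hidden states as done here for brevity.

```python
# Minimal sketch of the TSBert layered design described in the abstract.
# Assumes HuggingFace transformers and PyTorch; layer sizes, the adapter
# design, and the fusion scheme are illustrative guesses, not the thesis code.
import torch
import torch.nn as nn
from transformers import BertModel


class Adapter(nn.Module):
    """Bottleneck adapter: a small trainable module added so that the Bert
    backbone can stay frozen (the abstract's 'adaptation layer')."""
    def __init__(self, hidden: int = 768, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden, bottleneck)
        self.up = nn.Linear(bottleneck, hidden)

    def forward(self, x):
        return x + self.up(torch.relu(self.down(x)))  # residual connection


class RCNNBranch(nn.Module):
    """Local-feature branch combining an RNN and a CNN, as the abstract
    describes for the coding layer."""
    def __init__(self, hidden: int = 768, rnn_hidden: int = 256,
                 n_filters: int = 128, kernel: int = 3):
        super().__init__()
        self.rnn = nn.GRU(hidden, rnn_hidden, batch_first=True,
                          bidirectional=True)
        self.conv = nn.Conv1d(2 * rnn_hidden, n_filters, kernel, padding=1)

    def forward(self, token_states):                        # (B, T, H)
        seq, _ = self.rnn(token_states)                     # (B, T, 2*rnn_hidden)
        feats = torch.relu(self.conv(seq.transpose(1, 2)))  # (B, F, T)
        return feats.max(dim=2).values                      # max-pool over time


class TSBert(nn.Module):
    def __init__(self):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        for p in self.bert.parameters():        # freeze the backbone; only
            p.requires_grad = False             # the new modules are trained
        self.adapter = Adapter()
        self.rcnn = RCNNBranch()
        # Output layer: three fully connected layers with activations,
        # ending in a single similarity score.
        self.head = nn.Sequential(
            nn.Linear(768 + 128, 256), nn.ReLU(),
            nn.Linear(256, 64), nn.ReLU(),
            nn.Linear(64, 1),
        )

    def forward(self, input_ids, attention_mask):
        hidden = self.bert(input_ids, attention_mask=attention_mask
                           ).last_hidden_state  # (B, T, 768)
        hidden = self.adapter(hidden)
        cls_vec = hidden[:, 0]                  # CLS sentence vector
        local = self.rcnn(hidden)               # R-CNN local features
        fused = torch.cat([cls_vec, local], dim=-1)   # fusion layer
        return self.head(fused).squeeze(-1)     # similarity score


# Training uses mean squared error against gold similarity scores,
# matching the loss function named in the abstract.
loss_fn = nn.MSELoss()
```

Evaluation with the two metrics named in the abstract could then look like the snippet below; the score lists are placeholders standing in for model predictions and gold labels.

```python
# Pearson and Spearman correlations between predicted and gold scores.
from scipy.stats import pearsonr, spearmanr

pred = [0.9, 0.2, 0.7]   # hypothetical model outputs
gold = [1.0, 0.1, 0.8]   # hypothetical gold similarity scores
print(pearsonr(pred, gold)[0], spearmanr(pred, gold)[0])
```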
Keywords/Search Tags:Natural language processing, Text similarity, Pre-trained language models, Deep learning