Research On Text Spelling Check Based On Contrastive Learning And Multi-Task Learning

Posted on:2023-11-15

Degree:Master

Type:Thesis

Country:China

Candidate:Z Y Lin

Full Text:PDF

GTID:2558307070483894

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

With the development of new media technology,more and more information is disseminated on the Internet with text as the carrier,and these texts sometimes fail to convey the information accurately due to spelling errors and other reasons and even lead to misinterpretation of the original meaning and deviation from the core socialist values.The text spelling check method based on deep learning has become a hot research topic in recent years.Therefore,this paper focused on text spelling check methods based on contrastive learning and multi-task learning.Firstly,to address the problem that current pretrained models mainly extract semantic features and lack consideration of phonological and visual information,this paper proposed a spelling check pretrained model CLBERT(Contrastive Learning BERT).The model incorporates both phonological and visual knowledge in the pretraining process and combines the mask optimization with the confusion set and confusion word frequency so that the final encoded text vector can contain semantic,phonological,and visual information at the same time,and the pretrained model can be more suitable for text spelling check.Secondly,to address the problems that existing text spelling check methods tend to ignore global information and insufficient learnable information.This paper proposed a correct sentence discrimination auxiliary task,and through a multi-task joint learning framework,the correct sentence discrimination,spelling error detection,and spelling error correction are jointly optimized to provide the model with global and local information,error location information and semantic information,respectively,to alleviate the problem of insufficient information learnable to the model in the training process.Further,the network hierarchical sharing structure is designed according to the task characteristics to improve the model performance on the text spelling check.Finally,this paper conducts experiments on the Chinese spelling check dataset SIGHAN,and verifies that the proposed method outperforms existing methods and can effectively improve the performance of spelling error detection and correction by comparing with various methods.

Keywords/Search Tags:

Contrastive Learning, Pre-training, Multi-Task Learning, Text Spelling Check

PDF Full Text Request

Related items

1	The Research On Text Summarization Based On Pre-trained Models
2	Chinese Spelling Check Based On Neural Network
3	Research On Text Classification And Short Text Clustering Technology Based On Contrastive Learning
4	Telecom Complaint Text Classification Based On Adversarial Training And Contrastive Learning
5	Research On Deep Learning Error Correction Method Of Chinese Text
6	Research On Chinese Spelling Check Technology Based On Machine Learning
7	Research On Sentence Representation Based On Contrastive Learning And Deep Neural Network
8	Towards Multi-Document Driven Task-Oriented Dialogue
9	Research And Application Of Text Sentiment Analysis Based On Deep Learning
10	Research On Scene Image Recognition And Segmentation Based On Contrastive Learning