With the popularization of the Internet,the number of Chinese texts has shown a rapid growth trend.In the process of producing Chinese texts,a large number of errors have occurred due to many factors such as manual input errors and algorithm recognition errors,which greatly affect the readability and usability of Chinese texts.Therefore,Chinese text error correction has attracted more and more attention,which has an important impact on speech recognition,dialogue system and other related research.Traditional Chinese text error correction methods can only solve a single type of Chinese text errors,but in actual text data,there may be multiple different types of errors in a sentence.Therefore,how to design an effective Chinese text error correction method to automatically detect and correct many types of text errors in Chinese text is the focus of this article.Chinese text error correction methods for multiple error types need to solve the problem of inaccurate error positioning and multiple types of errors coexist.In view of the above problems,this paper proposes a Chinese text error detection method based on local semantics and a Chinese text correction integration method for multiple error types.Aiming at the problem of inaccurate error positioning,the Chinese text error detection method based on local semantics enhances the local semantic correlation of word vectors on the basis of the pre-trained language model,thus improving the performance of error detection.Experiments show that the F1 values of the detection level,recognition level and positioning level on the publicly evaluated data set have increased by an average of 0.94%,3.74% and5.03% respectively compared with the comparison baseline model F1,achieving the goal of improving the effect of text error detection.In view of the correction problem of multiple types of errors coexisting,the Chinese text correction integration method for multiple types of errors builds a text correction method for single type of errors for spelling errors,redundancy errors,missing errors and word order errors,and then establishes the correction priority of text errors by analyzing the impact of various error types on the overall performance of text correction,and finally integrates the single type of error correction methods according to this priority.Experiments on the public data set have verified the effectiveness of correcting the integration method.Finally,according to the above Chinese text error detection and correction methods,the Chinese text error correction prototype tool is designed and implemented to provide an overall technical solution for many types of Chinese text error correction tasks. |