Jin Yong is one of the most prominent Chinese novelists of the 20th century, and his martial arts novels are undoubtedly of a very high standard. The love and hatred in his novels have long been a topic of discussion. Language is the carrier and transmitter of moving stories and rich ideas, and the language of Jin Yong’s novels has its own distinctive style. At present, academic work on Jin Yong’s novels focuses mostly on literature, aesthetics, and related perspectives; research using quantitative methods is especially scarce. Drawing on the theories and methods of statistics, this article uses quantitative methods to analyze Jin Yong’s 15 novels comprehensively and objectively and to characterize their language style. It also analyzes quantitatively a work of disputed authorship, "Wolong Ji", to judge whether Jin Yong wrote it. The article examines the language style of the texts at four levels of linguistic unit: words, vocabulary, sentences, and paragraphs, with different statistical indicators selected for each level. In addition, text similarity is calculated and tested using a method that combines the TF-IDF and LSI algorithms with cosine similarity, alongside sentiment analysis and the chi-square test.
The content of this article falls into two parts: a study of the language style of the 15 works acknowledged as Jin Yong’s, and the authentication of the disputed work "Wolong Ji". For the 15 acknowledged works, the statistical results show that average word length, word length dispersion, vocabulary density, average sentence length, sentence length dispersion, and average paragraph length differ across the novels and follow no regular pattern. Punctuation, number of words used, vocabulary richness, part-of-speech distribution, and sentence length distribution, by contrast, show certain regularities. In the use of punctuation marks, the 15 novels are highly consistent. The number of words used is relatively stable among the short novels and among the super-long novels. In vocabulary richness, short and medium-length novels are more lexically diverse than long novels, and novels of similar length differ little. In part-of-speech distribution, the internal ordering of content words and the internal ordering of function words are consistent across the 15 novels. In sentence length distribution, the novels follow similar patterns. The text similarity results show that the short story "Yuenv Jian" has the lowest similarity to the other Jin Yong novels, far below the similarity among the others. Among the six short novels, "A Deadly Secret" and "Ode to Gallantry" have relatively low similarity to the other four, while the other four, such as "Flying Fox of Snowy Mountain", have relatively high similarity to one another; similarity is to some extent consistent with the time of writing. Among the six super-long novels, the "Sculpture Trilogy" novels are highly similar to one another, and the similarity between the most recently serialized novel, "The Deer and the Cauldron", and the other novels is relatively high. In sentiment analysis, Jin Yong’s novels show a generally neutral emotional tendency.
As for the authentication of the disputed work, the statistics for "Wolong Ji" were compared with those for Jin Yong’s novels. "Wolong Ji" ranks among the highest in number of words used, word length distribution, type-token ratio, vocabulary density, and frequency. It shows no significant difference from Jin Yong’s novels in language features such as the 100 words examined, part-of-speech distribution, average sentence length, and sentence length distribution. It differs significantly in the use of punctuation marks, average word length, word length dispersion, unique word frequency, word frequency distribution, sentence length dispersion, average paragraph length, similarity calculation, and sentiment analysis. Of these, text similarity calculation is a highly credible method of text discrimination. Based on the statistical results and the analysis of each part, the disputed work "Wolong Ji" is unlikely to be by Jin Yong.
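
The indicators named above can be illustrated with a minimal Python sketch. It is not the article's implementation: it assumes the texts are already segmented into word tokens, takes "dispersion" to mean the population standard deviation, uses the number of distinct words over total words as a stand-in for vocabulary richness, and applies one common smoothed TF-IDF weighting. The LSI step (a truncated SVD of the TF-IDF matrix) is omitted, so cosine similarity is computed directly on TF-IDF vectors. All function names and the toy data are hypothetical.

```python
import math
from collections import Counter
from statistics import mean, pstdev


def length_stats(units):
    """Return (average length, population standard deviation) of the units.

    `units` can be words, sentences, or paragraphs given as strings or lists.
    Using the standard deviation as the "dispersion" measure is an assumption.
    """
    lengths = [len(u) for u in units]
    return mean(lengths), pstdev(lengths)


def type_token_ratio(tokens):
    """Distinct tokens divided by total tokens (a simple richness measure)."""
    return len(set(tokens)) / len(tokens)


def tfidf_vectors(docs):
    """Map each tokenized document to a {term: weight} dictionary.

    Weight = relative term frequency * smoothed inverse document frequency.
    This is one common TF-IDF variant, chosen here for illustration only.
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Toy, made-up token lists standing in for segmented novels.
novel_a = ["sword", "hero", "mountain", "sword", "hero"]
novel_b = ["sword", "hero", "river", "hero"]
disputed = ["dragon", "palace", "dragon", "king"]

vec_a, vec_b, vec_d = tfidf_vectors([novel_a, novel_b, disputed])
print("a vs b:", cosine_similarity(vec_a, vec_b))
print("a vs disputed:", cosine_similarity(vec_a, vec_d))
print("type-token ratio of a:", type_token_ratio(novel_a))
print("word length stats of a:", length_stats(novel_a))
```

In this toy setup, a disputed text that shares little vocabulary with the reference texts scores a low cosine similarity against them. A real study would segment the Chinese text first and, to follow the method described above, project the TF-IDF vectors into an LSI topic space before comparing them.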