Identify High-Usage Literature Based On The Characteristics Of The Paper

Posted on: 2020-03-23
Degree: Master
Type: Thesis
Country: China
Candidate: N Zhao
GTID: 2428330596468127
Subject: Information Science
Abstract/Summary:
Citation frequency is the most widely used index in the evaluation of academic papers. At the same time, scholars are paying increasing attention to downloads, clicks, and other usage-based indexes of online papers. The Usage index published by the WoS platform offers a new perspective on paper evaluation. This study compares high-Usage and low-Usage papers on external factors (the author, the publishing journal, and the research institution) and internal factors (the length of the paper, the number of authors, and the number of references), and attempts to predict high-Usage papers through machine learning, in order to explore what causes high Usage.

The original data are the bibliographic records of 11,008 papers published in 2013 under the WoS subject category "COMPUTER SCIENCE ARTIFICIAL INTELLIGENCE". The papers were sorted in descending order of Usage index, and the top 5%, a total of 550 papers, were selected as the high-Usage collection; their bibliographic information in full record format was downloaded as the high-Usage data set. A further 550 papers were selected in order of publication time, and their bibliographic information in full record format was downloaded as the low-Usage data set. All data were downloaded in November 2018, giving bibliographic information for 1,100 papers as the original data. Then, based on the author's institution and journal information, indicator data for institutions and journals were obtained from the WoS ESI database and the JCR annual reports, respectively.

The research found that:

(1) Characteristics of the paper itself: the co-authorship rate was relatively high in both collections, and high-Usage papers had significantly more references than low-Usage papers, while the number of authors and the length of the paper showed no significant difference between the two collections.

(2) The author of the paper: whether measured by number of publications, h-index, or total citation frequency, authors of low-Usage papers are clearly weaker than authors of high-Usage papers.

(3) The institution of the paper: there is no significant difference between the institutions of the two collections in publication count, total citation frequency, or citations per publication. Unlike the author's quantitative indicators, the influence of the author's institution on the value of the paper does not reach a significant level.

(4) The journal of the paper: the total citation frequency of journals publishing high-Usage papers was significantly higher than that of journals publishing low-Usage papers. Journals of high-Usage papers were also slightly higher in cited half-life, but there was no significant difference in the impact factor (JIF), the publication count, or the impact value of the paper. Overall, high-Usage papers are characterized by many references, a solid research foundation, influential authors, influential journals, relatively recent citations, and slow aging.

(5) Prediction: a comparison of models shows that the decision tree classifier predicts the Usage index best; the model fits well, and its prediction accuracy on the test set exceeds 87%.
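As an illustration of the sampling step described in the abstract (rank by Usage, keep the top 5%), the following is a minimal Python sketch. It is not the thesis's own code: the record fields `id` and `usage`, the function name `split_by_usage`, and the synthetic usage counts are all assumptions made for the example, and the abstract does not say how ties or rounding were handled. The sketch only confirms that 5% of 11,008 records, rounded down, gives the 550 papers reported.

```python
def split_by_usage(papers, top_percent=5):
    """Rank records by usage count (descending) and split off the top share.

    `papers` is a list of dicts with a hypothetical "usage" field.
    Returns (high_usage, remainder). The cutoff is rounded down.
    """
    ranked = sorted(papers, key=lambda p: p["usage"], reverse=True)
    cutoff = len(ranked) * top_percent // 100  # integer arithmetic, floors
    return ranked[:cutoff], ranked[cutoff:]


# Synthetic stand-in for the 11,008 WoS records (usage values are made up).
papers = [{"id": i, "usage": i % 97} for i in range(11008)]

high, rest = split_by_usage(papers)
print(len(high), len(rest))
```

Because `sorted` is stable, records with equal usage keep their input order, so a tie at the cutoff is broken by original position; a real analysis would need to state its own tie rule.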
Keywords/Search Tags: High Usage, Literature, Prediction, Influencing Factor, Cause Analysis