Font Size: a A A

Tracing The Source Of Microblog's Hot Topics Based On Propagation Path

Posted on:2020-05-19Degree:MasterType:Thesis
Country:ChinaCandidate:F X ZhouFull Text:PDF
GTID:2428330620960035Subject:Information and Communication Engineering
Abstract/Summary:PDF Full Text Request
With the rapid development of mobile Internet,online social platforms are used by more users and play an increasingly important role in information dissemination.Many topics have become hot spots through the spread of social software,causing widespread concern and discussion.Therefore,it is of great significance to explore and grasp the trend of public opinion,combat illegal speech,and maintain a harmonious and orderly Internet environment for the hot topic of Sina Weibo,the mainstream social platform in China.The research on Sina Weibo mainly includes two parts,the mining of hot topics on Weibo and the tracking and traceability of hot topics on Weibo.The latter part of the work is based on the results of the first half.The main work and achievements of this thesis are as follows:1)For the mining of hot topics on Weibo,the LDA model is used to deal with the vector sparsity problem of microblog short texts.The paper first combines the microblog texts with similar label semantics to increase the length of the text to be modeled,and then use LDA.The model is modeled,and the K-Means clustering algorithm is further used to cluster the modeled texts to get hot topics.Experiments on the real topic data of Sina Weibo show that the method can effectively reduce the confusion of LDA model and improve the accuracy of topic mining.2)For the traceability of the hot topic of Weibo,this paper constructs the Weibo propagation path and uses the Page Rank algorithm to calculate the most influential users in the propagation path,and uses the user as the source of the topic.The propagation path is divided into explicit forwarding and implicit forwarding.When a microblog is forwarded through the forwarding function of the Sina Weibo platform,the microblog is considered to be explicitly forwarded and the explicit forwarding path is determined.For microblogs that are not explicitly forwarded,this paper calculates the probability of implicit forwarding by text similarity and release time correlation.When the implicit forwarding probability is greater than the set threshold,Weibo is considered to be propagated by implicit forwarding.For the case that there may be an implicit forwarding probability between a microblog and multiple microblogs being greater than the threshold,this paper determines the microblog published by the user with the highest similarity as the microblog that is implicitly forwarded.After constructing the propagation path,this paper uses Page Rank algorithm to calculate the most influential users in the propagation path,so as to trace the source of microblog hot topics.
Keywords/Search Tags:Sina Weibo, hot topic, propagation path, traceability
PDF Full Text Request
Related items