Font Size: a A A

The Design And Implementation Of Forum Data Analysis System Based On Web Crawler

Posted on:2020-04-23Degree:MasterType:Thesis
Country:ChinaCandidate:X LiFull Text:PDF
GTID:2428330590450653Subject:Software engineering
Abstract/Summary:PDF Full Text Request
Game forum is an important feedback channel for players' suggestions.Usually game developers need to pay close attention to the trend of public opinion in the forum to find out the existing and potential problems in the game.However,the quality of forum data is uneven.Manual collection of valuable post information from a large number of forum posts requires a certain amount of time and effort,and is prone to errors.In order to provide more agile and efficient feedback to the effective information feedback from forum players,how to automatically acquire forum data and make relevant screening analysis of the data has become a key way to improve work efficiency.The system mainly realizes the analysis and processing of game suggestion feedback forum data,including three main modules: data extraction,data analysis and analysis results display.Python development tool PyCharm and MySQL database management system are used in the system development.The relevant Python libraries used mainly include the natural language processing toolkit Jieba library,the operation of MySQL database toolkit PyMySQL library and the wordcloud library for drawing word clouds.The system crawls the forum data content through the Web crawler,parses the data by using the Beautiful Soup database,extracts the data to be analyzed and saves it to the database,and uses the Jieba database to process the Chinese word segmentation of the analyzed data.On this basis,the value of the word segmentation results is scored,and the results are displayed in the form of html.In addition,according to different screening conditions,high-frequency words satisfying the conditions can be displayed through word cloud,so that users can quickly grasp the high-frequency information of the forum.The system effectively extracts and analyses the forum data,and the system makes the forum data acquisition more convenient,fast and intuitive.To a certain extent,it saves the time and energy invested by relevant data followers in manual screening of the content of Posts and improves work efficiency.
Keywords/Search Tags:Web Crawler, Chinese Word Segmentation, Word cloud
PDF Full Text Request
Related items