Recently, with the rapidly development of Internet, microblog, among so many socialnetworking platforms, has become one of the most favorite applications. Its rapiddevelopment also makes the microblog data collection and analysis more important, asmining and analyzing these big data can reveal more important information within them.The mining result can also be used in network opinion analyzing, business forecasting, etc.Among them, the collection and storage of information, information extraction micro-blogis the basic work. In this thesis, we present the use of micro-blog data network simulationlog technology and related micro-blog platform based API to collect the micro-blog data,and the crawled data include the account information, micro-blog content, friend list,review information, etc. This thesis compares the performance and testes the dataperformance in different ways, from the processing speed, ease of use, accuracy ratio,comprehensiveness and other aspects of the performance. The experimental and practicalshow that, the simulated technology proposed in this thesis is more comprehensive andfaster, and it can meet the actual demand. |