
Research on Optimizing a Microblog Classification Algorithm for Disease Prevention and Control

Posted on: 2017-03-13; Degree: Master; Type: Thesis
Country: China; Candidate: Z G Wang; Full Text: PDF
GTID: 2348330503482490; Subject: Software engineering
Abstract/Summary:
The Internet, which ushered in the third industrial revolution, has profoundly changed the way people live around the world. In China, microblogging is one of the defining products of the Internet era: it has grown from nothing into a widely used service with nationwide coverage. Organizations can now collect, store, and analyze the massive volume of text messages transmitted over the network. The algorithm in this thesis uses this large-scale data to serve the specific needs of disease prevention and control: it extracts valuable information through timely and accurate statistical analysis, in order to give early warning of outbreaks and to provide data-based guidance for preventing seasonal infectious diseases.

The main work of this thesis is as follows.

First, it briefly analyzes microblogging, unstructured text, and big data technologies, covering the development status and characteristics of microblogs, the techniques involved in each step of processing unstructured text, and the relevant big data and cloud computing technologies.

Second, using the API access granted to a Sina Weibo application, it develops a program to collect a quantity of microblog text. After preprocessing, these texts serve as the training data and the data to be analyzed. The analysis process includes corpus computation, feature extraction, and, as a key part, evaluation of the extracted results.

Finally, according to the specific requirements of the CDC, the analysis results are formatted and presented as an on-screen report. The thesis also briefly describes the Hadoop platform configuration needed to run the classification algorithm, as well as the Map-Reduce algorithm.
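The abstract does not give the details of the Map-Reduce step, so the following is only a minimal sketch of the general map/reduce pattern applied to counting disease-related keywords in microblog posts. It runs in plain Python rather than on Hadoop, and the keyword list, the function names, and the whitespace tokenization are all assumptions made for illustration (real Chinese microblog text would need a word segmenter); none of them come from the thesis.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical keyword list; the thesis does not specify its features.
KEYWORDS = {"fever", "cough", "flu"}


def map_post(post):
    """Map step: emit (keyword, 1) for each distinct keyword in one post."""
    # Assumption: whitespace tokenization stands in for real preprocessing.
    tokens = set(post.lower().split())
    return [(kw, 1) for kw in sorted(tokens & KEYWORDS)]


def reduce_counts(pairs):
    """Reduce step: sum the emitted counts for each keyword."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def count_keywords(posts):
    """Run the map step over every post, then reduce all emitted pairs."""
    return reduce_counts(chain.from_iterable(map_post(p) for p in posts))


# Made-up sample posts, for illustration only.
posts = [
    "I have a fever and a cough",
    "Flu season again",
    "nice weather today",
    "cough cough",
]
counts = count_keywords(posts)
print(counts)
```

Each post is counted at most once per keyword, so the totals are the number of posts that mention a keyword. On a real cluster the map and reduce functions would run as separate distributed tasks, which this single-process sketch does not attempt to show.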
Keywords/Search Tags: Disease prevention and control, microblogging text, Hadoop technology