Font Size: a A A

Research On Data Mining And Task Scheduling Mode Based On Cloud Computing

Posted on:2018-07-30Degree:MasterType:Thesis
Country:ChinaCandidate:T ZhaoFull Text:PDF
GTID:2348330518476358Subject:Statistics
Abstract/Summary:PDF Full Text Request
With the continuous reform and innovation of Internet technology,social and economic development and progress,big data cloud computing technology in people's work life are more and more widely used.In order to meet the needs of users,Internet companies grasps the dynamic development of the market information through big data cloud computing technology,and clear their own business development goals,to provide users with better service.This paper mainly studies the model of data mining combined with task scheduling in cloud computing environment.Firstly,the valuable data of the web page is extracted by data mining technology,and then the text data is used to integrate the same data.Finally,an efficient scheduling algorithm is proposed to provide information for the users.The scheduling algorithm is the core of this model.The extraction of web page text data is the input of text classification algorithm and scheduling algorithm.The text classification is the preprocessing step of the subsequent scheduling algorithm.The contents of this paper are as follows:(1)This dissertation propose a combination of data mining and task scheduling model,more efficient and timely to provide services for users.(2)Text mining and text classification in the cloud environment.Using the crawler principle to obtain web data,achieve in the cloud data classification under the parallel processing in Hadoop platform.(3)A algorithm of cloud computing is proposed.A balanced factor model is designed,and the clonal operator of the immune algorithm is used to propose a balanced clone scheduling algorithm.
Keywords/Search Tags:Cloud computing, Data mining, Text classification, Task scheduling
PDF Full Text Request
Related items