Font Size: a A A

Research On Multidimensional Performance Data Prediction And Anomalies

Posted on:2015-05-31Degree:MasterType:Thesis
Country:ChinaCandidate:Jean Steve HIRWAFull Text:PDF
GTID:2308330476452887Subject:Computer Science and Engineering
Abstract/Summary:PDF Full Text Request
Resource availability often changes continuously in the cloud environment. Therefore, schedulers demand resource prediction help to make productive scheduling decision. Comparatively, the scheduler needs a long time load prediction information to guide scheduling process because in cloud computing tasks usually take a significant amount of time to execute, even if most proposed prediction approaches are directed to one step ahead or short term load prediction. As sometimes resources may be distributed and heterogeneous, those proposed methods could not lead to the good outcome when making long-term prediction. Most of the resource prediction methods were designed with the goal of getting the prediction much accurate. However, in some cases, priorities are given to long-term prediction.Anomaly detection techniques have been developed for some specific fields, while others were developed for general purpose. By applying those techniques we can judge and di?erentiate whether a data is representing a normal or anomalous behavior. This happens when an anomaly detection approach define a region that is demonstrating a normal behavior and provides a region where the data does not belong to the defined normal area as anomaly. Some techniques detect anomalies in an unlabeled test data set under supposition that the majority of instances in the data are normal by investigating instances. Other techniques demand a data set that has been labeled as normal or abnormal and use training as classifier. There are also other techniques that construct a model showing normal behavior from given normal training data set.Researches have been carried out in di?erent features(CPU load, Memory usage, disk spaces and network tra?c). Many prediction models were evolved to predict single or multidimensional data. In this research, we will mainly focus on finding out new models and analyze prediction behavior from di?erent multi-resource and determine the highest accuracy. In order to enhance performance, anomalies or outliers will not be ignored.
Keywords/Search Tags:Machine learning · Time Series · Statistics
PDF Full Text Request
Related items