The Research And Improvement Of Fault Monitoring Based On Cloud Platform

Posted on:2015-08-22

Degree:Master

Type:Thesis

Country:China

Candidate:N N Zhu

Full Text:PDF

GTID:2298330467463094

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

With the popularity of internet technology and the improvement of information technology, the demands of information are higher and higher from all the areas of the society, the data that will be processing is also increasing. Cloud computing has been applicated from the concept to the practical application. It’s development has matured, it has been developed for the customization available cloud, scalable cloud, service-oriented cloud of private or public cloud. The quality of service is very important for cloud platform, monitorning is an important part of the cloud computing platform, it is the premise of network analysis, systems management, job scheduling, load balancing, event prediction, fault detection and recovery operations. It can help dynamic resource use, testing services defects, found the patterns of users, resource scheduling module auxiliary decision of cloud platform. It can improve the quality of service on cloud platform.BC-PDM(Big Cloud of Parallel Data Mining) is a system that based on the world’s largest telecom companies in the business intelligence applications, and it is designed for mass data to provide efficient, accurate and conveninent data analysis services. The system is developed based on Hadoop cluster, this paper describes the research and implementation of fault monitorning based on cloud platform.This paper describes the background and the status of the research firstly, then for the demands of the project itself, give the overall design and the design of each module design. This article uses the open source monitorning tools of Ganglia and Nagios. Through the in-depth research of the monitorning tools, I summarizes its working principle and the advantages and disadvantages and so on. The system will combine the advantages of ganglia and Nagios and optimization the fault tolerance mechanisms of ganglia simultaneously. The system achieve fault monitoring and resource monitoring. Ganglia and Nagios both have some problem on the storage of monitoring data. The system makes the monitoring data to mysql database through the persistent storage tool and make the data management and data analysis, optimization the storage problems of monitoring data.The system uses the open source montoring tools of Ganglia and Nagios, then make the requirements analysis of the system, the key research of the system, finally completed the resource monitoring and fault monitoring. The system realized the physical resources, virtual resources, services and resources for comprehensice monitoring and analysis of resource utilization. According to the analysis,it achieve the fault monitoring through email, text messages and other means to monitor the source and fault. To ensure the normal operation of the cloud platform.Finally applying the research all above, the cloud platform monitoring system is implemented, and its running results show that this strategy is feasible and effective.

Keywords/Search Tags:

cloud platform resource, monitoring, faultmanagement, Ganglia

PDF Full Text Request

Related items

1	Research And Implementation Of Cloud Monitoring System Based On Ganglia
2	Approach To Resource Monitoring And Evaluation In Situation Of Cloud Computing Platform
3	A Large-scale Resource Monitoring And Safety Assessment System In Cloud Compute Environment
4	Design And Implementation Of Resource Monitoring System Based On Cloud Computing Platform
5	Hadoop Cluster Monitoring System Based On Ganglia
6	Research And Implementation Of Mointoring System For Cloud Resource
7	Design And Implementation Of Cloud Platform Resource Monitoring Management System
8	Research And Implementation Of Elastic Cloud Platform Management And Monitoring System
9	Research And Implementation Of Open Source Cloud Platform Monitoring Visualization And Resource Allocate
10	Research And Implementation Of Resource Monitoring Technology Based On Cloud Computing Platform