Research And Application Of Large-Scale Distributed System Monitoring Technology

Posted on:2018-08-28

Degree:Master

Type:Thesis

Country:China

Candidate:S C Feng

Full Text:PDF

GTID:2348330512983443

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Distributed system become the mainstream of large-scale websites and applications because of its scalability and fault-tolerance.Distributed tracing system and distributed performance monitoring system play important roles in distributed system for failure diagnosis,resource monitoring and system stability.However,there are also fundamental challenges that are unique to distributed system,including inefficient failure diagnosis,low-value data collection and high overhead during monitor data search.The contributions of this paper are concluded as follows:This paper proposes a tail-based sampling schema.Distributed tracing system for large scale situation,abnormal data take very few part.Traditional sampling schema cost high overhead to take reduce operation for collecting call chain.The tail-based sampling schema proposes every component judge the call separately,thus reduce the overhead of completing the chain only in abnormal situation.This paper proposes a failure diagnosis method of call chain based on decision tree.Call chain's failure is difficult to quickly and accurately diagnose.In this method,feature extraction is carried out on the known abnormal call chain data,and diagnose the cause of collected failures quickly.This paper proposes an efficient data index mechanism which optimizes time-series data aggregation.Since performance monitor data of distributed system is time-series data.This mechanism combine with synopsis forest which is an effective time-series data aggregation algorithm.Then combine Hbase mechanism optimizes the aggregation query speed and designed a hash mechanism to solve Hbase distributed hot pot problem.In conclusion,the paper introduces the JTang Tracer(Distributed Tracing System),which can trace and analyze the call chain and display visually.Optimized the overhead during call chain collecting and time-series data's aggregation operations,and proposes a schema for distributed system failure diagnosis.

Keywords/Search Tags:

Distributed tracing system, Call chain, Monitor sampling, Failure diagnosis, Aggregation

PDF Full Text Request

Related items

1	The Automotive Failure Analysis And Diagnosis System Research And Application Based On Industry Chain Collaboration Platform
2	Design And Implementation Of Distributed Tracing Alarm Diagnosis System
3	Network Fault Diagnosis Based On OpenFlow
4	Design And Implementation Of Distributed Systems Tracing Infrastructure Based On Adaptive Sampling
5	Failure Diagnosis Hardware System And Realization Of Bearing Fatigue Life Test
6	Design And Implementation Of Monitoring System Oriented Distributed Call Center
7	Design And Implementation Of Failure Diagnosis System In Radio Receiver
8	Research And Application Of The High-pressure Spray Descaling System's Status Monitor And Faults Diagnosis System Based On Distribute Monitor And Diagnosis System
9	Service Failure Diagnosis Of Service Function Chain In NFV
10	Study And Application Of Intelligent Failure Diagnosis Methods For The Coal Preparation Plant Equipments