Font Size: a A A

The Design And Implementation Of Real-time Log Collection And Analysis System For Multi-data Centers

Posted on:2019-12-18Degree:MasterType:Thesis
Country:ChinaCandidate:C ZhouFull Text:PDF
GTID:2428330590975228Subject:Software engineering
Abstract/Summary:PDF Full Text Request
In the era of big data,real-time collection and analysis of log data is gradually becoming an important requirement of various Internet companies.The user behavior and system operation status recorded in the log not only help the enterprise to achieve digital operation,but also is the main reference information for back-end R&D and operation and maintenance personnel to diagnose online issues.With the continuous development of the business,the system volume has also continued to expand,making the log collection and analysis of this relatively simple task become more and more complex.In enterprise applications,log files are distributed and stored in various data centers.At this stage,neither the collection nor the analysis of logs can meet the increasing real-time requirements.Therefore,it is urgent to develop a log service platform to collect,analyze,and store relevant information in real time.The research content of this thesis mainly includes:(1)Combined with the company's existing log system issues and new application requirements,the architecture of the log service platform was redesigned,and convenient,reliable,and low-cost design requirements were proposed.(2)Based on Flume's log collection module,data filtering,multi-threaded log collection,interceptor personalized configuration and automatic restart of the daemon process are realized,which improves the efficiency and reliability of log collection.(3)Based on Kafka's message middleware module,improved producer and consumer client,realizes message idempotency design,cluster message routing and message synchronization function across data centers,making message delivery and storage more efficient,Stable and scalable.(4)Based on Storm real-time analysis module,it realizes the extraction,filtering,parsing and storage of logs,and completes the notification and statistics of abnormal information,and achieves real-time log monitoring,personalized recommendation and real-time analysis of user behavior.(5)Forming a visual log management platform,greatly simplifying the user's operation process and providing an efficient and reliable visualization service.The system mainly solves the problems of real-time log collection,message asynchronous communication,log real-time analysis and result display,and realizes collection,aggregation,analysis,storage and display of multiple data center logs.It provides users with a distributed,high-performance and scalable one-stop log service platform to help the company improve operation and maintenance efficiency and reduce enterprise costs.
Keywords/Search Tags:log data, real-time collection, real-time analysis, message middleware
PDF Full Text Request
Related items