Font Size: a A A

Research On The Technology Of Gathering And Distributing Large-capacity Real-time Early Warning Data Based On Kafka

Posted on:2022-02-25Degree:MasterType:Thesis
Country:ChinaCandidate:S Y CheFull Text:PDF
GTID:2518306524994519Subject:Master of Engineering
Abstract/Summary:PDF Full Text Request
With the rapid development of information warfare,the means and capabilities of obtaining early warning data have been improved,and data construction in the field of early warning in my country has received extensive attention.The main problems currently facing include: the traditional transmission of intelligence data has not met the characteristics of large data volume,diverse data types,and high data transmission efficiency of the current early warning data;early warning information systems established by various departments have formed data barriers and a large number of early warnings.It is difficult to efficiently integrate and share data resources.Therefore,it is necessary to establish a basic platform for the unified collection and distribution of data resources.Based on the actual project requirements of the China Institute of Electronics Science,this thesis has carried out research on the collection and distribution of large-capacity real-time early warning data based on Kafka.The main work is summarized as follows:(1)Based on the analysis of the actual application requirements of the early warning data system,the Kafka-based early warning data collection and distribution system architecture was built,focusing on the core modules of the system-security management,data collection,data collection and distribution,data storage,and collection and distribution Management and data retrieval are designed.(2)Aiming at the problems of native load balancing in Kafka clusters,a dynamic load balancing algorithm is proposed,which uses the load indicators collected during the operation of each agent node to calculate the load value,and gives the load balancing strategy of Leader migration and replica migration.Value testing and flow monitoring verify the effectiveness of the proposed method.Aiming at the problem of low efficiency of massive data query,a data retrieval optimization method based on Elastic Search is proposed,which builds a secondary index through Elastic Search to improve the efficiency of data retrieval.Through data writing and retrieval tests,it can well meet the requirements of early warning data storage and retrieval.(3)According to the architecture of the early warning data collection and distribution system,specific implementations are given for the security management module,data collection module,data collection and distribution module,data storage module,collection and distribution management module,and data retrieval module.On this basis,a test environment for the collection and distribution system of early warning data based on Kafka was built to test the functional requirements and performance indicators to verify the correctness of the system design.
Keywords/Search Tags:Early Warning Data, Data collection and distribution, Kafka, Load Balancing, Secondary Index
PDF Full Text Request
Related items