| In order to ensure the safety of people’s dining tables,governments at all levels and related enterprises have established a series of agricultural products traceability systems,realizing the traceability of the whole link of agricultural products from the field to the table.However,most agricultural products traceability systems are still developed based on the relational database,which can not well describe the relationship between the entities in the correlation data,nor can it effectively explore the value behind the traceability data.In order to solve the application problem of storage and analysis scenarios of large-scale correlation data,graph database and distributed graph data analysis technology were bred.They can store correlation data as a graph model composed of nodes and edges,and can run various graph algorithms above the graph model for data mining.The traceability data of agricultural products is mainly composed of the relationship between enterprises and agricultural products,which conforms to the characteristics of large-scale correlation data.Based on this,this paper integrates the graph data related technology with the traceability data of agricultural products,and designs and implements the agricultural product traceability data analysis platform by using the graph database Neo4 j and Spark,a distributed computing system,so as to extract the valuable information hidden in the traceability data and further guarantee the security of agricultural products.This paper summarizes the relevant techniques and algorithms,and introduces the operation principle and execution process and role in the analysis of agricultural products.Then detailed analysis of platform function requirements,and with the micro service idea as the design concept,according to the results of the demand analysis platform function modular division and overall architecture design,using UML diagram,architecture,function execution diagram and timing diagram describes the specific function design of the platform,according to the NGDS and Cypher design data analysis algorithm,according to Graph Frames library design map data distributed computing function.Then,according to the design scheme,Spark,Neo4 j and HDFS were introduced as core components of data analysis and storage,and Nacos and Sentinel were introduced as basic components of service discovery and traffic monitoring;Java framework developing traceability data management and PC interface using Spring Boot;distributed data analysis interface using Python framework Flask;and front-end page and graph data visualization function using Java Script framework Vue.js and Cytoscape.js.Finally,the specific functions of the platform are introduced,and some functions are tested and compared to verify the practicability and reliability of the platform.Finally complete agricultural products traceability data analysis platform for traceability data enterprises,government or related scientific research workers,provides the traceability data storage,acquisition,visualization function,and support for traceability data using all kinds of figure algorithm data analysis,used to explore the circulation process of agricultural products,find the key enterprises and divided enterprise community.The application of the platform enables relevant organizations and personnel to quickly make decisions and respond to the safety problems of agricultural products;and more,enterprises can optimize the supply chain and industrial chain of agricultural products,improve enterprise efficiency,and enable more consumers to buy affordable and safe agricultural products. |