Research On Graph Processing Acceleration Mechanism Based On High Bandwidth Memory

Posted on:2024-02-25

Degree:Master

Type:Thesis

Country:China

Candidate:S Xue

Full Text:PDF

GTID:2568306929990839

Subject:Microelectronics and Solid State Electronics

Abstract/Summary:

PDF Full Text Request

Under the background of information age,Graph Processing has been widely used in various realistic scenarios,such as commodity recommendation,social network,financial risk control,network security and knowledge graph.With the arrival of 5G era,the scale of graph data is showing explosive growth,so it is particularly important to improve the performance of graph computing.In addition,because of the rapid development of graph computing system,far more than the common memory,so the system performance is often limited by memory,and high bandwidth memory is one of the effective strategies to break through this limitation.Although high-bandwidth memory has advantages such as high bandwidth,high efficiency and low power consumption,the switching architecture adopted by high-bandwidth memory to achieve global addressing will affect the utilization of bandwidth.Therefore,how to realize the full utilization of high-bandwidth memory and improve the graph computing performance is still one of the main challenges facing this field.In order to improve the bandwidth utilization rate of high bandwidth memory in graph computing system,this paper proposes a memory access optimization strategy and applies it to a general graph computing system.Based on three-layer Clos switching network architecture,this strategy adopts fixed route orthogonal link scheduling algorithm,which can make full use of high bandwidth memory while having global addressing function,and has simple module design,less resource consumption and better scalability,so that it can support more parallel processing units.When the strategy is applied to graph computing system,the impact of high bandwidth memory read and write delay is reduced by using burst transmission,the memory access efficiency is improved by using bitmap,and the parallelism is improved by using multiple processing elements and HBM’s pseudo channels.In this dissertation,the breadth-first algorithm and web ranking algorithm are tested experimentally in graph computing systems using different memory schemes.The experimental results show that the test performance of the two graph algorithms in the graph computing system using a single HBM stack is improved by 5.91-6.02 times compared with that of DDR.The performance of global addressing mode is 4.15-4.48 times better than that of DDR.The bandwidth utilization of scheduling optimization schemes is 1.32 to 1.45 times higher than that of global addressing schemes.

Keywords/Search Tags:

High bandwidth memory, memory access scheduling optimization, switching network, Graph Processing, FPGA

PDF Full Text Request

Related items

1	Research On Graph Partitioning And Scheduling Method For The Multi-FPGA Computing Platform
2	Research On Optimization Of Memory Access Scheduling Based On DDR3
3	Memory Access Optimization Method Of CNN Inference Task On FPGA
4	Trace-Based Modeling And Adapting Of Multi-Channel DDR Memory Controller
5	Research On Key Techniques Of Compiler-based Memory Access Analysis And Optimization
6	Multi-Task External Memory Graph Processing System Based On I/O Scheduling
7	Techniques To Tackle Memory Interference In Multicore Systems
8	Scheduling Research For Memory Controller Based On Density Of Memory Access
9	Design Of Key Circuits And Co-Optimization Of Hardware And Software For Processing-in-Memory
10	Switching in magnetic memory element of toggle mode magnetoresistive random access memory