Research And Design Of Floating Gate Based Vector-Matrix Multiplier For Neural Network

Posted on:2021-05-04

Degree:Master

Type:Thesis

Country:China

Candidate:M F Jiang

Full Text:PDF

GTID:2428330602497453

Subject:Electronic Science and Technology

Abstract/Summary:

PDF Full Text Request

With the development of deep learning in recent years,neural networks have be-come more and more widely used in many fields such as image recognition,speech recognition,and natural language processing.However,as the network architecture becomes more and more complex,the amount of computation required by neural net-works has increased dramatically.At the same time,the large amount of input and output data required makes data transmission a major bottleneck that limits speed.In addition,computing and data transmission bring tremendous power consumption,mak-ing neural network applications difficult to deploy on edge devices.Research on the new computation architecture can solve the data transmission bottleneck caused by the separation of memory and computation of the von Neumann architecture through in-memory processing.Aiming at the application requirements of neural networks,this thesis designed a vector-matrix multiplier(VMM)with processing in memory(PIM)architecture based on floating-gate cells of NOR flash.The main research work and innovation of this thesis are as follows:1.The principle of the analog multiplication on floating gate cells was researched.The floating gate cells worked in the linear region when performing analog multiplica-tion.Two floating gate cells were used to store one weight differentially.Compared with the single floating gate cell,the computation linearity was improved.The distribu-tion of weights' temperature coefficients was reduced,and zero temperature coefficients can be achieved through compensation.2.A analog vector-matrix multiplication circuit was designed based on floating gate cells,including the design of flash memory array,current subtraction circuit,cur-rent buffer circuit,and temperature compensation load circuit.The flash memory array realized data storage and multiply-accumulate operations.The current subtraction cir-cuit and buffer circuit implemented current subtraction and impedance transformation to accurately calculate the multiply-accumulate results.The load circuit compensated the temperature characteristics of the floating gate pair,so that the analog multiplication results had good temperature characteristics.3.Auxiliary circuits for the interface between digital and analog were designed,including behavior-level modeling of digital-to-analog converters(DAC)and analog-to-digital converters(ADC),and analog vector-matrix multiplication circuits were con-nected to external digital circuits.4.The MNIST neural network was used for circuit verification,and a full ana-log circuit implementation scheme was proposed.System was based on the proposed vector-matrix multiplication circuit,and the circuit scheme expansions of the convo-lutional layer,pooling layer,and fully connected layer were proposed.For the small application,the system could complete the recognition on-chip,and directly output the recognition results,eliminating the need for ADC and external data scheduling,reduc-ing the power consumption.The circuit design was based on the XMC 65nm floating gate process,with a power supply voltage of 1.8V,Simulation was performed using Cadence Spectre,and behavioral modeling was described using Verilog-A.The accuracy of the system was 4 bits(inputs were 4-bit unsigned,weights were 5-bit signed,outputs were 4-bit un-signed with ReLU activation function).The size of flash array for vector-matrix analog multiplication was 32×32.The maximum clock frequency was 26MHz,and power con-sumption was 0.7～1.8mA.The layout area used for analog computation was 0.028mm2.The temperature coefficient of the stored weights was less than 0.06%/℃ from-40℃ to 130℃.The throughput was 1.66GOPS,and the power efficiency was 0.47～1.09TOPS/W.The MNIST neural network had 1718 weights.The recognition rate was 4.15×103fps,and the recognition accuracy rate was 97.67%.

Keywords/Search Tags:

Processing in memory, Vector-matrix multiplier, Flash, Temperature com-pensation

PDF Full Text Request

Related items

1	Temperature-aware Data Allocation Strategy For 3D Flash Memory
2	Temperature-Aware Optimization Strategies For 3D Charge-Trap Flash Memory
3	Temperature-aware Persistent Data Management For LSM-tree On 3D NAND Flash Memory
4	Read Failure Analysis For Flash Memory At Low Temperature And Low Vcc
5	Theoretical And Experimental Study On The Optical Vector Matrix Multiplier
6	Study On The Distribution Of Optical Vector-matrix Multiplier
7	Research On QC-LDPC Decoding For NAND Flash Memory
8	Performance Enhancement For Flash Memory Based Storage System
9	Research And Design Of Flash Memory Used For DSP
10	Based On Flash Infrared Camera Data High Speed Memory