Hardware Acceleration for Tensorized Neural Networks

Posted on: 2019-05-07

Degree: M.S.

Type: Thesis

University: University of California, Santa Barbara

Candidate: Gan, Yiming

GTID: 2478390017987751

Subject: Computer Engineering

Abstract/Summary:

Machine learning has succeeded in many application domains, including medical data analysis, finance, and computer vision. However, many popular machine learning models (e.g., deep neural networks) are both data-intensive and computationally expensive: they require large volumes of data samples to train the networks, millions to billions of parameters to describe the model, and large-scale computations to complete the optimization or inference. As a result, deep learning can incur unaffordable energy and run-time costs on a hardware platform. In this paper, we present a way to accelerate deep neural networks and compress their weights by designing hardware acceleration for tensor-train decomposition layers. By applying hardware acceleration to tensorized neural networks, we achieve massive memory savings on two fully connected layers: their parameter counts shrink by factors of 4,880,644 and 3,195,660, respectively. At the same time, we achieve speedups of 2600x and 2900x over the original matrix multiplication.
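To give a sense of where the memory saving in a tensor-train (TT) layer comes from, the sketch below compares the parameter count of a dense fully connected weight matrix with the count for the same matrix stored as TT cores. It assumes the standard TT-matrix format, in which core k has shape (r_{k-1}, m_k, n_k, r_k) with boundary ranks equal to 1. The function names, the 1024 x 1024 layer size, the mode factorization, and the ranks are all illustrative assumptions; they are not the layers or settings used in the thesis and do not reproduce its reported ratios.

```python
from math import prod


def dense_params(row_modes, col_modes):
    """Parameter count of an uncompressed prod(row_modes) x prod(col_modes) matrix."""
    return prod(row_modes) * prod(col_modes)


def tt_params(row_modes, col_modes, ranks):
    """Parameter count of the same matrix stored as tensor-train cores.

    Core k is assumed to have shape
    (ranks[k], row_modes[k], col_modes[k], ranks[k + 1]),
    with boundary ranks ranks[0] == ranks[-1] == 1.
    """
    if len(row_modes) != len(col_modes) or len(ranks) != len(row_modes) + 1:
        raise ValueError("need d row modes, d column modes, and d + 1 ranks")
    if ranks[0] != 1 or ranks[-1] != 1:
        raise ValueError("boundary TT ranks must be 1")
    return sum(
        ranks[k] * m * n * ranks[k + 1]
        for k, (m, n) in enumerate(zip(row_modes, col_modes))
    )


# Hypothetical layer (not from the thesis): a 1024 x 1024 weight matrix,
# with each dimension factored as 4 * 4 * 4 * 4 * 4 and all inner TT ranks 4.
row_modes = [4, 4, 4, 4, 4]
col_modes = [4, 4, 4, 4, 4]
ranks = [1, 4, 4, 4, 4, 1]

dense = dense_params(row_modes, col_modes)
tt = tt_params(row_modes, col_modes, ranks)
print(f"dense: {dense}, TT: {tt}, compression: {dense / tt:.1f}x")
```

The dense count grows with the product of all mode sizes, while the TT count grows only with their sum (weighted by the ranks), which is why larger layers with small ranks can reach very high compression ratios.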

Keywords/Search Tags:

Hardware acceleration, Neural

Related items

1	Hardware Acceleration for Tensorized Neural Networks
2	Research On Hardware Acceleration Of 3D Convolutional Neural Network Algorithm Based On DSP
3	Research On The Compression And Hardware Acceleration Based On Convolutional Neural Network
4	High Performance Artificial Intelligence Computing With Algorithm-hardware Co-design
5	Research And Optimization Of Neural Network Acceleration Algorithm
6	Research On Hardware Acceleration Based On FPGA Of Convolutional Neural Network And Elliptic Curve Algorithm
7	Research On The Hardware Acceleration Mechanism For SDN/NFV
8	Acceleration System Design And Implement For Convolutional Neural Network Based On SOC FPGA
9	Research And Implementation Of Hardware Acceleration Of Convolutional Neural Network Based On ZYNQ
10	Hardware Accelerator Design Of Convolutional Neural Networks For Low Power And High Performance