Sketching Based Fast Tensor Computation

Posted on:2022-11-02

Degree:Master

Type:Thesis

Country:China

Candidate:X Y Cao

Full Text:PDF

GTID:2518306764962829

Subject:Automation Technology

Abstract/Summary:

PDF Full Text Request

In recent years,the emergence of large-scale datasets has increased the burden of software and hardware for data processing,transmission,and storage.Sketching refers to a subclass of the randomization-based algorithms.It uses a structured random sketching matrix to map the input data to the low dimensional sketched subspace,while retaining some useful information of the data.Using sketching techniques,some statistics of the original data can be approximately estimated in the sketched subspace directly.Therefore,algorithm acceleration and data dimensionality reduction can be realized at the cost of accuracy loss to some extent.This paper mainly studies the sketching algorithms to improve their approximation accuracy and computation efficiency in the application of tensor computation.Robust Tensor Power Method(RTPM)is a tensor CANDECOMP/PARAFAC decomposition algorithm.It needs to calculate tensor contractions in the process of each power iteration,resulting in low computational efficiency.Tensor Sketch(TS)has been used to accelerate RTPM,but the existing TS-based RTPM(TS-RTPM)is data-oblivious and does not make use of the distribution information of input data.This paper proposes the first data-driven TS framework named TS-Fast-RTPM-Net(TS-FRTPM-Net),which improves the efficiency of TS-RTPM by unrolling the outer iteration of TS-RTPM algorithm into Fast Power Iterations(FPI)modules of a neural network.Specifically,TSFRTPM-Net uses stochastic gradient descent method to optimize TS value matrices and RTPM initial matrices,and uses two greedy algorithms to optimize TS location matrices,which improves the approximation accuracy of TS-RTPM.In addition,by computing power iterations parallelly in FPI modules,the acceleration of TS-RTPM is realized.The experimental results show that compared with TS-RTPM,TS-FRTPM-Net has advantages in accuracy,speed and memory consumption,plus transferability to some extent.The existing sketching techniques have some limitations: Count Sketch(CS)is essentially a sketch for vectors.When the input is a high-order tensor,it needs to sketch along its vectorization.Therefore,constructing the CS matrix requires a pair of hash functions with a large size,which increases the memory consumption for the hash functions.In addition,for some tensors with specific structures,such as rank-1 tensors,CS cannot use its structural characteristics to accelerate sketch computation.TS uses multiple pairs of small-sized hash functions to construct the sketching matrix,so the storage consumption is much lower than CS.And it possesses a fast algorithm for rank-1 tensor.However,the construction of the TS matrix inevitably leads to hash confliction,which results in low approximation accuracy.Combining the advantages of CS and TS,this paper proposes a new sketching technique named Fast Count Sketch(FCS).FCS constructs sketching matrix in a similar way as TS,but uses a method similar to public overflow area to avoid hash confliction.While improving the approximation accuracy of TS,FCS inherits the advantages of TS in low storage complexity and efficient computation for rank-1 tensors.Theoretical analysis and numerical experimental results show that compared with CS and TS,FCS has advantages in approximation accuracy or computational efficiency.

Keywords/Search Tags:

randomized algorithms, tensor decomposition, dimensionality reduction, robust tensor power method, data-driven algorithms

PDF Full Text Request

Related items

1	Research On Models And Algorithms For Low-tubal-rank Tensor Recovery
2	Research On Theory And Applications Of Sparsity Reconstruction Of High-Dimension Data From Quantized Measurements
3	Research On Tensor-based Video Watermarking Algorithms
4	Research On Methods And Theory Of Tensor Learning For Complex High-dimensional Data
5	Research On Theory And Applications Of Low-Tubal-Rank Tensor Recovery
6	Research On Data Dimensionality Reduction Algorithms Based On Matrix Decomposition Learning
7	Design And Implementation Of Distributed Tensor Train Decomposition Algorithms
8	Research On Data Recovery Method Based On Tensor
9	Study Of Tensor Signal Processing-Based Classification Algorithms For Polarimetric SAR
10	High-dimensional Image Restoration Algorithms Based On Tensor