Parallel Stochastic Gradient Descent Algorithm On Large-scale High-dimensional And Sparse Data

Posted on: 2021-04-08    Degree: Master    Type: Thesis
Country: China    Candidate: W Qin    Full Text: PDF
GTID: 2428330611487195    Subject: Computer application technology
Abstract/Summary:
With the rapid development of computer networks and the explosive growth of information in modern society, the advent of big data has facilitated the development of recommender systems, which have improved the quality of people's daily lives. Recommender systems often use high-dimensional and sparse (HiDS) matrices to quantify the relationships between users and items in an incomplete matrix. To extract useful information from HiDS matrices, researchers have proposed various big-data analysis methods, among which latent factor analysis has been shown to obtain and represent the information in such matrices efficiently. Recommender systems based on latent factor analysis commonly adopt stochastic gradient descent (SGD) as the learning algorithm; however, SGD is inherently sequential, so it incurs considerable time overhead and scales poorly on large-scale industrial problems. To address these problems, this thesis proposes several novel parallelization strategies that improve the convergence rate and computational efficiency of the model. The main contributions are as follows; illustrative sketches of the algorithms involved are given after the abstract.

(1) The application of latent factor analysis in recommender systems is reviewed, the difficulties of parallelizing SGD are analyzed theoretically, and existing SGD-based parallel latent factor models are studied and compared.

(2) A momentum-incorporated parallel SGD algorithm is proposed. The algorithm adds a momentum term to the stochastic gradient update and parallelizes training with a novel data-partitioning strategy. Experiments on large-scale industrial datasets show that the algorithm improves the convergence speed and computational efficiency of the model.

(3) A hierarchical SGD-based parallel algorithm is proposed. The algorithm parallelizes training at two levels, and experiments on large-scale, sparse, real-world datasets show that the hierarchical parallel latent factor model achieves higher speedup when solving large-scale matrix factorization.
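The abstract does not spell out the model, but latent factor analysis on a HiDS matrix is conventionally trained with the regularized SGD update sketched below. This is a minimal sketch of that standard formulation; the function name, hyperparameters, and initialization are illustrative assumptions, not details taken from the thesis.

```python
# Minimal sketch: latent factor analysis on a HiDS rating matrix via SGD.
# Standard regularized matrix-factorization updates; all names and
# hyperparameter values here are illustrative assumptions.
import numpy as np

def sgd_lfa(ratings, n_users, n_items, k=20, lr=0.01, lam=0.05, epochs=20, seed=0):
    """ratings: iterable of (user, item, value) triples for observed entries only."""
    rng = np.random.default_rng(seed)
    P = 0.1 * rng.standard_normal((n_users, k))   # user latent factors
    Q = 0.1 * rng.standard_normal((n_items, k))   # item latent factors
    for _ in range(epochs):
        for u, i, r in ratings:
            e = r - P[u] @ Q[i]                   # error on one observed entry
            pu = P[u].copy()                      # keep old value for Q's update
            P[u] += lr * (e * Q[i] - lam * pu)
            Q[i] += lr * (e * pu - lam * Q[i])
    return P, Q
```

Because each step touches only the factors of one user and one item, naively running steps in parallel lets workers overwrite each other's updates; this dependency is the scalability problem the thesis targets.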
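For contribution (2), the abstract names a momentum term and a novel data-partitioning strategy without defining either. The sketch below combines standard momentum buffers with a DSGD-style block schedule, in which the blocks of one stratum share no users or items and can therefore be updated by different workers without locking; this particular partition is an assumption for illustration, not necessarily the thesis's strategy.

```python
# Hypothetical sketch: momentum SGD over one block of the rating matrix,
# plus a block schedule whose strata are conflict-free (no shared users
# or items within a stratum).
import numpy as np

def momentum_block_step(P, Q, VP, VQ, block, lr=0.01, lam=0.05, beta=0.9):
    """One momentum-SGD pass over the observed entries of a single block."""
    for u, i, r in block:
        e = r - P[u] @ Q[i]
        VP[u] = beta * VP[u] + lr * (e * Q[i] - lam * P[u])  # velocity buffers
        VQ[i] = beta * VQ[i] + lr * (e * P[u] - lam * Q[i])
        P[u] += VP[u]
        Q[i] += VQ[i]

def strata(n_blocks):
    """Diagonal schedule: within each yielded stratum, the (row, column)
    block pairs are pairwise disjoint, so they can run on separate workers."""
    for shift in range(n_blocks):
        yield [(b, (b + shift) % n_blocks) for b in range(n_blocks)]
```

An epoch then loops over the strata in order, dispatching each stratum's blocks to a worker pool and synchronizing before moving to the next stratum.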
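Contribution (3) states only that training is parallelized at two levels. As a hedged illustration, the sketch below assumes an outer level that runs conflict-free blocks on a thread pool and an inner level that vectorizes each block's update over its entries with NumPy (which releases the GIL during array work); the thesis's actual two levels may differ.

```python
# Hypothetical two-level sketch: threads over conflict-free blocks (outer),
# vectorized updates over each block's entries (inner).
from concurrent.futures import ThreadPoolExecutor
import numpy as np

def batch_step(P, Q, users, items, vals, lr=0.01, lam=0.05):
    """Inner level: one vectorized SGD step over all entries of a block."""
    e = vals - np.einsum('ij,ij->i', P[users], Q[items])  # per-entry errors
    gP = e[:, None] * Q[items] - lam * P[users]
    gQ = e[:, None] * P[users] - lam * Q[items]
    np.add.at(P, users, lr * gP)   # scatter-add handles repeated indices
    np.add.at(Q, items, lr * gQ)

def epoch(P, Q, strata, workers=4):
    """Outer level: blocks within a stratum touch disjoint rows and columns,
    so a thread pool can process them concurrently."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for stratum in strata:   # each block: (users, items, vals) index arrays
            list(pool.map(lambda blk: batch_step(P, Q, *blk), stratum))
```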
Keywords/Search Tags: Big Data, Latent Factor Analysis, High-Dimensional and Sparse (HiDS) Matrix, Stochastic Gradient Descent, Parallel Computing