Comparative Study On Sparse Principal Component Analysis

Posted on:2022-11-06

Degree:Master

Type:Thesis

Country:China

Candidate:W L Yin

GTID:2518306749967129

Subject:Applied Statistics

Abstract/Summary:

Principal component analysis (PCA) is a commonly used feature extraction method that processes, compresses and extracts sample information based on the covariance matrix of the variables. It has been widely used in many fields, such as biology, medicine, machine learning and informatics. However, practical applications often face the following challenges: first, each principal component is a linear combination of all the original variables, and the loading coefficients are mostly non-zero, which makes the meaning of the principal components difficult to explain; second, PCA may produce "wrong" results. Sparse PCA was introduced to address these challenges. Sparse principal component analysis combines the LASSO sparsity penalty with principal component analysis to make the loading coefficients sparse, thereby achieving both dimensionality reduction and interpretability. Different sparsity penalties also produce sparse principal components with different properties. According to the characteristics of each sparse PCA method, this thesis groups them into three categories: conventional sparse PCA methods, sparse PCA methods that maximize the explained variance, and sparse PCA methods that yield orthogonal or uncorrelated principal components. We select the most representative method from each category: the sparse principal component analysis (SPCA) method of Zou et al. (2006), sparse PCA via regularized SVD (SPCA-rSVD) of Shen and Huang (2008), and the norm selection-based sparse principal component analysis (CN-SPCA) method of Qi et al. (2013). The basic model and algorithm of each method are described in detail, with the aim of comparing the different types of sparse PCA methods. The results of the simulation study and the case analysis show that all three sparse PCA methods extract principal components with sparse loadings and improve the interpretability of the principal components. However, compared with conventional PCA, the proportion of variance explained by each of the three sparse PCA methods is lower. Among the three, SPCA-rSVD always attains the highest proportion of explained variance, while CN-SPCA extracts uncorrelated sparse principal components but explains a lower proportion of variance. These results provide a reference for selecting a suitable sparse principal component analysis method.
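
To illustrate how a LASSO-type penalty makes loading coefficients sparse, the following is a minimal sketch in the spirit of sparse PCA via regularized SVD: it alternates a soft-thresholding update of the loading vector with a normalized update of the score vector. It is not the thesis's implementation. The function names, the threshold parameter `lam`, the iteration count and the tolerance are all assumptions made for illustration, and only the first component is computed.

```python
import numpy as np


def soft_threshold(x, lam):
    """Elementwise soft-thresholding (proximal operator of the L1 penalty)."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)


def sparse_pc_rank1(X, lam, n_iter=200, tol=1e-8):
    """Sketch of a first sparse loading vector by alternating rank-one updates.

    X   : column-centered data matrix of shape (n_samples, n_variables)
    lam : soft-threshold level (hypothetical tuning parameter)
    Returns a unit-norm loading vector, or all zeros if lam removes everything.
    """
    # Start from the leading singular triplet of ordinary PCA.
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    u = U[:, 0]
    v = s[0] * Vt[0]

    for _ in range(n_iter):
        # Loading update: shrink small coefficients exactly to zero.
        v_new = soft_threshold(X.T @ u, lam)
        if not np.any(v_new):
            return v_new  # threshold too large: every loading was removed
        # Score update: project the data on the loadings and renormalize.
        Xv = X @ v_new
        u = Xv / np.linalg.norm(Xv)
        converged = np.linalg.norm(v_new - v) < tol
        v = v_new
        if converged:
            break

    return v / np.linalg.norm(v)
```

Because the returned loading vector has unit norm, the variance of the scores `X @ v` can never exceed the variance of the first ordinary principal component, which is consistent with the loss of explained variance reported above for the sparse methods.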

Keywords/Search Tags:

Data dimensionality reduction, Principal component analysis, Sparse principal component analysis, Simulation

Related items

1	Research On Data Stream Dimensionality Reduction Algorithm
2	Secure And Efficient Dimension-reducing Ranked Query Method For Encrypted Cloud Data
3	Research On Centered Weight Based Principal Component Analysis
4	Construction Method Of Principal Component Networks And Its Application
5	Application Of Principal Component Analysis And Clustering In Science And Technology Data Analysis
6	Research On Dimensionality Reduction Of Gene Expression Data Based On Traditional Feature Extraction And Deep Learning
7	Research On Face Recognition Algorithm Based On Principal Component Analysis
8	Parameter Selection of Sparse Functional Principal Component Analysis with fMRI data
9	Research On Feature Extraction Based On Principal Component Analysis
10	A Research Of Key Technology Of Dimensionality Reduction Of High Dimensional Data