Sparse Principal Component Analysis with Missing Observations

  • College of Computing

Research output: Chapter in a book, report, anthology, or collection; Chapter; Peer-reviewed

Abstract

In this paper, we study the problem of sparse Principal Component Analysis (PCA) in the high-dimensional setting with missing observations. Our goal is to estimate the first principal component when we only have access to partial observations. Existing estimation techniques are usually derived for fully observed data sets and require prior knowledge of the sparsity of the first principal component in order to achieve good statistical guarantees. Our contribution is essentially theoretical in nature. First, we establish the first information-theoretic lower bound for the sparse PCA problem with missing observations. Second, we study the properties of a BIC-type estimator that does not require any prior knowledge of the sparsity of the unknown first principal component or any imputation of the missing observations, and that adapts to the unknown sparsity of the first principal component. Third, if the covariance matrix of interest admits a sparse first principal component and is in addition approximately low-rank, then we can derive a completely data-driven choice of the regularization parameter, and the resulting BIC estimator will also enjoy optimal statistical performance (up to a logarithmic factor).

Original language: English
Title: Progress in Probability
Publisher: Birkhauser
Pages: 327-356
Number of pages: 30
State: Published - 1 Jan 2013
Externally published: Yes

Publication series

Name: Progress in Probability
Volume: 66
ISSN (Print): 1050-6977
ISSN (Electronic): 2297-0428
