Passer à la navigation principale Passer à la recherche Passer au contenu principal

FEDRA: A fast and efficient dimensionality reduction algorithm

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Contemporary data-intensive applications generate large datasets of very high dimensionality. Data management in high-dimensional spaces presents problems, such as the degradation of query processing performance, a phenomenon also known as the curse of dimensionality. Dimensionality reduction (DR) tackles this problem, by efficiently embedding data from high dimensional to lower dimensional spaces. However, the large scale and dynamism of generated data calls for methods of low time and space complexity, features that are hardly combined in the majority of existing DR algorithms. Motivated by this fact, in this paper we propose FEDRA, a fast and efficient dimensionality reduction algorithm that uses a set of landmark points to project data to a lower dimensional Euclidean space. FEDRA is both faster and requires less memory than other comparable algorithms, without compromising the projection's quality. We theoretically assess the quality of the resulting projection and provide a bound for the error induced in pairwise distances. Furthermore, we present two extensions of FEDRA that improve the quality of the projection, suitable for applications that can tolerate higher processing costs. We prove the validity of our claims both theoretically and experimentally, by comparing our algorithm against prominent approaches, such as FastMap, LMDS, PCA, SVD and Random Projection.

langue originaleAnglais
titreSociety for Industrial and Applied Mathematics - 9th SIAM International Conference on Data Mining 2009, Proceedings in Applied Mathematics 133
Pages505-516
Nombre de pages12
étatPublié - 1 déc. 2009
Modification externeOui
Evénement9th SIAM International Conference on Data Mining 2009, SDM 2009 - Sparks, NV, États-Unis
Durée: 30 avr. 20092 mai 2009

Série de publications

NomSociety for Industrial and Applied Mathematics - 9th SIAM International Conference on Data Mining 2009, Proceedings in Applied Mathematics
Volume1

Une conférence

Une conférence9th SIAM International Conference on Data Mining 2009, SDM 2009
Pays/TerritoireÉtats-Unis
La villeSparks, NV
période30/04/092/05/09

Empreinte digitale

Examiner les sujets de recherche de « FEDRA: A fast and efficient dimensionality reduction algorithm ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation