Passer à la navigation principale Passer à la recherche Passer au contenu principal

A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning

  • CNRS-AgroParisTech Université Paris-Sud-Paris Saclay Orsay

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

Hyperparameter tuning and model selection are important steps in machine learning. Unfortunately, classical hyperparameter calibration and model selection procedures are sensitive to outliers and heavy-tailed data. In this work, we construct a selection procedure which can be seen as a robust alternative to cross-validation and is based on a median-ofmeans principle. Using this procedure, we also build an ensemble method which, trained with algorithms and corrupted heavy-tailed data, selects an algorithm, trains it with a large uncorrupted subsample and automatically tunes its hyperparameters. In particular, the approach can transform any procedure into a robust to outliers and to heavy-tailed data procedure while tuning automatically its hyperparameters. The construction relies on a divide-and-conquer methodology, making this method easily scalable even on a corrupted dataset. This method is tested with the LASSO which is known to be highly sensitive to outliers.

langue originaleAnglais
Pages (de - à)1202-1227
Nombre de pages26
journalElectronic Journal of Statistics
Volume15
Numéro de publication1
Les DOIs
étatPublié - 1 janv. 2021

Empreinte digitale

Examiner les sujets de recherche de « A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation