Passer à la navigation principale Passer à la recherche Passer au contenu principal

Logistic regression with missing covariates—Parameter estimation, model selection and prediction within a joint-modeling framework

  • TraumaBase Group
  • Hôpital Beaujon
  • INRIA Institut National de Recherche en Informatique et en Automatique

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

Logistic regression is a common classification method in supervised learning. Surprisingly, there are very few solutions for performing logistic regression with missing values in the covariates. A complete approach based on a stochastic approximation version of the EM algorithm is proposed in order to perform statistical inference with missing values, including the estimation of the parameters and their variance, derivation of confidence intervals, and also a model selection procedure. The problem of prediction for new observations on a test set with missing covariate data is also tackled. Supported by a simulation study in which the method is compared to previous ones, it has proved to be computationally efficient, and has good coverage and variable selection properties. The approach is then illustrated on a dataset of severely traumatized patients from Paris hospitals by predicting the occurrence of hemorrhagic shock, a leading cause of early preventable death in severe trauma cases. The aim is to improve the current red flag procedure, a binary alert identifying patients with a high risk of severe hemorrhage. The method is implemented in the R package misaem.

langue originaleAnglais
Numéro d'article106907
journalComputational Statistics and Data Analysis
Volume145
Les DOIs
étatPublié - 1 mai 2020
Modification externeOui

SDG des Nations Unies

Ce résultat contribue à ou aux Objectifs de développement durable suivants

  1. SDG 3 - Bonne santé et bien-être
    SDG 3 Bonne santé et bien-être

Empreinte digitale

Examiner les sujets de recherche de « Logistic regression with missing covariates—Parameter estimation, model selection and prediction within a joint-modeling framework ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation