Passer à la navigation principale Passer à la recherche Passer au contenu principal

Predicting over-indebtedness on batch and streaming data

  • Université Paris-Saclay
  • CNRS LTCI

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Detecting over-indebtedness, the difficulties meeting household payment commitments, poses multiple Big Data challenges for banking institutions. We present a novel data-driven framework for predicting over-indebtedness on real-world data. A warning mechanism that generates predictions 6 months ahead, improving the chances of financial recovery. This framework is based on the combination of feature selection and supervised learning techniques, and uses data balancing for fine-tuning the predictive models. We propose two versions of the framework based on state-of-the-art batch and streaming learning techniques. To the best of our knowledge, the proposed framework is the first to cast over-indebtedness prediction as a stream learning problem. The appeal of stream learning rises from the large amount of data continuously generated, and the fact that batch models become obsolete over time as financial data evolves, while stream models are continuously updated as new data is available. We use credit data from two banks from the Groupe BPCE (the second-largest banking institution in France) and apply multi-metric criteria to evaluate model performance and fairness. Test results show the framework's interbank applicability and that the proposed batch and stream frameworks outperform the current solution for both single and multi-metric criteria. Additionally, the generic structure of the framework serves as a template for systematically approaching similar classification problems.

langue originaleAnglais
titreProceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
rédacteurs en chefJian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages1504-1513
Nombre de pages10
ISBN (Electronique)9781538627143
Les DOIs
étatPublié - 1 juil. 2017
Modification externeOui
Evénement5th IEEE International Conference on Big Data, Big Data 2017 - Boston, États-Unis
Durée: 11 déc. 201714 déc. 2017

Série de publications

NomProceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
Volume2018-January

Une conférence

Une conférence5th IEEE International Conference on Big Data, Big Data 2017
Pays/TerritoireÉtats-Unis
La villeBoston
période11/12/1714/12/17

Empreinte digitale

Examiner les sujets de recherche de « Predicting over-indebtedness on batch and streaming data ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation