Challenges of Machine Learning for Data Streams in the Banking Industry

  • Mariam Barry
  • , Albert Bifet
  • , Raja Chiky
  • , Jacob Montiel
  • , Vinh Thuy Tran

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Banking Information Systems continuously generate large quantities of data as inter-connected streams (transactions, events logs, time series, metrics, graphs, process, etc.). Such data streams need to be processed online to deal with critical business applications such as real-time fraud detection, network security attack prevention or predictive maintenance on information system infrastructure. Many algorithms have been proposed for data stream learning, however, most of them do not deal with the important challenges and constraints imposed by real-world applications. In particular, when we need to train models incrementally from heterogeneous data mining and deployment them within complex big data architecture. Based on banking applications and lessons learned in production environments of BNP Paribas - a major international banking group and leader in the Eurozone - we identified the most important current challenges for mining IT data streams. Our goal is to highlight the key challenges faced by data scientists and data engineers within complex industry settings for building or deploying models for real word streaming applications. We provide future research directions on Stream Learning that will accelerate the adoption of online learning models for solving real-word problems. Therefore bridging the gap between research and industry communities. Finally, we provide some recommendations to tackle some of these challenges.

Original languageEnglish
Title of host publicationBig Data Analytics - 9th International Conference, BDA 2021, Proceedings
EditorsSatish Narayana Srirama, Jerry Chun-Wei Lin, Raj Bhatnagar, Sonali Agarwal, P. Krishna Reddy
PublisherSpringer Science and Business Media Deutschland GmbH
Pages106-118
Number of pages13
ISBN (Print)9783030936198
DOIs
Publication statusPublished - 1 Jan 2021
Event9th International Conference on Big Data Analytics, BDA 2021 - Virtual, Online
Duration: 15 Dec 202118 Dec 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13147 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Big Data Analytics, BDA 2021
CityVirtual, Online
Period15/12/2118/12/21

Keywords

  • Banking
  • Challenges
  • IT
  • Production
  • Streaming

Fingerprint

Dive into the research topics of 'Challenges of Machine Learning for Data Streams in the Banking Industry'. Together they form a unique fingerprint.

Cite this