StreamMLOps: Operationalizing Online Learning for Big Data Streaming & Real-Time Applications

  • Mariam Barry
  • , Jacob Montiel
  • , Albert Bifet
  • , Sameer Wadkar
  • , Nikolay Manchev
  • , Max Halford
  • , Raja Chiky
  • , Saad El Jaouhari
  • , Katherine B. Shakman
  • , Joudi Al Fehaily
  • , Fabrice Le Deit
  • , Vinh Thuy Tran
  • , Eric Guerizec

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Continuously learning and serving from evolving streaming data and serving in real-time is a challenging problem. Traditionally, data is partitioned and processed in batches to train machine learning (ML) models. In industrial applications, static models' performance drops over time (model degradation, concept drift), requiring new models to be trained with recent data and redeployed in production. The scientific community has been studying online and adaptive methods to address batch-learning limitations and continuously train AI tasks for industrial applications such as cyber-security, AIOps, anomaly scoring, and drift detection in stock markets. This paper deals with the MLOps aspects of deploying such online and dynamic models to address the requirements in the production systems for real-time applications. Our architectures - based on open-source tools such as Kafka and River - demonstrated how online learning methods could be scaled horizontally in production to meet the demands of a high-velocity streaming pipeline. We demonstrate an MLOps strategy to perform incremental learning from streaming data and continuously deploy the online learning model without pausing the inference pipeline. Indeed, the design satisfies requirements such as model versioning, monitoring, audibility and reproducibility of prediction in both a supervised and semi-supervised setting. Our experiments - for malicious URLs detection task - performed on high-dimensional and feature-evolving streaming data (more than 3 million features) establish the effectiveness and efficiency of online learning models compared to batch (static) machine learning regarding both time and space complexity. Finally, we provide some best practices on data engineering for deploying online models to process a real-time feature stream in production environments. Code is publicly available for reproducibility.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 39th International Conference on Data Engineering, ICDE 2023
PublisherIEEE Computer Society
Pages3508-3521
Number of pages14
ISBN (Electronic)9798350322279
DOIs
Publication statusPublished - 1 Jan 2023
Event39th IEEE International Conference on Data Engineering, ICDE 2023 - Anaheim, United States
Duration: 3 Apr 20237 Apr 2023

Publication series

NameProceedings - International Conference on Data Engineering
Volume2023-April
ISSN (Print)1084-4627

Conference

Conference39th IEEE International Conference on Data Engineering, ICDE 2023
Country/TerritoryUnited States
CityAnaheim
Period3/04/237/04/23

Keywords

  • AIOps
  • Data Streaming
  • MLOps
  • Online Learning
  • Production
  • Real-Time
  • Serving

Fingerprint

Dive into the research topics of 'StreamMLOps: Operationalizing Online Learning for Big Data Streaming & Real-Time Applications'. Together they form a unique fingerprint.

Cite this