Skip to main navigation Skip to search Skip to main content

Stream2Graph: Dynamic Knowledge Graph for Online Learning Applied in Large-scale Network

  • Mariam Barry
  • , Albert Bifet
  • , Raja Chiky
  • , Saad El Jaouhari
  • , Jacob Montiel
  • , Aissa El Ouafi
  • , Dhiaddedine Yousfi
  • , Eric Guerizec
  • , Aurel Nobial
  • Institut Polytechnique de Paris
  • University of Waikato
  • ISEP
  • Amazon Machine Learning Solutions Lab
  • Paribas

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Knowledge Graphs (KG) are valuable information sources that store knowledge in a domain (healthcare, finance, e-commerce, cyber-security.). Most industrial KGs are dynamic by nature as they are updated regularly with streaming data (customer activity, network traffic, application logs, IT process). However, extracting insights from continuously updated data comes with major challenges, particularly in big data settings. In this paper, we address the following challenges: 1) ingesting heterogeneous data, 2) training and deployment of predictive models on continuously evolving data, and 3) implementation of data pipelines for updating and maintaining the KG in production. We cover multiple aspects of this process, from knowledge collection to its operationalization. We propose Stream2Graph, a stream-based system for building and updating the knowledge base dynamically in real time. Then we show how graph features can be used in downstream online machine learning models. The solution speeds up big data stream learning and knowledge extraction to enhance Graph-based AI applications. Experimental results show the effectiveness of our solution for knowledge base construction and improvement of big data learning capabilities. Using data from Stream2Graph resulted in speedups for training and inference time in the range from 547x to 2000x in downstream ML models. Finally, we provide the lessons learned from applying graph-based online learning on large-scale network processing high-velocity streaming data.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE International Conference on Big Data, Big Data 2022
EditorsShusaku Tsumoto, Yukio Ohsawa, Lei Chen, Dirk Van den Poel, Xiaohua Hu, Yoichi Motomura, Takuya Takagi, Lingfei Wu, Ying Xie, Akihiro Abe, Vijay Raghavan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2190-2197
Number of pages8
ISBN (Electronic)9781665480451
DOIs
Publication statusPublished - 1 Jan 2022
Externally publishedYes
Event2022 IEEE International Conference on Big Data, Big Data 2022 - Osaka, Japan
Duration: 17 Dec 202220 Dec 2022

Publication series

NameProceedings - 2022 IEEE International Conference on Big Data, Big Data 2022

Conference

Conference2022 IEEE International Conference on Big Data, Big Data 2022
Country/TerritoryJapan
CityOsaka
Period17/12/2220/12/22

Keywords

  • Banking
  • Big data
  • Knowledge Graph
  • Online Machine Learning
  • Real-Time
  • Streaming
  • Telecommunication

Fingerprint

Dive into the research topics of 'Stream2Graph: Dynamic Knowledge Graph for Online Learning Applied in Large-scale Network'. Together they form a unique fingerprint.

Cite this