TY - GEN
T1 - Continuous Health Monitoring of Machinery using Online Clustering on Unlabeled Data Streams
AU - Le-Nguyen, Minh Huong
AU - Turgis, Fabien
AU - Fayemi, Pierre Emmanuel
AU - Bifet, Albert
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - Maintenance is an important support function to ensure the reliability, safety, and availability in the railway. Lately, machine learning has become a major player and allows practitioners to build intricate learning models for machinery maintenance. Commonly, a model is trained on static data and is retrained on new data that exhibit novelties unknown to the model. On the contrary, online machine learning is a learning paradigm that adapts the models to new data, thus enabling adaptive, lifelong learning. Our goal is to leverage online learning on unlabeled data streams to enhance railway machinery maintenance. We propose Continuous Health Monitoring using Online Clustering (CheMoc) as an unsupervised method that learns the health profiles of the systems incrementally, assesses their working condition continuously via an adaptive health score, and works efficiently on streaming data. We evaluate CheMoc on a real-world data set from a national railway company. The results show that CheMoc discovered relevant health clusters, as confirmed by a domain expert, and processed the data of an entire year under two hours using only 600 MB of memory.
AB - Maintenance is an important support function to ensure the reliability, safety, and availability in the railway. Lately, machine learning has become a major player and allows practitioners to build intricate learning models for machinery maintenance. Commonly, a model is trained on static data and is retrained on new data that exhibit novelties unknown to the model. On the contrary, online machine learning is a learning paradigm that adapts the models to new data, thus enabling adaptive, lifelong learning. Our goal is to leverage online learning on unlabeled data streams to enhance railway machinery maintenance. We propose Continuous Health Monitoring using Online Clustering (CheMoc) as an unsupervised method that learns the health profiles of the systems incrementally, assesses their working condition continuously via an adaptive health score, and works efficiently on streaming data. We evaluate CheMoc on a real-world data set from a national railway company. The results show that CheMoc discovered relevant health clusters, as confirmed by a domain expert, and processed the data of an entire year under two hours using only 600 MB of memory.
KW - maintenance
KW - online clustering
KW - railway
UR - https://www.scopus.com/pages/publications/85147979106
U2 - 10.1109/BigData55660.2022.10021002
DO - 10.1109/BigData55660.2022.10021002
M3 - Conference contribution
AN - SCOPUS:85147979106
T3 - Proceedings - 2022 IEEE International Conference on Big Data, Big Data 2022
SP - 1866
EP - 1873
BT - Proceedings - 2022 IEEE International Conference on Big Data, Big Data 2022
A2 - Tsumoto, Shusaku
A2 - Ohsawa, Yukio
A2 - Chen, Lei
A2 - Van den Poel, Dirk
A2 - Hu, Xiaohua
A2 - Motomura, Yoichi
A2 - Takagi, Takuya
A2 - Wu, Lingfei
A2 - Xie, Ying
A2 - Abe, Akihiro
A2 - Raghavan, Vijay
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Big Data, Big Data 2022
Y2 - 17 December 2022 through 20 December 2022
ER -