Generative Deep Learning for Internet of Things Network Traffic Generation

Mustafizur R. Shahid, Gregory Blanc, Houda Jmila, Zonghua Zhang, Herve Debar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The rapid development of the Internet of Things (IoT) has prompted a recent interest into realistic IoT network traffic generation. Security practitioners need IoT network traffic data to develop and assess network-based intrusion detection systems (NIDS). Emulating realistic network traffic will avoid the costly physical deployment of thousands of smart devices. From an attacker's perspective, generating network traffic that mimics the legitimate behavior of a device can be useful to evade NIDS. As network traffic data consist of sequences of packets, the problem is similar to the generation of sequences of categorical data, like word by word text generation. Many solutions in the field of natural language processing have been proposed to adapt a Generative Adversarial Network (GAN) to generate sequences of categorical data. In this paper, we propose to combine an autoencoder with a GAN to generate sequences of packet sizes that correspond to bidirectional flows. First, the autoencoder is trained to learn a latent representation of the real sequences of packet sizes. A GAN is then trained on the latent space, to learn to generate latent vectors that can be decoded into realistic sequences. For experimental purposes, bidirectional flows produced by a Google Home Mini are used, and the autoencoder is combined with a Wassertein GAN. Comparison of different network characteristics shows that our proposed approach is able to generate sequences of packet sizes that behave closely to real bidirectional flows. We also show that the synthetic bidirectional flows are close enough to the real ones that they can fool anomaly detectors into labeling them as legitimate.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE 25th Pacific Rim International Symposium on Dependable Computing, PRDC 2020
PublisherIEEE Computer Society
Pages70-79
Number of pages10
ISBN (Electronic)9781728180038
DOIs
Publication statusPublished - 1 Dec 2020
Event25th IEEE Pacific Rim International Symposium on Dependable Computing, PRDC 2020 - Perth, Australia
Duration: 1 Dec 20204 Dec 2020

Publication series

NameProceedings of IEEE Pacific Rim International Symposium on Dependable Computing, PRDC
Volume2020-December
ISSN (Print)1541-0110

Conference

Conference25th IEEE Pacific Rim International Symposium on Dependable Computing, PRDC 2020
Country/TerritoryAustralia
CityPerth
Period1/12/204/12/20

Keywords

  • Autoencoder
  • Deep Learning
  • Generative Adversarial Network
  • Internet of Things
  • Network Security

Fingerprint

Dive into the research topics of 'Generative Deep Learning for Internet of Things Network Traffic Generation'. Together they form a unique fingerprint.

Cite this