Labeling sensing data for mobility modeling

Jesse Read, Indre Žliobaite, Jaakko Hollmén

Research output: Contribution to journalArticlepeer-review

Abstract

In urban environments, sensory data can be used to create personalized models for predicting efficient routes and schedules on a daily basis; and also at the city level to manage and plan more efficient transport, and schedule maintenance and events. Raw sensory data is typically collected as time-stamped sequences of records, with additional activity annotations by a human, but in machine learning, predictive models view data as labeled instances, and depend upon reliable labels for learning. In real-world sensor applications, human annotations are inherently sparse and noisy. This paper presents a methodology for preprocessing sensory data for predictive modeling in particular with respect to creating reliable labeled instances. We analyze real-world scenarios and the specific problems they entail, and experiment with different approaches, showing that a relatively simple framework can ensure quality labeled data for supervised learning. We conclude the study with recommendations to practitioners and a discussion of future challenges.

Original languageEnglish
Pages (from-to)207-222
Number of pages16
JournalInformation Systems
Volume57
DOIs
Publication statusPublished - 1 Apr 2016
Externally publishedYes

Keywords

  • Hidden Markov models
  • Multi-label
  • Sensor fusion
  • Sensory data

Fingerprint

Dive into the research topics of 'Labeling sensing data for mobility modeling'. Together they form a unique fingerprint.

Cite this