TY - JOUR
T1 - Locater
T2 - Cleaning wifi connectivity datasets for semantic localization
AU - Lin, Yiming
AU - Jiang, Daokun
AU - Yus, Roberto
AU - Bouloukakis, Georgios
AU - Chio, Andrew
AU - Mehrotra, Sharad
AU - Venkatasubramanian, Nalini
N1 - Publisher Copyright:
© 2020, VLDB Endowment. All rights reserved.
PY - 2020/1/1
Y1 - 2020/1/1
N2 - This paper explores the data cleaning challenges that arise in using WiFi connectivity data to locate users to semantic indoor locations such as buildings, regions, rooms. WiFi connectivity data consists of sporadic connections between devices and nearby WiFi access points (APs), each of which may cover a relatively large area within a building. Our system, entitled semantic LOCATion cleanER (LO-CATER), postulates semantic localization as a series of data cleaning tasks-first, it treats the problem of determining the AP to which a device is connected between any two of its connection events as a missing value detection and repair problem. It then associates the device with the semantic subregion (e.g., a conference room in the region) by postulating it as a location disambiguation problem. LO-CATER uses a bootstrapping semi-supervised learning method for coarse localization and a probabilistic method to achieve finer localization. The paper shows that LOCATER can achieve significantly high accuracy at both the coarse and fine levels.
AB - This paper explores the data cleaning challenges that arise in using WiFi connectivity data to locate users to semantic indoor locations such as buildings, regions, rooms. WiFi connectivity data consists of sporadic connections between devices and nearby WiFi access points (APs), each of which may cover a relatively large area within a building. Our system, entitled semantic LOCATion cleanER (LO-CATER), postulates semantic localization as a series of data cleaning tasks-first, it treats the problem of determining the AP to which a device is connected between any two of its connection events as a missing value detection and repair problem. It then associates the device with the semantic subregion (e.g., a conference room in the region) by postulating it as a location disambiguation problem. LO-CATER uses a bootstrapping semi-supervised learning method for coarse localization and a probabilistic method to achieve finer localization. The paper shows that LOCATER can achieve significantly high accuracy at both the coarse and fine levels.
U2 - 10.14778/3430915.3430923
DO - 10.14778/3430915.3430923
M3 - Article
AN - SCOPUS:85097261161
SN - 2150-8097
VL - 14
SP - 329
EP - 341
JO - Proceedings of the VLDB Endowment
JF - Proceedings of the VLDB Endowment
IS - 3
ER -