TY - GEN
T1 - Exploiting innocuous activity for correlating users across sites
AU - Goga, Oana
AU - Lei, Howard
AU - Parthasarathi, Sree Hari Krishnan
AU - Friedland, Gerald
AU - Sommer, Robin
AU - Teixeira, Renata
PY - 2013/1/1
Y1 - 2013/1/1
N2 - We study how potential attackers can identify accounts on different social network sites that all belong to the same user, exploiting only innocuous activity that inherently comes with posted content. We examine three specific features on Yelp, Flickr, and Twitter: the geo-location attached to a user's posts, the timestamp of posts, and the user's writing style as captured by language models. We show that among these three features the location of posts is the most powerful feature to identify accounts that belong to the same user in different sites. When we combine all three features, the accuracy of identifying Twitter accounts that belong to a set of Flickr users is comparable to that of existing attacks that exploit usernames. Our attack can identify 37% more accounts than using usernames when we instead correlate Yelp and Twitter. Our results have significant privacy implications as they present a novel class of attacks that exploit users' tendency to assume that, if they maintain different personas with different names, the accounts cannot be linked together; whereas we show that the posts themselves can provide enough information to correlate the accounts. Copyright is held by the International World Wide Web Conference Committee (IW3C2).
AB - We study how potential attackers can identify accounts on different social network sites that all belong to the same user, exploiting only innocuous activity that inherently comes with posted content. We examine three specific features on Yelp, Flickr, and Twitter: the geo-location attached to a user's posts, the timestamp of posts, and the user's writing style as captured by language models. We show that among these three features the location of posts is the most powerful feature to identify accounts that belong to the same user in different sites. When we combine all three features, the accuracy of identifying Twitter accounts that belong to a set of Flickr users is comparable to that of existing attacks that exploit usernames. Our attack can identify 37% more accounts than using usernames when we instead correlate Yelp and Twitter. Our results have significant privacy implications as they present a novel class of attacks that exploit users' tendency to assume that, if they maintain different personas with different names, the accounts cannot be linked together; whereas we show that the posts themselves can provide enough information to correlate the accounts. Copyright is held by the International World Wide Web Conference Committee (IW3C2).
KW - Account correlation
KW - Geotags
KW - Language
KW - Location
KW - Online social networks
KW - Privacy
KW - User profiles
UR - https://www.scopus.com/pages/publications/84887308184
U2 - 10.1145/2488388.2488428
DO - 10.1145/2488388.2488428
M3 - Conference contribution
AN - SCOPUS:84887308184
SN - 9781450320351
T3 - WWW 2013 - Proceedings of the 22nd International Conference on World Wide Web
SP - 447
EP - 457
BT - WWW 2013 - Proceedings of the 22nd International Conference on World Wide Web
PB - Association for Computing Machinery
T2 - 22nd International Conference on World Wide Web, WWW 2013
Y2 - 13 May 2013 through 17 May 2013
ER -