NetSpam: A Network-Based Spam Detection Framework for Reviews in Online Social Media

  • Saeedreza Shehnepoor
  • , Mostafa Salehi
  • , Reza Farahbakhsh
  • , Noel Crespi

Research output: Contribution to journalArticlepeer-review

Abstract

Nowadays, a big part of people rely on available content in social media in their decisions (e.g., reviews and feedback on a topic or product). The possibility that anybody can leave a review provides a golden opportunity for spammers to write spam reviews about products and services for different interests. Identifying these spammers and the spam content is a hot topic of research, and although a considerable number of studies have been done recently toward this end, but so far the methodologies put forth still barely detect spam reviews, and none of them show the importance of each extracted feature type. In this paper, we propose a novel framework, named NetSpam, which utilizes spam features for modeling review data sets as heterogeneous information networks to map spam detection procedure into a classification problem in such networks. Using the importance of spam features helps us to obtain better results in terms of different metrics experimented on real-world review data sets from Yelp and Amazon Web sites. The results show that NetSpam outperforms the existing methods and among four categories of features, including review-behavioral, user-behavioral, review-linguistic, and user-linguistic, the first type of features performs better than the other categories.

Original languageEnglish
Article number7865975
Pages (from-to)1585-1595
Number of pages11
JournalIEEE Transactions on Information Forensics and Security
Volume12
Issue number7
DOIs
Publication statusPublished - 1 Jul 2017
Externally publishedYes

Keywords

  • Social media
  • fake review
  • heterogeneous information networks
  • social network
  • spam review
  • spammer

Fingerprint

Dive into the research topics of 'NetSpam: A Network-Based Spam Detection Framework for Reviews in Online Social Media'. Together they form a unique fingerprint.

Cite this