Skip to main navigation Skip to search Skip to main content

A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media

  • Institut Polytechnique de Paris

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Generated hateful and toxic content by a portion of users in social media is a rising phenomenon that motivated researchers to dedicate substantial efforts to the challenging direction of hateful content identification. We not only need an efficient automatic hate speech detection model based on advanced machine learning and natural language processing, but also a sufficiently large amount of annotated data to train a model. The lack of a sufficient amount of labelled hate speech data, along with the existing biases, has been the main issue in this domain of research. To address these needs, in this study we introduce a novel transfer learning approach based on an existing pre-trained language model called BERT (Bidirectional Encoder Representations from Transformers). More specifically, we investigate the ability of BERT at capturing hateful context within social media content by using new fine-tuning methods based on transfer learning. To evaluate our proposed approach, we use two publicly available datasets that have been annotated for racism, sexism, hate, or offensive content on Twitter. The results show that our solution obtains considerable performance on these datasets in terms of precision and recall in comparison to existing approaches. Consequently, our model can capture some biases in data annotation and collection process and can potentially lead us to a more accurate model.

Original languageEnglish
Title of host publicationComplex Networks and Their Applications VIII - Volume 1 Proceedings of the 8th International Conference on Complex Networks and Their Applications, COMPLEX NETWORKS 2019
EditorsHocine Cherifi, Sabrina Gaito, José Fernendo Mendes, Esteban Moro, Luis Mateus Rocha
PublisherSpringer
Pages928-940
Number of pages13
ISBN (Print)9783030366865
DOIs
Publication statusPublished - 1 Jan 2020
Event8th International Conference on Complex Networks and their Applications, COMPLEX NETWORKS 2019 - Lisbon, Portugal
Duration: 10 Dec 201912 Dec 2019

Publication series

NameStudies in Computational Intelligence
Volume881 SCI
ISSN (Print)1860-949X
ISSN (Electronic)1860-9503

Conference

Conference8th International Conference on Complex Networks and their Applications, COMPLEX NETWORKS 2019
Country/TerritoryPortugal
CityLisbon
Period10/12/1912/12/19

Keywords

  • BERT
  • Fine-tuning
  • Hate speech detection
  • Language modeling
  • NLP
  • Social media
  • Transfer learning

Fingerprint

Dive into the research topics of 'A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media'. Together they form a unique fingerprint.

Cite this