An reinforcement learning-based speech censorship chatbot system

  • Shaokang Cai
  • Dezhi Han
  • Dun Li
  • Zibin Zheng
  • Noel Crespi

Research output: Contribution to journal › Article › peer-review

Abstract

The rapid development of artificial intelligence (AI) technology has brought large-scale AI applications to market and into practice. However, while AI technology has brought many conveniences, it has also exposed plenty of security issues, especially for chatbots that learn online. This paper proposes a speech censorship chatbot system based on reinforcement learning, composed mainly of two parts: the aggressive speech censorship model and the speech purification model. The aggressive speech censorship model uses the context of user input sentences to detect aggressive speech and to respond to its rapid evolution. When the chatbot has been polluted by a large amount of aggressive speech, the speech purification model can "forget" the learned malicious data through reinforcement learning rather than rolling back to an earlier version. In addition, integrating few-shot learning accelerates speech purification while reducing its influence on the quality of replies. The experimental results show that the proposed method reduces the probability of generating aggressive speech, and that integrating few-shot learning speeds up training while effectively slowing the decline in BLEU values.

Original language: English
Pages (from-to): 8751-8773
Number of pages: 23
Journal: Journal of Supercomputing
Volume: 78
Issue number: 6
Publication status: Published - 1 Apr 2022

Keywords

  • Bi-GRU
  • Chatbots
  • Reinforcement Learning
  • Speech Censorship
