Frame-Based Detection of Figurative Language in Tweets [Application Notes]

Diego Reforgiato Recupero, Mehwish Alam, Davide Buscaldi, Aude Grezka, Farideh Tavazoee

Research output: Contribution to journalArticlepeer-review

Abstract

This paper analyzes the problem of figurative language detection on social media, with a focus on the use of semantic features for identifying irony and sarcasm. Framester, a novel resource that acts as a hub between FrameNet, WordNet, VerbNet, BabelNet, DBpedia, Yago, DOLCE-Zero and others, has been used to extract semantic features from text. These semantic features are used to enrich the representations of tweets with event information using frames and word senses in addition to lexical units. The data set used for experimentation purposes contains tweets taken from different corpora including both figurative (containing irony and sarcasm) and non-figurative language. Two major tasks were performed: (i) detecting figurative language in tweets in a dataset containing both figurative and non-figurative tweets, (ii) classifying tweets containing irony and sarcasm. A 10-fold cross-validation experiment shows that the obtained accuracy for both tasks increases significantly when the semantic features such as linguistic frames and word senses are used in addition to lexical units, indicating that they may be important clues for figurative language. The approach was developed on top of Apache Spark so that it is easily scalable to much higher volumes of data, allowing for real-time analysis.

Original languageEnglish
Article number8870226
Pages (from-to)77-88
Number of pages12
JournalIEEE Computational Intelligence Magazine
Volume14
Issue number4
DOIs
Publication statusPublished - 1 Nov 2019
Externally publishedYes

Fingerprint

Dive into the research topics of 'Frame-Based Detection of Figurative Language in Tweets [Application Notes]'. Together they form a unique fingerprint.

Cite this