SemEval-2018 task 3 : irony detection in English tweets

Publication type
Publication status
Van Hee, C., Lefever, E., & Hoste, V.
Proceedings of The 12th International Workshop on Semantic Evaluation
Association for Computational Linguistics (New Orleans, Louisiana)
SemEval-2018 workshop at the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2018) (New Orleans, Louisiana)
View in Biblio
(externe link)


This paper presents the first shared task on irony detection: given a tweet, automatic natural language processing systems should determine whether the tweet is ironic (Task A) and which type of irony (if any) is expressed (Task B). The ironic tweets were collected using irony-related hashtags (i.e. #irony, #sarcasm, #not) and were subsequently manually annotated to minimise the amount of noise in the corpus. Prior to distributing the data, hashtags that were used to collect the tweets were removed from the corpus. For both tasks, a training corpus of 3,834 tweets was provided, as well as a test set containing 784 tweets. Our shared tasks received submissions from 43 teams for the binary classification Task A and from 31 teams for the multiclass Task B. The highest classification scores obtained for both subtasks are respectively F1= 0.71 and F1= 0.51 and demonstrate that fine-grained irony classification is much more challenging than binary irony detection.