logo

Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Authors

  • Fabeela Ali Rawther

    Amal Jyothi College Of Engineering
    Author
  • Abhinay A K

    Amal Jyothi College of Engineering,
    Author
  • Anagha Tess B

    Amal Jyothi College of Engineering,
    Author
  • Alan Joseph

    Amal Jyothi College of Engineering,
    Author
  • Adham Saheer

    Amal Jyothi College of Engineering,
    Author

Abstract

Most machine learning models are not only highly 
dependent on difficult datasets but also on the quality of labeled 
data they are trained on, especially for offensive content detection. 
In this paper, we study the TweetEval dataset to provide a 
comparison of its ground truth with manually annotated labels; 
inter-annotator agreements are applied here as a metric for 
assessing the consistency of annotation. Cohen’s Kappa coefficient 
is used to quantify how much each pair of annotators agreed and 
where they differed. In-depth examination of missed classifications 
demonstrates other difficulties with manual labelling: subjective 
interpretation, context dependency, and annotator bias. The in- 
sights gathered demonstrate how manual annotation can have 
positive and negative effects on further model training practices, 
highlighting the importance of standardized annotation guidelines. 
In their actions, the findings contribute to enhancing offensive 
content detection models by advocating dataset reliability and the 
reduction of inconsistencies in labeling.

Keywords:

—TweetEval Dataset, Annotation Consistency, Inter- Annotator Agreement,Cohen’s Kappa,, Offensive Language Detection, Hybrid Models,Annotator Bias
Views 0
Downloads 0

Published

20-06-2025

Issue

Section

Articles

How to Cite

[1]
Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, and Adham Saheer, “Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset”, IJERA, vol. 5, no. 1, Jun. 2025, Accessed: Apr. 23, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/312

Similar Articles

11-20 of 149

You may also start an advanced similarity search for this article.