Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Fabeela Ali Rawther; Abhinay A K; Anagha Tess B; Alan Joseph; Adham Saheer

Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Authors

Fabeela Ali Rawther

Amal Jyothi College Of Engineering

Author
Abhinay A K

Amal Jyothi College of Engineering,

Author
Anagha Tess B

Amal Jyothi College of Engineering,

Author
Alan Joseph

Amal Jyothi College of Engineering,

Author
Adham Saheer

Amal Jyothi College of Engineering,

Author

Abstract

Most machine learning models are not only highly
dependent on difficult datasets but also on the quality of labeled
data they are trained on, especially for offensive content detection.
In this paper, we study the TweetEval dataset to provide a
comparison of its ground truth with manually annotated labels;
inter-annotator agreements are applied here as a metric for
assessing the consistency of annotation. Cohen’s Kappa coefficient
is used to quantify how much each pair of annotators agreed and
where they differed. In-depth examination of missed classifications
demonstrates other difficulties with manual labelling: subjective
interpretation, context dependency, and annotator bias. The in-
sights gathered demonstrate how manual annotation can have
positive and negative effects on further model training practices,
highlighting the importance of standardized annotation guidelines.
In their actions, the findings contribute to enhancing offensive
content detection models by advocating dataset reliability and the
reduction of inconsistencies in labeling.

Keywords:

—TweetEval Dataset, Annotation Consistency, Inter- Annotator Agreement,Cohen’s Kappa,, Offensive Language Detection, Hybrid Models,Annotator Bias

Downloads 64

Full Text (PDF)

Published

20-06-2025

Issue

Vol. 5 No. 1 (2025): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, and Adham Saheer, “Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset”, IJERA, vol. 5, no. 1, Jun. 2025, Accessed: Jul. 21, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/312

Download Citation

Indexed By

Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Authors

Fabeela Ali Rawther

Abhinay A K

Anagha Tess B

Alan Joseph

Adham Saheer

Abstract

Keywords:

Published

Issue

Section

License

How to Cite

Similar Articles

Similar Articles

Comparative Analysis of Text Classification Models for Offensive Language Detection on Social Media Platforms

Survey of Machine Learning and Deep Learning Approaches for Automated Hate Speech Detection and Sentiment Analysis in Multilingual Contexts

A Review of Machine Learning and Deep Learning Approaches for Offensive Text Detection

TrueNews: AI Powered Detection of Manipulated Text and Images

Pneumonia Detection From Chest X-Rays Using Deep Learning : A Comprehensive Review

Comparative Study of Deep Learning Models for Pneumonia Classification

MindPulse: Employee Mental Health Detection and Attrition Prediction App

Survey on AI Malware Detection Methods and Cybersecurity Education

SPEAK: An AI-Based Assistive Video Communication System for Speech and Sign Language Translation

InsightAI: Bridging Natural Language and Data Analytics