Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset
Abstract
Most machine learning models depend not only on the datasets they are trained on but also on the quality of the labeled data, especially for offensive content detection. In this paper, we study the TweetEval dataset and compare its ground-truth labels with manually annotated ones, applying inter-annotator agreement as a metric for assessing annotation consistency. Cohen's Kappa coefficient is used to quantify how much each pair of annotators agreed and where they differed. An in-depth examination of misclassified cases reveals further difficulties with manual labelling: subjective interpretation, context dependency, and annotator bias. The insights gathered demonstrate how manual annotation can positively and negatively affect subsequent model training, highlighting the importance of standardized annotation guidelines. Overall, the findings contribute to enhancing offensive content detection models by advocating for dataset reliability and the reduction of labeling inconsistencies.
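The abstract's agreement metric can be illustrated with a minimal sketch of Cohen's Kappa for a pair of annotators. The labels below are hypothetical examples (1 = offensive, 0 = not offensive), not data from the TweetEval study:

```python
from collections import Counter

def cohens_kappa(a, b):
    """Cohen's kappa for two annotators labeling the same items."""
    assert len(a) == len(b) and len(a) > 0
    n = len(a)
    # Observed agreement: fraction of items both annotators labeled identically.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Expected agreement under chance, from each annotator's label marginals.
    ca, cb = Counter(a), Counter(b)
    expected = sum(ca[label] * cb[label] for label in set(a) | set(b)) / (n * n)
    return (observed - expected) / (1 - expected)

annotator_1 = [1, 0, 1, 1, 0, 0, 1, 0]
annotator_2 = [1, 0, 1, 0, 0, 1, 1, 0]
print(cohens_kappa(annotator_1, annotator_2))  # 0.5: moderate agreement
```

The same quantity is available as `cohen_kappa_score` in scikit-learn; the hand-rolled version above just makes the observed-versus-chance correction explicit.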
Keywords: TweetEval Dataset, Annotation Consistency, Inter-Annotator Agreement, Cohen's Kappa, Offensive Language Detection, Hybrid Models, Annotator Bias
License
Copyright (c) 2025 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
