Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Fabeela Ali Rawther; Abhinay A K; Anagha Tess B; Alan Joseph; Adham Saheer

Authors

Fabeela Ali Rawther
Amal Jyothi College of Engineering

Abhinay A K
Amal Jyothi College of Engineering

Anagha Tess B
Amal Jyothi College of Engineering

Alan Joseph
Amal Jyothi College of Engineering

Adham Saheer
Amal Jyothi College of Engineering

Abstract

Most machine learning models depend heavily not only on challenging datasets but also on the quality of the labeled data they are trained on, especially for offensive content detection. In this paper, we study the TweetEval dataset and compare its ground-truth labels with manually annotated labels, using inter-annotator agreement as the metric for assessing annotation consistency. Cohen’s Kappa coefficient is used to quantify how much each pair of annotators agreed and where they differed. An in-depth examination of misclassifications reveals further difficulties with manual labelling: subjective interpretation, context dependency, and annotator bias. The insights gathered show how manual annotation can affect subsequent model training both positively and negatively, highlighting the importance of standardized annotation guidelines. The findings contribute to improving offensive content detection models by advocating dataset reliability and the reduction of labeling inconsistencies.
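
The pairwise agreement measure named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `cohen_kappa`, the variable names, and the toy labels below are assumptions made purely for illustration, and the labels are not drawn from TweetEval. The sketch computes Cohen's Kappa as (observed agreement - chance agreement) / (1 - chance agreement) for every pair of label sequences.

```python
from collections import Counter
from itertools import combinations


def cohen_kappa(labels_a, labels_b):
    """Cohen's Kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: product of each rater's marginal label frequencies,
    # summed over all labels either rater used.
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    classes = set(counts_a) | set(counts_b)
    expected = sum(counts_a[c] * counts_b[c] for c in classes) / (n * n)

    if expected == 1.0:
        # Both raters used a single identical label; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy labels (1 = offensive, 0 = not offensive), for illustration only.
ratings = {
    "ground_truth": [1, 0, 1, 1, 0, 0, 1, 0],
    "annotator_1": [1, 0, 1, 0, 0, 0, 1, 0],
    "annotator_2": [1, 1, 1, 1, 0, 0, 0, 0],
}

# Kappa for every pair of label sources, including each annotator vs. ground truth.
pairwise = {
    (name_a, name_b): cohen_kappa(ratings[name_a], ratings[name_b])
    for name_a, name_b in combinations(ratings, 2)
}

for pair, kappa in pairwise.items():
    print(pair, round(kappa, 3))
```

A value of 1 indicates perfect agreement and 0 indicates agreement no better than chance, so low values for a given pair point to items worth re-examining for subjective interpretation or missing context.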

Keywords:

TweetEval Dataset, Annotation Consistency, Inter-Annotator Agreement, Cohen’s Kappa, Offensive Language Detection, Hybrid Models, Annotator Bias

Published

20-06-2025

Issue

Vol. 5 No. 1 (2025): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, and Adham Saheer, “Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset”, IJERA, vol. 5, no. 1, Jun. 2025, Accessed: Apr. 25, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/312

Similar Articles

A Two-Stage Deep Learning Framework for Skin Lesion Detection and Classification Using ResNet18 and EfficientNet-B4

INDIAN SIGN LANGUAGE RECOGNITION USING YOLOV5

Multiple Disease Detection using Machine Learning

CARDAMOM PLANT DISEASE DETECTION USING ROBOT

A Machine Learning Approach to Fake News Detection

HEALTH GUARD-A Multiple Disease Prediction Model Based on Machine learning

A Review on Comparison of VGG-16 and DenseNet algorithms for analysing brain tumor in MRI image

A Review on Prompt Engineering in Agriculture

Advanced Sensor-Based Landslide Detection and Alert System Utilizing Machine Learning

TrueNews-AI Powered Detection of Manipulated Text and Images