Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Fabeela Ali Rawther; Abhinay A K; Anagha Tess B; Alan Joseph; Adham Saheer

Authors

Fabeela Ali Rawther, Amal Jyothi College of Engineering

Abhinay A K, Amal Jyothi College of Engineering

Anagha Tess B, Amal Jyothi College of Engineering

Alan Joseph, Amal Jyothi College of Engineering

Adham Saheer, Amal Jyothi College of Engineering

Abstract

Machine learning models depend heavily not only on challenging datasets but also on the quality of the labeled data they are trained on, especially for offensive content detection. In this paper, we study the TweetEval dataset and compare its ground-truth labels with manually annotated labels, using inter-annotator agreement as the metric for annotation consistency. Cohen’s Kappa coefficient quantifies how much each pair of annotators agreed and where they differed. An in-depth examination of misclassifications reveals further difficulties with manual labelling: subjective interpretation, context dependency, and annotator bias. The insights gathered show how manual annotation can affect downstream model training both positively and negatively, highlighting the importance of standardized annotation guidelines. The findings contribute to improving offensive content detection models by promoting dataset reliability and reducing labeling inconsistencies.
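
As an illustration of the agreement metric named in the abstract, the following is a minimal sketch of pairwise Cohen's Kappa and disagreement lookup in plain Python. It is not the authors' code: the function names (`cohen_kappa`, `pairwise_kappa`, `disagreements`), the binary label encoding (1 = offensive, 0 = not offensive), and the toy label lists are all assumptions made for the example and are not taken from TweetEval.

```python
from collections import Counter
from itertools import combinations


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both raters.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a.keys() | freq_b.keys()) / (n * n)
    if p_e == 1.0:
        # Both raters used one identical label throughout; treat as full agreement.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


def pairwise_kappa(annotations):
    """Kappa for every pair of raters in a {name: labels} mapping."""
    return {
        (x, y): cohen_kappa(annotations[x], annotations[y])
        for x, y in combinations(annotations, 2)
    }


def disagreements(labels_a, labels_b):
    """Indices of items on which the two raters assigned different labels."""
    return [i for i, (a, b) in enumerate(zip(labels_a, labels_b)) if a != b]


# Hypothetical toy labels (1 = offensive, 0 = not offensive); not real data.
annotations = {
    "ground_truth": [1, 0, 1, 1, 0, 0, 1, 0],
    "annotator_1": [1, 0, 1, 0, 0, 0, 1, 0],
    "annotator_2": [1, 1, 1, 0, 0, 0, 1, 0],
}

for pair, kappa in pairwise_kappa(annotations).items():
    print(pair, round(kappa, 3))
print(disagreements(annotations["ground_truth"], annotations["annotator_2"]))
```

Treating the dataset's ground truth as one more rater, as done here, is one simple way to compare it against manual labels; the indices returned by `disagreements` point at the items worth inspecting for subjective interpretation or missing context.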

Keywords:

TweetEval Dataset, Annotation Consistency, Inter-Annotator Agreement, Cohen’s Kappa, Offensive Language Detection, Hybrid Models, Annotator Bias

Published

20-06-2025

Issue

Vol. 5 No. 1 (2025): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, and Adham Saheer, “Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset”, IJERA, vol. 5, no. 1, Jun. 2025, Accessed: Apr. 25, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/312
