Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Fabeela Ali Rawther; Abhinay A K; Anagha Tess B; Alan Joseph; Adham Saheer

Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Authors

Fabeela Ali Rawther

Amal Jyothi College Of Engineering

Author
Abhinay A K

Amal Jyothi College of Engineering,

Author
Anagha Tess B

Amal Jyothi College of Engineering,

Author
Alan Joseph

Amal Jyothi College of Engineering,

Author
Adham Saheer

Amal Jyothi College of Engineering,

Author

Abstract

Most machine learning models are not only highly
dependent on difficult datasets but also on the quality of labeled
data they are trained on, especially for offensive content detection.
In this paper, we study the TweetEval dataset to provide a
comparison of its ground truth with manually annotated labels;
inter-annotator agreements are applied here as a metric for
assessing the consistency of annotation. Cohen’s Kappa coefficient
is used to quantify how much each pair of annotators agreed and
where they differed. In-depth examination of missed classifications
demonstrates other difficulties with manual labelling: subjective
interpretation, context dependency, and annotator bias. The in-
sights gathered demonstrate how manual annotation can have
positive and negative effects on further model training practices,
highlighting the importance of standardized annotation guidelines.
In their actions, the findings contribute to enhancing offensive
content detection models by advocating dataset reliability and the
reduction of inconsistencies in labeling.

Keywords:

—TweetEval Dataset, Annotation Consistency, Inter- Annotator Agreement,Cohen’s Kappa,, Offensive Language Detection, Hybrid Models,Annotator Bias

Downloads 0

Full Text (PDF)

Published

20-06-2025

Issue

Vol. 5 No. 1 (2025): IJERA

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

How to Cite

[1]

Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, and Adham Saheer, “Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset”, IJERA, vol. 5, no. 1, Jun. 2025, Accessed: Apr. 23, 2026. [Online]. Available: https://ijera.in/index.php/IJERA/article/view/312

Download Citation

Indexed By

Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset

Authors

Fabeela Ali Rawther

Abhinay A K

Anagha Tess B

Alan Joseph

Adham Saheer

Abstract

Keywords:

Published

Issue

Section

License

How to Cite

Similar Articles

Similar Articles

Personality Profiling Using CV Analysis

A Reliable Method for Detecting Brain Tumors in Magnetic Resonance Images Utilizing EfficientNet

Potato Leaf Disease Detection Using VIT

Wildlife Detection And Recognition Using YOLO V8

Deep Learning for Cyber Threat Detection

The Carbon footprint of Machine Learning Models

Unveiling Stress through Facial Expressions: A Literature Review on Detection Methods

AI Enabled Robot for Data Collection in Unreachable and Extreme Environment

Traffic Violation Detection Using Machine Learning: A Comprehensive Study

DeepScan : A Deepfake Video Detection System