Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset
Abstract
Most machine learning models are not only highly
dependent on difficult datasets but also on the quality of labeled
data they are trained on, especially for offensive content detection.
In this paper, we study the TweetEval dataset to provide a
comparison of its ground truth with manually annotated labels;
inter-annotator agreements are applied here as a metric for
assessing the consistency of annotation. Cohen’s Kappa coefficient
is used to quantify how much each pair of annotators agreed and
where they differed. In-depth examination of missed classifications
demonstrates other difficulties with manual labelling: subjective
interpretation, context dependency, and annotator bias. The in-
sights gathered demonstrate how manual annotation can have
positive and negative effects on further model training practices,
highlighting the importance of standardized annotation guidelines.
In their actions, the findings contribute to enhancing offensive
content detection models by advocating dataset reliability and the
reduction of inconsistencies in labeling.
Keywords:
—TweetEval Dataset, Annotation Consistency, Inter- Annotator Agreement,Cohen’s Kappa,, Offensive Language Detection, Hybrid Models,Annotator BiasPublished
Issue
Section
License
Copyright (c) 2025 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Alan Joseph, A K Abhinay, Dr. Gee Varghese Titus, Anagha Tess B, Adham Saheer, Fabeela Ali Rawther, Comparative Analysis of Text Classification Models for Offensive Language Detection on Social Media Platforms , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Fabeela Ali Rawther, Abhinay A K, Anagha Tess B, Alan Joseph, Adham Saheer, Survey of Machine Learning and Deep Learning Approaches for Automated Hate Speech Detection and Sentiment Analysis in Multilingual Contexts , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Mekha Jose, Jocelyn Anthony, Jose V Joseph, Joshwa Thomas, Sharon Baby Thomas, A Review of Machine Learning and Deep Learning Approaches for Offensive Text Detection , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Ansamol Varghese, Anandhu Anoj, Emil Thomas, Deepta K Sunny, Angel Thomas, TrueNews: AI Powered Detection of Manipulated Text and Images , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Sreyas George, Gregan George, Ruth Tennyson, Rishil Shajan, Dr. Juby Mathew, MindPulse: Employee Mental Health Detection and Attrition Prediction App , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Syam Gopi, Evelyn Susan Jacob, Joel John, Raynell Rajeev, Steve Alex, Survey on AI Malware Detection Methods and Cybersecurity Education , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- R Karthika, Maria Toms, S R Aadrash, P U Prabath, InsightAI: Bridging Natural Language and Data Analytics , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Richa Maria Biju, Merwin Maria Antony, Mishal Rose Thankachan, Joshua John Sajit, Bini M Issac, Enhancing Image Forgery Detection with Multi-Modal Deep Learning and Statistical Methods , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Lis Jose , Achyuth P Murali, Christin Joseph Shaji, Christy Kunjumon Peter , Multiple Detection and Diagnosis of Skin Diseases using CNN , International Journal on Emerging Research Areas: Vol. 4 No. 1 (2024): IJERA
- Anu Rose Joy, An overview of Fake News DetectionusingBidirectional Long Short-TermMemory(BiLSTM)Models , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
You may also start an advanced similarity search for this article.
