Evaluating Annotation Consistency in Offensive Language Detection: A Data Analytics Approach on the TweetEval Dataset
Abstract
Most machine learning models are not only highly
dependent on difficult datasets but also on the quality of labeled
data they are trained on, especially for offensive content detection.
In this paper, we study the TweetEval dataset to provide a
comparison of its ground truth with manually annotated labels;
inter-annotator agreements are applied here as a metric for
assessing the consistency of annotation. Cohen’s Kappa coefficient
is used to quantify how much each pair of annotators agreed and
where they differed. In-depth examination of missed classifications
demonstrates other difficulties with manual labelling: subjective
interpretation, context dependency, and annotator bias. The in-
sights gathered demonstrate how manual annotation can have
positive and negative effects on further model training practices,
highlighting the importance of standardized annotation guidelines.
In their actions, the findings contribute to enhancing offensive
content detection models by advocating dataset reliability and the
reduction of inconsistencies in labeling.
Keywords:
—TweetEval Dataset, Annotation Consistency, Inter- Annotator Agreement,Cohen’s Kappa,, Offensive Language Detection, Hybrid Models,Annotator BiasPublished
Issue
Section
License
Copyright (c) 2025 International Journal on Emerging Research Areas

This work is licensed under a Creative Commons Attribution 4.0 International License.
All published work in this journal is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
How to Cite
Similar Articles
- Elisabeth Thomas, Arjun Saji, Aswin M S, Augustine Salas, Emil Viju, A Comprehensive Review of Advancing Cattle Monitoring and Behavior Classification using Deep Learning , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Dr nitha C Vellayudan, Akshay K.P, Muhamed Adhil P.M, C.A Sivasankar , Crop Yield and Price Prediction , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Alan K George, Arpita Mary Mathew, Asin Mary Jacob, Elizabeth Antony, Shiney Thomas, Classification of Lung Cancer Subtypes Using Deep Learning Model , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Sandra Saji, Melbin Mathew, Angel Mariya S, Amrutha Mugesh, Jincy Lukose, MACHINE LEARNING FOR DETECTION AND PREDICTION OF TOMATO LEAF DISEASES , International Journal on Emerging Research Areas: Vol. 3 No. 1 (2023): IJERA
- Mekha Jose, Avin Joshy, Abishek R Paleri, Athul Mohan, Ali Jasim R M, A Review on Contribution and Influence of Artificial Intelligence in Road Safety and Optimal Routing , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Fabeela Ali Rawther, Raihana Rasaldeen, Stefi Marshal Fernandez, Irin Rose Jaison, Ria Mariam Mathews, A Survey on Automating Answer-Sheet Evaluation Using AI Techniques , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Jesvin Jelson , Kesiya Rachel Johns, Mehak , Ken Jacob Zachariah, Neenu R, Custom Cart – Virtual try-on in e-commerce platforms using generative AI , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Aksa Ann Jacob, Midhun P Mathew, Adarsh S, Aaron Tom Viji, Aleena Varghese, A STUDY ON DISEASE DETECTION AND REMEDY IDENTIFICATION IN LEAVES , International Journal on Emerging Research Areas: Vol. 5 No. 1 (2025): IJERA
- Anishamol Abraham, Elbin Santhosh, Diliya Saji, Edwin Roy, Catherine Achu Punnoose, AI Revolutionizing Fashion: A Review of Algorithms and Applications , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
- Aron Thomas , Abhinav B Kannanthanam , Elzabeth Bobus , Adhil Salim , Elizabeth Jullu , R Neenu, A Hybrid SQL Query Execution Model for JSON Data: Balancing Resource Efficiency and Analytical Performance , International Journal on Emerging Research Areas: Vol. 4 No. 2 (2024): IJERA
You may also start an advanced similarity search for this article.
