๐ฎ
๐ฎ
The Ethereal
Determination of toxic comments and unintended model bias minimization using Deep learning approach
November 08, 2023 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: BERT_unintended_bias.ipynb, EDA_toxic_dataset.ipynb, Logistic_regression_Model_bias.ipynb, README.md
Authors
Md Azim Khan
arXiv ID
2311.04789
Category
cs.LG: Machine Learning
Cross-listed
cs.CL,
cs.CY
Citations
1
Venue
arXiv.org
Repository
https://github.com/zim10/Determine_Toxic_comment_and_identity_bias.git
Last Checked
3 months ago
Abstract
Online conversations can be toxic and subjected to threats, abuse, or harassment. To identify toxic text comments, several deep learning and machine learning models have been proposed throughout the years. However, recent studies demonstrate that because of the imbalances in the training data, some models are more likely to show unintended biases including gender bias and identity bias. In this research, our aim is to detect toxic comment and reduce the unintended bias concerning identity features such as race, gender, sex, religion by fine-tuning an attention based model called BERT(Bidirectional Encoder Representation from Transformers). We apply weighted loss to address the issue of unbalanced data and compare the performance of a fine-tuned BERT model with a traditional Logistic Regression model in terms of classification and bias minimization. The Logistic Regression model with the TFIDF vectorizer achieve 57.1% accuracy, and fine-tuned BERT model's accuracy is 89%. Code is available at https://github.com/zim10/Determine_Toxic_comment_and_identity_bias.git
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal