Leveraging Large Language Models and Topic Modeling for Toxicity Classification
November 26, 2024 ยท Entered Twilight ยท ๐ International Conference on Computing, Networking and Communications
Repo contents: DEMOGRAPHIC_ANALYSIS.ipynb, FINETUNE_BERTWEET.ipynb, FINETUNE_HATEBERT.ipynb, LDA_SPLIT.ipynb, LDA_TOPIC_MODELING.ipynb, NLPOSITIONALITY_ANALYSIS.ipynb, OTHER_LLMs.ipynb, README.md, SAVE_CSV.ipynb, TOPIC_FINETUNE_BERTWEET.ipynb, TOPIC_FINETUNE_HATEBERT.ipynb
Authors
Haniyeh Ehsani Oskouie, Christina Chance, Claire Huang, Margaret Capetz, Elizabeth Eyeson, Majid Sarrafzadeh
arXiv ID
2411.17876
Category
cs.CL: Computation & Language
Cross-listed
cs.LG
Citations
5
Venue
International Conference on Computing, Networking and Communications
Repository
https://github.com/aheldis/Toxicity-Classification.git
โญ 1
Last Checked
3 months ago
Abstract
Content moderation and toxicity classification represent critical tasks with significant social implications. However, studies have shown that major classification models exhibit tendencies to magnify or reduce biases and potentially overlook or disadvantage certain marginalized groups within their classification processes. Researchers suggest that the positionality of annotators influences the gold standard labels in which the models learned from propagate annotators' bias. To further investigate the impact of annotator positionality, we delve into fine-tuning BERTweet and HateBERT on the dataset while using topic-modeling strategies for content moderation. The results indicate that fine-tuning the models on specific topics results in a notable improvement in the F1 score of the models when compared to the predictions generated by other prominent classification models such as GPT-4, PerspectiveAPI, and RewireAPI. These findings further reveal that the state-of-the-art large language models exhibit significant limitations in accurately detecting and interpreting text toxicity contrasted with earlier methodologies. Code is available at https://github.com/aheldis/Toxicity-Classification.git.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐ฎ
๐ฎ
The Ethereal
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age