Automatic Detection of Sexist Statements Commonly Used at the Workplace

July 08, 2020 · Declared Dead · 🏛 Lecture Notes in Computer Science

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Dylan Grosz, Patricia Conde-Cespedes arXiv ID 2007.04181 Category cs.CL: Computation & Language Cross-listed cs.CY, cs.LG Citations 40 Venue Lecture Notes in Computer Science Last Checked 4 months ago

Abstract

Detecting hate speech in the workplace is a unique classification task, as the underlying social context implies a subtler version of conventional hate speech. Applications regarding a state-of the-art workplace sexism detection model include aids for Human Resources departments, AI chatbots and sentiment analysis. Most existing hate speech detection methods, although robust and accurate, focus on hate speech found on social media, specifically Twitter. The context of social media is much more anonymous than the workplace, therefore it tends to lend itself to more aggressive and "hostile" versions of sexism. Therefore, datasets with large amounts of "hostile" sexism have a slightly easier detection task since "hostile" sexist statements can hinge on a couple words that, regardless of context, tip the model off that a statement is sexist. In this paper we present a dataset of sexist statements that are more likely to be said in the workplace as well as a deep learning model that can achieve state-of-the art results. Previous research has created state-of-the-art models to distinguish "hostile" and "benevolent" sexism based simply on aggregated Twitter data. Our deep learning methods, initialized with GloVe or random word embeddings, use LSTMs with attention mechanisms to outperform those models on a more diverse, filtered dataset that is more targeted towards workplace sexism, leading to an F1 score of 0.88.