Resources and benchmark corpora for hate speech detection: a systematic review
Hate Speech in social media is a complex phenomenon, whose detection has recently
gained significant traction in the Natural Language Processing community, as attested by …
gained significant traction in the Natural Language Processing community, as attested by …
The psychological well-being of content moderators: the emotional labor of commercial moderation and avenues for improving support
M Steiger, TJ Bharucha, S Venkatagiri… - Proceedings of the …, 2021 - dl.acm.org
An estimated 100,000 people work today as commercial content moderators. These
moderators are often exposed to disturbing content, which can lead to lasting psychological …
moderators are often exposed to disturbing content, which can lead to lasting psychological …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Dealing with disagreements: Looking beyond the majority vote in subjective annotations
Majority voting and averaging are common approaches used to resolve annotator
disagreements and derive single ground truth labels from multiple annotations. However …
disagreements and derive single ground truth labels from multiple annotations. However …
Hatebert: Retraining bert for abusive language detection in english
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language
detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit …
detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit …
Latent hatred: A benchmark for understanding implicit hate speech
Hate speech has grown significantly on social media, causing serious consequences for
victims of all demographics. Despite much attention being paid to characterize and detect …
victims of all demographics. Despite much attention being paid to characterize and detect …
Social biases in NLP models as barriers for persons with disabilities
Building equitable and inclusive NLP technologies demands consideration of whether and
how social attitudes are represented in ML models. In particular, representations encoded in …
how social attitudes are represented in ML models. In particular, representations encoded in …
Directions in abusive language training data, a systematic review: Garbage in, garbage out
Data-driven and machine learning based approaches for detecting, categorising and
measuring abusive content such as hate speech and harassment have gained traction due …
measuring abusive content such as hate speech and harassment have gained traction due …
Misogyny detection in twitter: a multilingual and cross-domain study
The freedom of expression given by social media has a dark side: the growing proliferation
of abusive contents on these platforms. Misogynistic speech is a kind of abusive language …
of abusive contents on these platforms. Misogynistic speech is a kind of abusive language …
A survey of race, racism, and anti-racism in NLP
Despite inextricable ties between race and language, little work has considered race in NLP
research and development. In this work, we survey 79 papers from the ACL anthology that …
research and development. In this work, we survey 79 papers from the ACL anthology that …