Resources and benchmark corpora for hate speech detection: a systematic review
Hate Speech in social media is a complex phenomenon, whose detection has recently
gained significant traction in the Natural Language Processing community, as attested by …
gained significant traction in the Natural Language Processing community, as attested by …
A literature review of textual hate speech detection methods and datasets
F Alkomah, X Ma - Information, 2022 - mdpi.com
Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …
communities. Hate speech is complex and multifaceted harmful or offensive content …
Holistic evaluation of language models
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …
technologies, but their capabilities, limitations, and risks are not well understood. We present …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Evaluating the social impact of generative ai systems in systems and society
Generative AI systems across modalities, ranging from text, image, audio, and video, have
broad social impacts, but there exists no official standard for means of evaluating those …
broad social impacts, but there exists no official standard for means of evaluating those …
On the dangers of stochastic parrots: Can language models be too big?🦜
The past 3 years of work in NLP have been characterized by the development and
deployment of ever larger language models, especially for English. BERT, its variants, GPT …
deployment of ever larger language models, especially for English. BERT, its variants, GPT …
Dealing with disagreements: Looking beyond the majority vote in subjective annotations
Majority voting and averaging are common approaches used to resolve annotator
disagreements and derive single ground truth labels from multiple annotations. However …
disagreements and derive single ground truth labels from multiple annotations. However …
Into the laion's den: Investigating hate in multimodal datasets
AbstractScale the model, scale the data, scale the compute'is the reigning sentiment in the
world of generative AI today. While the impact of model scaling has been extensively …
world of generative AI today. While the impact of model scaling has been extensively …
Dynabench: Rethinking benchmarking in NLP
We introduce Dynabench, an open-source platform for dynamic dataset creation and model
benchmarking. Dynabench runs in a web browser and supports human-and-model-in-the …
benchmarking. Dynabench runs in a web browser and supports human-and-model-in-the …
[PDF][PDF] XHate-999: Analyzing and detecting abusive language across domains and languages
We present XHATE-999, a multi-domain and multilingual evaluation data set for abusive
language detection. By aligning test instances across six typologically diverse languages …
language detection. By aligning test instances across six typologically diverse languages …