Cobra frames: Contextual reasoning about effects and harms of offensive statements

X Zhou, H Zhu, A Yerukola, T Davidson… - arxiv preprint arxiv …, 2023 - arxiv.org
Warning: This paper contains content that may be offensive or upsetting. Understanding the
harms and offensiveness of statements requires reasoning about the social and situational …

Hatemoji: A test suite and adversarially-generated dataset for benchmarking and detecting emoji-based hate

HR Kirk, B Vidgen, P Röttger, T Thrush… - arxiv preprint arxiv …, 2021 - arxiv.org
Detecting online hate is a complex task, and low-performing models have harmful
consequences when used for sensitive applications such as content moderation. Emoji …