[PDF][PDF] Overview of the multilingual text detoxification task at pan 2024
Despite different countries and social platform regulations, digital abusive speech persists
as a significant challenge. One of the way to tackle abusive, or more specifically, toxic …
as a significant challenge. One of the way to tackle abusive, or more specifically, toxic …
Through the lens of core competency: Survey on evaluation of large language models
From pre-trained language model (PLM) to large language model (LLM), the field of natural
language processing (NLP) has witnessed steep performance gains and wide practical …
language processing (NLP) has witnessed steep performance gains and wide practical …
Learning interpretable style embeddings via prompting llms
Style representation learning builds content-independent representations of author style in
text. Stylometry, the analysis of style in text, is often performed by expert forensic linguists …
text. Stylometry, the analysis of style in text, is often performed by expert forensic linguists …
Subtle misogyny detection and mitigation: An expert-annotated dataset
B Sheppard, A Richter, A Cohen, EA Smith… - arxiv preprint arxiv …, 2023 - arxiv.org
Using novel approaches to dataset development, the Biasly dataset captures the nuance
and subtlety of misogyny in ways that are unique within the literature. Built in collaboration …
and subtlety of misogyny in ways that are unique within the literature. Built in collaboration …
CMD: a framework for Context-aware Model self-Detoxification
Text detoxification aims to minimize the risk of language models producing toxic content.
Existing detoxification methods of directly constraining the model output or further training …
Existing detoxification methods of directly constraining the model output or further training …
Multilingual content moderation: A case study on Reddit
Content moderation is the process of flagging content based on pre-defined platform rules.
There has been a growing need for AI moderators to safeguard users as well as protect the …
There has been a growing need for AI moderators to safeguard users as well as protect the …
COUNT: COntrastive UNlikelihood text style transfer for text detoxification
Offensive and toxic text on social media platforms can lead to polarization and divisiveness
within online communities and hinders constructive dialogue. Text detoxification is a crucial …
within online communities and hinders constructive dialogue. Text detoxification is a crucial …
Demonstrations are all you need: Advancing offensive content paraphrasing using in-context learning
Paraphrasing of offensive content is a better alternative to content removal and helps
improve civility in a communication environment. Supervised paraphrasers; however, rely …
improve civility in a communication environment. Supervised paraphrasers; however, rely …
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting
Most existing stylistic text rewriting methods and evaluation metrics operate on a sentence
level, but ignoring the broader context of the text can lead to preferring generic, ambiguous …
level, but ignoring the broader context of the text can lead to preferring generic, ambiguous …
Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios
Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing
systems. While Large Language Models (LLMs) can generate valuable data for safety …
systems. While Large Language Models (LLMs) can generate valuable data for safety …