A general framework for inference-time scaling and steering of diffusion models

R Singhal, Z Horvitz, R Teehan, M Ren, Z Yu… - arxiv preprint arxiv …, 2025 - arxiv.org
Diffusion models produce impressive results in modalities ranging from images and video to
protein design and text. However, generating samples with user-specified properties …

[PDF][PDF] PAN 2024 multilingual TextDetox: exploring different regimes for synthetic data training for multilingual text detoxification

N Sushko - Working Notes of CLEF, 2024 - downloads.webis.de
Multilingual text detoxification is a style transfer task of creating neutral versions of toxic texts
across multiple languages. In this paper, we use a mix of real and synthetic data to build a …

[PDF][PDF] Marsan at PAN 2024 TextDetox: ToxiCleanse RL and paving the way for toxicity-free online discourse

M Najafi, E Tavan, S Colreavy - Working Notes of CLEF, 2024 - downloads.webis.de
Addressing the pervasive issue of toxicity in online communication requires innovative
solutions beyond mere identification and removal of harmful content. This paper presents …

[PDF][PDF] Multilingual text detoxification using google cloud translation and post-processing

Z Luo, M Luo, A Wang - Working Notes of CLEF, 2024 - ceur-ws.org
The task of text detoxification aims to re-write toxic text into non-toxic text. Though existing
methods have achieved impressive detoxification performance in monolingual settings …

[PDF][PDF] A multilingual text detoxification method based on few-shot learning and CO-STAR framework

J Peng, Z Han, H Zhang, J Ye, C Liu, B Liu… - Working Notes of …, 2024 - ceur-ws.org
Multilingual text detoxification is a natural language processing downstream task that inputs
toxic sentences, and then outputs a neutral version that preserves the original meaning and …

Multilingual and Explainable Text Detoxification with Parallel Corpora

D Dementieva, N Babakov, A Ronen, AA Ayele… - arxiv preprint arxiv …, 2024 - arxiv.org
Even with various regulations in place across countries and social media platforms
(Government of India, 2021; European Parliament and Council of the European Union …

CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models

X Wang, J Pan, L Jiang, L Ding, X Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite their impressive capabilities, large language models (LLMs) often lack
interpretability and can generate toxic content. While using LLMs as foundation models and …

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Y Deng, Y Yang, J Zhang, W Wang, B Li - arxiv preprint arxiv:2502.05163, 2025 - arxiv.org
The rapid advancement of large language models (LLMs) has increased the need for
guardrail models to ensure responsible use, particularly in detecting unsafe and illegal …

RAGthoven: A Configurable Toolkit for RAG-enabled LLM Experimentation

G Karetka, D Skottis, L Dutková, P Hraška… - Proceedings of the …, 2025 - aclanthology.org
Abstract Large Language Models (LLMs) have significantly altered the landscape of Natural
Language Processing (NLP), having topped the benchmarks of many standard tasks and …

TextClass Benchmark: A Continuous Elo Rating of LLMs in Social Sciences

B González-Bustamante - arxiv preprint arxiv:2412.00539, 2024 - arxiv.org
The TextClass Benchmark project is an ongoing, continuous benchmarking process that
aims to provide a comprehensive, fair, and dynamic evaluation of LLMs and transformers for …