Iterative reasoning preference optimization

RY Pang, W Yuan, H He, K Cho… - Advances in …, 2025 - proceedings.neurips.cc
Iterative preference optimization methods have recently been shown to perform well for
general instruction tuning tasks, but typically make little improvement on reasoning tasks. In …

Culturally aware and adapted nlp: A taxonomy and a survey of the state of the art

CC Liu, I Gurevych, A Korhonen - ar** for zero-shot cross-lingual transfer in large language models
L Bandarkar, B Muller, P Yuvraj, R Hou… - ar**, is the practice of combining different models with
the same architecture together without further training. In this work, we present a model …