- Academic Search

RY Pang, W Yuan, H He, K Cho… - Advances in …, 2025 - proceedings.neurips.cc

Iterative preference optimization methods have recently been shown to perform well for
general instruction tuning tasks, but typically make little improvement on reasoning tasks. In …

Tallenna Viittaa Viittausten määrä 91 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Culturally aware and adapted nlp: A taxonomy and a survey of the state of the art

CC Liu, I Gurevych, A Korhonen - ar** for zero-shot cross-lingual transfer in large language models

L Bandarkar, B Muller, P Yuvraj, R Hou… - ar**, is the practice of combining different models with
the same architecture together without further training. In this work, we present a model …

Tallenna Viittaa Viittausten määrä 2 Aiheeseen liittyviä artikkeleita Kaikki 4 versiota HTML-versio

Luo ilmoitus

Viittaa

Tarkennettu haku

Tallennettu omaan kirjastoon

Mapo: Advancing multilingual reasoning through multilingual alignment-as-preference optimization

Iterative reasoning preference optimization

Culturally aware and adapted nlp: A taxonomy and a survey of the state of the art