Large Language Models Can Self-Improve in Long-context Reasoning
Large language models (LLMs) have achieved substantial progress in processing long
contexts but still struggle with long-context reasoning. Existing approaches typically involve …
contexts but still struggle with long-context reasoning. Existing approaches typically involve …
Memorization Inheritance in Sequence-Level Knowledge Distillation for Neural Machine Translation
In this work, we explore how instance-level memorization in the teacher Neural Machine
Translation (NMT) model gets inherited by the student model in sequence-level knowledge …
Translation (NMT) model gets inherited by the student model in sequence-level knowledge …
A Bayesian Optimization Approach to Machine Translation Reranking
Reranking a list of candidates from a machine translation system with an external scoring
model and returning the highest-scoring candidate remains a simple and effective method …
model and returning the highest-scoring candidate remains a simple and effective method …