Neural machine translation for low-resource languages: A survey

S Ranathunga, ESA Lee, M Prifti Skenduli… - ACM Computing …, 2023 - dl.acm.org
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …

Neural machine translation: A review

F Stahlberg - Journal of Artificial Intelligence Research, 2020 - jair.org
The field of machine translation (MT), the automatic translation of written text from one
natural language into another, has experienced a major paradigm shift in recent years …

Rlaif: Scaling reinforcement learning from human feedback with ai feedback

H Lee, S Phatale, H Mansoor, KR Lu, T Mesnard… - 2023 - openreview.net
Reinforcement learning from human feedback (RLHF) is an effective technique for aligning
large language models (LLMs) to human preferences, but gathering high-quality human …

Refiner: Reasoning feedback on intermediate representations

D Paul, M Ismayilzada, M Peyrard, B Borges… - arxiv preprint arxiv …, 2023 - arxiv.org
Language models (LMs) have recently shown remarkable performance on reasoning tasks
by explicitly generating intermediate inferences, eg, chain-of-thought prompting. However …

Back to basics: Revisiting reinforce style optimization for learning from human feedback in llms

A Ahmadian, C Cremer, M Gallé, M Fadaee… - arxiv preprint arxiv …, 2024 - arxiv.org
AI alignment in the shape of Reinforcement Learning from Human Feedback (RLHF) is
increasingly treated as a crucial ingredient for high performance large language models …

Deep reinforcement learning: A survey

X Wang, S Wang, X Liang, D Zhao… - … on Neural Networks …, 2022 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) integrates the feature representation ability of deep
learning with the decision-making ability of reinforcement learning so that it can achieve …

Factually consistent summarization via reinforcement learning with textual entailment feedback

P Roit, J Ferret, L Shani, R Aharoni, G Cideron… - arxiv preprint arxiv …, 2023 - arxiv.org
Despite the seeming success of contemporary grounded text generation systems, they often
tend to generate factually inconsistent text with respect to their input. This phenomenon is …

Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer
In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

Rlaif vs. rlhf: Scaling reinforcement learning from human feedback with ai feedback

H Lee, S Phatale, H Mansoor, T Mesnard… - arxiv preprint arxiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) has proven effective in aligning large
language models (LLMs) with human preferences, but gathering high-quality preference …

A survey on non-autoregressive generation for neural machine translation and beyond

Y **ao, L Wu, J Guo, J Li, M Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Non-autoregressive (NAR) generation, which is first proposed in neural machine translation
(NMT) to speed up inference, has attracted much attention in both machine learning and …