Rl4f: Generating natural language feedback with reinforcement learning for repairing model outputs

AF Akyürek, E Akyürek, A Madaan, A Kalyan… - arxiv preprint arxiv …, 2023 - arxiv.org
Despite their unprecedented success, even the largest language models make mistakes.
Similar to how humans learn and improve using feedback, previous work proposed …

Interactive natural language processing

Z Wang, G Zhang, K Yang, N Shi, W Zhou… - arxiv preprint arxiv …, 2023 - arxiv.org
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …

Chain-of-experts: When llms meet complex operations research problems

Z **ao, D Zhang, Y Wu, L Xu, YJ Wang… - The twelfth …, 2023 - openreview.net
Large language models (LLMs) have emerged as powerful techniques for various NLP
tasks, such as mathematical reasoning and plan generation. In this paper, we study …

On improving summarization factual consistency from natural language feedback

Y Liu, B Deb, M Teruel, A Halfaker, D Radev… - arxiv preprint arxiv …, 2022 - arxiv.org
Despite the recent progress in language generation models, their outputs may not always
meet user expectations. In this work, we study whether informational feedback in natural …

Edit5: Semi-autoregressive text-editing with t5 warm-start

J Mallinson, J Adamek, E Malmi, A Severyn - arxiv preprint arxiv …, 2022 - arxiv.org
We present EdiT5-a novel semi-autoregressive text-editing model designed to combine the
strengths of non-autoregressive text-editing and autoregressive decoding. EdiT5 is faster …

On evaluating and mitigating gender biases in multilingual settings

A Vashishtha, K Ahuja, S Sitaram - arxiv preprint arxiv:2307.01503, 2023 - arxiv.org
While understanding and removing gender biases in language models has been a long-
standing problem in Natural Language Processing, prior research work has primarily been …

Search-oriented conversational query editing

K Mao, Z Dou, B Liu, H Qian, F Mo, X Wu… - Findings of the …, 2023 - aclanthology.org
Conversational query rewriting (CQR) realizes conversational search by reformulating the
search dialogue into a standalone rewrite. However, existing CQR models either are not …

SWiPE: A dataset for document-level simplification of Wikipedia pages

P Laban, J Vig, W Kryscinski, S Joty, C **ong… - arxiv preprint arxiv …, 2023 - arxiv.org
Text simplification research has mostly focused on sentence-level simplification, even
though many desirable edits-such as adding relevant background information or reordering …

Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

B Minixhofer, J Pfeiffer, I Vulić - arxiv preprint arxiv:2305.18893, 2023 - arxiv.org
Many NLP pipelines split text into sentences as one of the crucial preprocessing steps. Prior
sentence segmentation tools either rely on punctuation or require a considerable amount of …

Cobias: Contextual reliability in bias assessment

P Govil, H Jain, VK Bonagiri, A Chadha… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) often inherit biases from the web data they are trained on,
which contains stereotypes and prejudices. Current methods for evaluating and mitigating …