[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4

KS Kalyan - Natural Language Processing Journal, 2024 - Elsevier
Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

Z Zhang, Y Yao, A Zhang, X Tang, X Ma, Z He… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have dramatically enhanced the field of language
intelligence, as demonstrably evidenced by their formidable empirical performance across a …

Encouraging divergent thinking in large language models through multi-agent debate

T Liang, Z He, W Jiao, X Wang, Y Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
Modern large language models (LLMs) like ChatGPT have shown remarkable performance
on general language tasks but still struggle on complex reasoning tasks, which drives the …

Prompting palm for translation: Assessing strategies and performance

D Vilar, M Freitag, C Cherry, J Luo, V Ratnakar… - arxiv preprint arxiv …, 2022 - arxiv.org
Large language models (LLMs) that have been trained on multilingual but not parallel text
exhibit a remarkable ability to translate between languages. We probe this ability in an in …

Multilingual large language model: A survey of resources, taxonomy and frontiers

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao… - arxiv preprint arxiv …, 2024 - arxiv.org
Multilingual Large Language Models are capable of using powerful Large Language
Models to handle and respond to queries in multiple languages, which achieves remarkable …

[PDF][PDF] Unifying the perspectives of nlp and software engineering: A survey on language models for code

Z Zhang, C Chen, B Liu, C Liao, Z Gong… - arxiv preprint arxiv …, 2023 - simg.baai.ac.cn
In this work we systematically review the recent advancements in code processing with
language models, covering 50+ models, 30+ evaluation tasks, 170+ datasets, and 700 …

Toward human-like evaluation for natural language generation with error analysis

Q Lu, L Ding, L **e, K Zhang, DF Wong… - arxiv preprint arxiv …, 2022 - arxiv.org
The state-of-the-art language model-based automatic metrics, eg BARTScore, benefiting
from large-scale contextualized pre-training, have been successfully used in a wide range of …

FEDS-ICL: Enhancing translation ability and efficiency of large language model by optimizing demonstration selection

S Zhu, L Pan, D **ong - Information Processing & Management, 2024 - Elsevier
Large language models (LLMs) that exhibit a remarkable ability by in-context learning (ICL)
with bilingual demonstrations have been recognized as a potential solution for machine …

Large language models as analogical reasoners

M Yasunaga, X Chen, Y Li, P Pasupat… - arxiv preprint arxiv …, 2023 - arxiv.org
Chain-of-thought (CoT) prompting for language models demonstrates impressive
performance across reasoning tasks, but typically needs labeled exemplars of the reasoning …

Towards effective disambiguation for machine translation with large language models

V Iyer, P Chen, A Birch - arxiv preprint arxiv:2309.11668, 2023 - arxiv.org
Resolving semantic ambiguity has long been recognised as a central challenge in the field
of machine translation. Recent work on benchmarking translation performance on …