Cognitive mirage: A review of hallucinations in large language models
As large language models continue to develop in the field of AI, text generation systems are
susceptible to a worrisome phenomenon known as hallucination. In this study, we …
Digital forgetting in large language models: A survey of unlearning methods
A Blanco-Justicia, N Jebreel… - Artificial Intelligence …, 2025 - Springer
Large language models (LLMs) have become the state of the art in natural language
processing. The massive adoption of generative LLMs and the capabilities they have shown …
MathDial: A dialogue tutoring dataset with rich pedagogical properties grounded in math reasoning problems
While automatic dialogue tutors hold great potential in making education personalized and
more accessible, research on such systems has been hampered by a lack of sufficiently …
Deep model fusion: A survey
Deep model fusion/merging is an emerging technique that merges the parameters or
predictions of multiple deep learning models into a single one. It combines the abilities of …
Detecting and mitigating hallucinations in multilingual summarisation
Hallucinations pose a significant challenge to the reliability of neural models for abstractive
summarisation. While automatically generated summaries may be fluent, they often lack …
Chat vector: A simple approach to equip LLMs with instruction following and model alignment in new languages
Recently, the development of open-source large language models (LLMs) has advanced
rapidly. Nevertheless, due to data constraints, the capabilities of most open-source LLMs are …
Configurable foundation models: Building LLMs from a modular perspective
Advancements in LLMs have recently unveiled challenges tied to computational efficiency and
continual scalability due to their huge parameter requirements, making the …
Dial BeInfo for faithfulness: Improving factuality of information-seeking dialogue via behavioural fine-tuning
E Razumovskaia, I Vulić, P Marković… - Findings of the …, 2024 - aclanthology.org
Factual faithfulness is a crucial requirement in information-seeking dialogue: the system
should respond to the user queries so that the responses are meaningful and aligned with …
Model merging by uncertainty-based gradient matching
Models trained on different datasets can be merged by a weighted-averaging of their
parameters, but why does it work and when can it fail? Here, we connect the inaccuracy of …
ClimateGPT: Towards AI synthesizing interdisciplinary research on climate change
This paper introduces ClimateGPT, a model family of domain-specific large language
models that synthesize interdisciplinary research on climate change. We trained two 7B …