The rise and potential of large language model based agents: A survey

Transformer with Random-Access Reading for Long-Context Understanding
C Yang, Z Yang, N Hua - arXiv preprint arXiv:2405.13216, 2024 - arxiv.org
Long-context modeling presents a significant challenge for transformer-based large
language models (LLMs) due to the quadratic complexity of the self-attention mechanism …

Out-of-distribution generalisation in spoken language understanding

D Porjazovski, A Moisio, M Kurimo - arXiv preprint arXiv:2407.07425, 2024 - arxiv.org
Test data is said to be out-of-distribution (OOD) when it unexpectedly differs from the training
data, a common challenge in real-world use cases of machine learning. Although OOD …

[PDF] A comprehensive study on LLM agent challenges

P Ingle, M Parab, P Lendave, A Bhanushali, PK Bn - aair-lab.github.io
This paper intricately examines the manifold challenges and inherent issues associated with
Large Language Models (LLMs), both for the models themselves and the human context. It …

Length Generalization with Recursive Neural Networks and Beyond

JR Chowdhury - 2024 - search.proquest.com
We investigate Recursive Neural Networks (RvNNs) for language processing tasks.
Roughly, from a generalized perspective, RvNNs repeatedly apply some neural function on …