Multi-layer transformers gradient can be approximated in almost linear time

Y Liang, Z Sha, Z Shi, Z Song, Y Zhou - arXiv preprint arXiv:2408.13233, 2024 - arxiv.org
The computational complexity of the self-attention mechanism in popular transformer
architectures poses significant challenges for training and inference, and becomes the …

Retrieval augmented generation or long-context LLMs? A comprehensive study and hybrid approach

Z Li, C Li, M Zhang, Q Mei… - Proceedings of the 2024 …, 2024 - aclanthology.org
Retrieval Augmented Generation (RAG) has been a powerful tool for Large
Language Models (LLMs) to efficiently process overly lengthy contexts. However, recent …

Flooding spread of manipulated knowledge in LLM-based multi-agent communities

T Ju, Y Wang, X Ma, P Cheng, H Zhao, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid adoption of large language models (LLMs) in multi-agent systems has highlighted
their impressive capabilities in various applications, such as collaborative problem-solving …

LongMemEval: Benchmarking chat assistants on long-term interactive memory

D Wu, H Wang, W Yu, Y Zhang, KW Chang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent large language model (LLM)-driven chat assistant systems have integrated memory
components to track user-assistant chat histories, enabling more accurate and personalized …

Circuit Complexity Bounds for RoPE-based Transformer Architecture

B Chen, X Li, Y Liang, J Long, Z Shi, Z Song - arXiv preprint arXiv …, 2024 - arxiv.org
Characterizing the expressive power of the Transformer architecture is critical to understanding
its capacity limits and scaling law. Recent works provide the circuit complexity bounds to …

Leveraging Large Language Models for Enhancing Safety in Maritime Operations

T Miller, I Durlik, E Kostecka, A Łobodzińska… - Applied Sciences, 2025 - mdpi.com
Maritime operations play a critical role in global trade but face persistent safety challenges
due to human error, environmental factors, and operational complexities. This review …

What is Wrong with Perplexity for Long-context Language Modeling?

L Fang, Y Wang, Z Liu, C Zhang, S Jegelka… - arXiv preprint arXiv …, 2024 - arxiv.org
Handling long-context inputs is crucial for large language models (LLMs) in tasks such as
extended conversations, document summarization, and many-shot in-context learning …

MobA: A two-level agent system for efficient mobile task automation

Z Zhu, H Tang, Y Li, K Lan, Y Jiang, H Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Current mobile assistants are limited by dependence on system APIs or struggle with
complex user instructions and diverse interfaces due to restricted comprehension and …

Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

YJ Lee, D Lee, J Youn, K Oh, B Ko, J Hyeon… - arXiv preprint arXiv …, 2024 - arxiv.org
Humans share a wide variety of images related to their personal experiences within
conversations via instant messaging tools. However, existing works focus on (1) image …

MemSim: A Bayesian simulator for evaluating memory of LLM-based personal assistants

Z Zhang, Q Dai, L Chen, Z Jiang, R Li, J Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
LLM-based agents have been widely applied as personal assistants, capable of memorizing
information from user messages and responding to personal queries. However, there still …