A comprehensive survey on applications of transformers for deep learning tasks

S Islam, H Elmekki, A Elsebai, J Bentahar… - Expert Systems with …, 2024 - Elsevier
Abstract: Transformers are Deep Neural Networks (DNNs) that utilize a self-attention
mechanism to capture contextual relationships within sequential data. Unlike traditional …

Autonomous driving system: A comprehensive survey

J Zhao, W Zhao, B Deng, Z Wang, F Zhang… - Expert Systems with …, 2024 - Elsevier
Automation is increasingly at the forefront of transportation research, with the potential to
bring fully autonomous vehicles to our roads in the coming years. This comprehensive …

Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc
We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

Offline reinforcement learning as one big sequence modeling problem

M Janner, Q Li, S Levine - Advances in neural information …, 2021 - proceedings.neurips.cc
Reinforcement learning (RL) is typically viewed as the problem of estimating single-step
policies (for model-free RL) or single-step models (for model-based RL), leveraging the …

Megalodon: Efficient llm pretraining and inference with unlimited context length

X Ma, X Yang, W **ong, B Chen, L Yu… - Advances in …, 2025 - proceedings.neurips.cc
The quadratic complexity and weak length extrapolation of Transformers limits their ability to
scale to long sequences, and while sub-quadratic solutions like linear attention and state …

Structured state space models for in-context reinforcement learning

C Lu, Y Schroecker, A Gu, E Parisotto… - Advances in …, 2023 - proceedings.neurips.cc
Structured state space sequence (S4) models have recently achieved state-of-the-art
performance on long-range sequence modeling tasks. These models also have fast …

History aware multimodal transformer for vision-and-language navigation

S Chen, PL Guhur, C Schmid… - Advances in neural …, 2021 - proceedings.neurips.cc
Vision-and-language navigation (VLN) aims to build autonomous visual agents that follow
instructions and navigate in real scenes. To remember previously visited locations and …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Frozen pretrained transformers as universal computation engines

K Lu, A Grover, P Abbeel, I Mordatch - Proceedings of the AAAI …, 2022 - ojs.aaai.org
We investigate the capability of a transformer pretrained on natural language to generalize
to other modalities with minimal finetuning--in particular, without finetuning of the self …

Mega: moving average equipped gated attention

X Ma, C Zhou, X Kong, J He, L Gui, G Neubig… - arXiv preprint arXiv …, 2022 - arxiv.org
The design choices in the Transformer attention mechanism, including weak inductive bias
and quadratic computational complexity, have limited its application for modeling long …