- Academic Search

D Hafner, J Pasukonis, J Ba, T Lillicrap - ar…

Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

Cited by 535 - All 2 versions

[PDF] mlr.press

Bigger, better, faster: Human-level atari with human-level efficiency

M Schwarzer, JSO Ceron, A Courville… - International …, 2023 - proceedings.mlr.press

We introduce a value-based RL agent, which we call BBF, that achieves super-human
performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used …

Cited by 86 - All 8 versions

[PDF] arxiv.org

Gaia-1: A generative world model for autonomous driving

A Hu, L Russell, H Yeo, Z Murez, G Fedoseev… - arXiv preprint arXiv …, 2023 - arxiv.org

Autonomous driving promises transformative improvements to transportation, but building
systems capable of safely navigating the unstructured complexity of real-world scenarios …

Cited by 172 - All 2 versions

[PDF] arxiv.org

Transformers learn shortcuts to automata

B Liu, JT Ash, S Goel, A Krishnamurthy… - arXiv preprint arXiv …, 2022 - arxiv.org

Algorithmic reasoning requires capabilities which are most naturally understood through
recurrent models of computation, like the Turing machine. However, Transformer models …

Cited by 174 - All 4 versions

[PDF] mlr.press

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press

Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

Cited by 139 - All 6 versions

[PDF] arxiv.org

Temporal difference learning for model predictive control

N Hansen, X Wang, H Su - arXiv preprint arXiv:2203.04955, 2022 - arxiv.org

Data-driven model predictive control has two key advantages over model-free methods: a
potential for improved sample efficiency through model learning, and better performance as …

[PDF] arxiv.org

On Transforming Reinforcement Learning With Transformers: The Development Trajectory

S Hu, L Shen, Y Zhang, Y Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Transformers, originally devised for natural language processing (NLP), have also produced
significant successes in computer vision (CV). Due to their strong expression power …

Cited by 33 - All 5 versions

[PDF] springer.com

Advances of machine learning in materials science: Ideas and techniques

SS Chong, YS Ng, HQ Wang, JC Zheng - Frontiers of Physics, 2024 - Springer

In this big data era, the use of large datasets in conjunction with machine learning (ML) has
been increasingly popular in both industry and academia. In recent times, the field of …

Cited by 28 - All 4 versions

[PDF] arxiv.org

Transformers are sample-efficient world models

V Micheli, E Alonso, F Fleuret - arXiv preprint arXiv:2209.00588, 2022 - arxiv.org

Deep reinforcement learning agents are notoriously sample inefficient, which considerably
limits their application to real-world problems. Recently, many model-based methods have …

Cited by 169 - All 6 versions

[PDF] arxiv.org

Manigaussian: Dynamic gaussian splatting for multi-task robotic manipulation

G Lu, S Zhang, Z Wang, C Liu, J Lu, Y Tang - European Conference on …, 2024 - Springer

Performing language-conditioned robotic manipulation tasks in unstructured environments
is highly demanded for general intelligent robots. Conventional robotic manipulation …

Cited by 27 - All 2 versions

Mastering atari games with limited data

Mastering diverse domains through world models
