Google Scholar

Convolutional neural networks as a model of the visual system: Past, present, and future

GW Lindsay - Journal of cognitive neuroscience, 2021 - direct.mit.edu

Convolutional neural networks (CNNs) were inspired by early findings in the study of
biological vision. They have since become successful tools in computer vision and state-of …

Cited by 613

Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning

J Hua, L Zeng, G Li, Z Ju - Sensors, 2021 - mdpi.com

Dexterous manipulation of the robot is an important part of realizing intelligence, but
manipulators can only perform simple tasks such as sorting and packing in a structured …

Cited by 231

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

Cited by 858

Cal-QL: Calibrated offline RL pre-training for efficient online fine-tuning

M Nakamoto, S Zhai, A Singh… - Advances in …, 2024 - proceedings.neurips.cc

A compelling use case of offline reinforcement learning (RL) is to obtain a policy initialization
from existing datasets followed by fast online fine-tuning with limited interaction. However …

Cited by 102

Foundation models for decision making: Problems, methods, and opportunities

S Yang, O Nachum, Y Du, J Wei, P Abbeel… - arXiv preprint arXiv …, 2023 - arxiv.org

Foundation models pretrained on diverse data at scale have demonstrated extraordinary
capabilities in a wide range of vision and language tasks. When such models are deployed …

Cited by 146

Behavior Transformers: Cloning modes with one stone

NM Shafiullah, Z Cui… - Advances in neural …, 2022 - proceedings.neurips.cc

While behavior learning has made impressive progress in recent times, it lags behind
computer vision and natural language processing due to its inability to leverage large …

Cited by 190

Solving Rubik's Cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arXiv preprint arXiv …, 2019 - arxiv.org

We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …

Cited by 1273

robosuite: A modular simulation framework and benchmark for robot learning

Y Zhu, J Wong, A Mandlekar, R Martín-Martín… - arXiv preprint arXiv …, 2020 - arxiv.org

robosuite is a simulation framework for robot learning powered by the MuJoCo physics
engine. It offers a modular design for creating robotic tasks as well as a suite of benchmark …

Cited by 444

Learning dexterous in-hand manipulation

OpenAI, M Andrychowicz, B Baker… - … Journal of Robotics …, 2020 - journals.sagepub.com

We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies that
can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The …

Cited by 1913

dm_control: Software and tasks for continuous control

S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez… - Software Impacts, 2020 - Elsevier

The dm_control software package is a collection of Python libraries and task suites for
reinforcement learning agents in an articulated-body simulation. Infrastructure includes a …

Cited by 411

Reinforcement and imitation learning for diverse visuomotor skills

Convolutional neural networks as a model of the visual system: Past, present, and future
