- Academic Search

F Zeng, W Gan, Y Wang, N Liu, PS Yu - arxiv preprint arxiv:2311.07226, 2023‏ - arxiv.org‏

The human ability to learn, generalize, and control complex manipulation tasks through multi-
modality feedback suggests a unique capability, which we refer to as dexterity intelligence …‏

שמור צטט צוטט על ידי 126 מאמרים בנושא זה כל 3 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

Towards continual reinforcement learning: A review and perspectives‏

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022‏ - jair.org‏

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …‏

שמור צטט צוטט על ידי 356 מאמרים בנושא זה כל 10 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Lm-nav: Robotic navigation with large pre-trained models of language, vision, and action‏

D Shah, B Osiński, S Levine - Conference on robot …, 2023‏ - proceedings.mlr.press‏

Goal-conditioned policies for robotic navigation can be trained on large, unannotated
datasets, providing for good generalization to real-world settings. However, particularly in …‏

שמור צטט צוטט על ידי 448 מאמרים בנושא זה כל 6 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Bridgedata v2: A dataset for robot learning at scale‏

HR Walke, K Black, TZ Zhao, Q Vuong… - … on Robot Learning, 2023‏ - proceedings.mlr.press‏

We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors
designed to facilitate research in scalable robot learning. BridgeData V2 contains 53,896 …‏

שמור צטט צוטט על ידי 121 מאמרים בנושא זה כל 5 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Q-transformer: Scalable offline reinforcement learning via autoregressive q-functions‏

Y Chebotar, Q Vuong, K Hausman… - … on Robot Learning, 2023‏ - proceedings.mlr.press‏

In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …‏

שמור צטט צוטט על ידי 91 מאמרים בנושא זה כל 6 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On the opportunities and risks of foundation models‏

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021‏ - arxiv.org‏

AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …‏

שמור צטט צוטט על ידי 4839 מאמרים בנושא זה כל 2 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Causal machine learning: A survey and open problems‏

J Kaddour, A Lynch, Q Liu, MJ Kusner… - arxiv preprint arxiv …, 2022‏ - arxiv.org‏

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …‏

שמור צטט צוטט על ידי 190 מאמרים בנושא זה כל 2 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Contrastive learning as goal-conditioned reinforcement learning‏

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022‏ - proceedings.neurips.cc‏

In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …‏

שמור צטט צוטט על ידי 150 מאמרים בנושא זה כל 6 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Calvin: A benchmark for language-conditioned policy learning for long-horizon robot manipulation tasks‏

O Mees, L Hermann, E Rosete-Beas… - IEEE Robotics and …, 2022‏ - ieeexplore.ieee.org‏

General-purpose robots coexisting with humans in their environment must learn to relate
human language to their perceptions and actions to be useful in a range of daily tasks …‏

שמור צטט צוטט על ידי 224 מאמרים בנושא זה כל 5 הגרסאות

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Hiql: Offline goal-conditioned rl with latent states as actions‏

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2023‏ - proceedings.neurips.cc‏

Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …‏

שמור צטט צוטט על ידי 55 מאמרים בנושא זה כל 7 הגרסאות פתיחה בתור HTML

יצירת התראה

צטט

חיפוש מתקדם

נשמר בספרייה שלי

Learning to achieve goals

Large language models for robotics: A survey‏

Towards continual reinforcement learning: A review and perspectives‏

Lm-nav: Robotic navigation with large pre-trained models of language, vision, and action‏

Bridgedata v2: A dataset for robot learning at scale‏

Q-transformer: Scalable offline reinforcement learning via autoregressive q-functions‏

On the opportunities and risks of foundation models‏

Causal machine learning: A survey and open problems‏

Contrastive learning as goal-conditioned reinforcement learning‏

Calvin: A benchmark for language-conditioned policy learning for long-horizon robot manipulation tasks‏

Hiql: Offline goal-conditioned rl with latent states as actions‏