Real-world robot applications of foundation models: A review

K Kawaharazuka, T Matsushima… - Advanced …, 2024 - Taylor & Francis
Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …

Generating meaning: active inference and the scope and limits of passive AI

G Pezzulo, T Parr, P Cisek, A Clark, K Friston - Trends in Cognitive …, 2024 - cell.com
Prominent accounts of sentient behavior depict brains as generative models of organismic
interaction with the world, evincing intriguing similarities with current advances in generative …

RT-2: Vision-language-action models transfer web knowledge to robotic control

A Brohan, N Brown, J Carbajal, Y Chebotar… - arXiv preprint arXiv …, 2023 - arxiv.org
We study how vision-language models trained on Internet-scale data can be incorporated
directly into end-to-end robotic control to boost generalization and enable emergent …

Open X-Embodiment: Robotic learning datasets and RT-X models

A O'Neill, A Rehman, A Gupta, A Maddukuri… - arXiv preprint arXiv …, 2023 - arxiv.org
Large, high-capacity models trained on diverse datasets have shown remarkable successes
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …

RT-2: Vision-language-action models transfer web knowledge to robotic control

B Zitkovich, T Yu, S Xu, P Xu, T Xiao… - … on Robot Learning, 2023 - proceedings.mlr.press
We study how vision-language models trained on Internet-scale data can be incorporated
directly into end-to-end robotic control to boost generalization and enable emergent …

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

A O'Neill, A Rehman, A Maddukuri… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Large, high-capacity models trained on diverse datasets have shown remarkable successes
on efficiently tackling downstream applications. In domains from NLP to Computer Vision …

Octo: An open-source generalist robot policy

Octo Model Team, D Ghosh, H Walke, K Pertsch… - arXiv preprint arXiv …, 2024 - arxiv.org
Large policies pretrained on diverse robot datasets have the potential to transform robotic
learning: instead of training new policies from scratch, such generalist robot policies may be …

LIV: Language-image representations and rewards for robotic control

YJ Ma, V Kumar, A Zhang, O Bastani… - International …, 2023 - proceedings.mlr.press
Abstract We present Language-Image Value learning (LIV), a unified objective for vision-
language representation and reward learning from action-free videos with text annotations …

RoboAgent: Generalization and efficiency in robot manipulation via semantic augmentations and action chunking

H Bharadhwaj, J Vakil, M Sharma… - … on Robotics and …, 2024 - ieeexplore.ieee.org
The grand aim of having a single robot that can manipulate arbitrary objects in diverse
settings is at odds with the paucity of robotics datasets. Acquiring and growing such datasets …

ViNT: A foundation model for visual navigation

D Shah, A Sridhar, N Dashora, K Stachowicz… - arXiv preprint arXiv …, 2023 - arxiv.org
General-purpose pre-trained models ("foundation models") have enabled practitioners to
produce generalizable solutions for individual machine learning problems with datasets that …