الباحث العلمي من Google

K Kawaharazuka, T Matsushima… - Advanced …, 2024‏ - Taylor & Francis‏

Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …‏

حفظ اقتباس تم اقتباسها في عدد: 41 مقالات ذات صلة الإصدارات الـ 3كلها

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Voxposer: Composable 3d value maps for robotic manipulation with language models‏

W Huang, C Wang, R Zhang, Y Li, J Wu… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Large language models (LLMs) are shown to possess a wealth of actionable knowledge that
can be extracted for robot manipulation in the form of reasoning and planning. Despite the …‏

حفظ اقتباس تم اقتباسها في عدد: 469 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Octo: An open-source generalist robot policy‏

OM Team, D Ghosh, H Walke, K Pertsch… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Large policies pretrained on diverse robot datasets have the potential to transform robotic
learning: instead of training new policies from scratch, such generalist robot policies may be …‏

حفظ اقتباس تم اقتباسها في عدد: 192 مقالات ذات صلة الإصدارات الـ 3كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Foundation models in robotics: Applications, challenges, and the future‏

R Firoozi, J Tucker, S Tian… - … Journal of Robotics …, 2023‏ - journals.sagepub.com‏

We survey applications of pretrained foundation models in robotics. Traditional deep
learning models in robotics are trained on small datasets tailored for specific tasks, which …‏

حفظ اقتباس تم اقتباسها في عدد: 135 مقالات ذات صلة الإصدارات الـ 5كلها

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A systematic survey of prompt engineering on vision-language foundation models‏

J Gu, Z Han, S Chen, A Beirami, B He, G Zhang… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Prompt engineering is a technique that involves augmenting a large pre-trained model with
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …‏

حفظ اقتباس تم اقتباسها في عدد: 141 مقالات ذات صلة الإصدارات الـ 3كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Zero-shot robotic manipulation with pretrained image-editing diffusion models‏

K Black, M Nakamoto, P Atreya, H Walke… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

If generalist robots are to operate in truly unstructured environments, they need to be able to
recognize and reason about novel objects and scenarios. Such objects and scenarios might …‏

حفظ اقتباس تم اقتباسها في عدد: 96 مقالات ذات صلة الإصدارات الـ 4كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Roboagent: Generalization and efficiency in robot manipulation via semantic augmentations and action chunking‏

H Bharadhwaj, J Vakil, M Sharma… - … on Robotics and …, 2024‏ - ieeexplore.ieee.org‏

The grand aim of having a single robot that can manipulate arbitrary objects in diverse
settings is at odds with the paucity of robotics datasets. Acquiring and growing such datasets …‏

حفظ اقتباس تم اقتباسها في عدد: 94 مقالات ذات صلة الإصدارات الـ 4كلها

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning‏

H He, C Bai, K Xu, Z Yang, W Zhang… - Advances in neural …, 2023‏ - proceedings.neurips.cc‏

Diffusion models have demonstrated highly-expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …‏

حفظ اقتباس تم اقتباسها في عدد: 77 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A comprehensive survey of data augmentation in visual reinforcement learning‏

G Ma, Z Wang, Z Yuan, X Wang, B Yuan… - arxiv preprint arxiv …, 2022‏ - arxiv.org‏

Visual reinforcement learning (RL), which makes decisions directly from high-dimensional
visual inputs, has demonstrated significant potential in various domains. However …‏

حفظ اقتباس تم اقتباسها في عدد: 35 مقالات ذات صلة الإصدارات الـ 2كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Generative skill chaining: Long-horizon skill planning with diffusion models‏

UA Mishra, S Xue, Y Chen… - Conference on Robot …, 2023‏ - proceedings.mlr.press‏

Long-horizon tasks, usually characterized by complex subtask dependencies, present a
significant challenge in manipulation planning. Skill chaining is a practical approach to …‏

حفظ اقتباس تم اقتباسها في عدد: 62 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

Scaling robot learning with semantically imagined experience

Real-world robot applications of foundation models: A review‏

Voxposer: Composable 3d value maps for robotic manipulation with language models‏

Octo: An open-source generalist robot policy‏

Foundation models in robotics: Applications, challenges, and the future‏

A systematic survey of prompt engineering on vision-language foundation models‏

Zero-shot robotic manipulation with pretrained image-editing diffusion models‏

Roboagent: Generalization and efficiency in robot manipulation via semantic augmentations and action chunking‏

Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning‏

A comprehensive survey of data augmentation in visual reinforcement learning‏

Generative skill chaining: Long-horizon skill planning with diffusion models‏