- Academic Search

Y Cao, H Zhao, Y Cheng, T Shu, Y Chen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org

With extensive pretrained knowledge and high-level general capabilities, large language
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …

Cited by 36 · 2 versions

[PDF] arxiv.org

Aligning cyber space with physical world: A comprehensive survey on embodied AI

Y Liu, W Chen, Y Bai, X Liang, G Li, W Gao… - arXiv preprint arXiv …, 2024 - arxiv.org

Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General
Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace …

Cited by 33 · 2 versions

[PDF] arxiv.org

DriveLM: Driving with graph visual question answering

C Sima, K Renz, K Chitta, L Chen, H Zhang… - … on Computer Vision, 2024 - Springer

We study how vision-language models (VLMs) trained on web-scale data can be integrated
into end-to-end driving systems to boost generalization and enable interactivity with human …

Cited by 145 · 5 versions

[PDF] arxiv.org

Photorealistic video generation with diffusion models

A Gupta, L Yu, K Sohn, X Gu, M Hahn, FF Li… - … on Computer Vision, 2024 - Springer

We present WALT, a diffusion transformer for photorealistic video generation from text
prompts. Our approach has two key design decisions. First, we use a causal encoder to …

Cited by 127 · 3 versions

[PDF] arxiv.org

Foundation models in robotics: Applications, challenges, and the future

R Firoozi, J Tucker, S Tian… - … Journal of Robotics …, 2023 - journals.sagepub.com

We survey applications of pretrained foundation models in robotics. Traditional deep
learning models in robotics are trained on small datasets tailored for specific tasks, which …

Cited by 124 · 2 versions

[PDF] arxiv.org

Octo: An open-source generalist robot policy

OM Team, D Ghosh, H Walke, K Pertsch… - arXiv preprint arXiv …, 2024 - arxiv.org

Large policies pretrained on diverse robot datasets have the potential to transform robotic
learning: instead of training new policies from scratch, such generalist robot policies may be …

Cited by 165 · 3 versions

[PDF] arxiv.org

VideoPoet: A large language model for zero-shot video generation

D Kondratyuk, L Yu, X Gu, J Lezama, J Huang… - arXiv preprint arXiv …, 2023 - arxiv.org

We present VideoPoet, a language model capable of synthesizing high-quality video, with
matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder …

Cited by 175 · 5 versions

[PDF] neurips.cc

Large language models as commonsense knowledge for large-scale task planning

Z Zhao, WS Lee, D Hsu - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Large-scale task planning is a major challenge. Recent work exploits large language
models (LLMs) directly as a policy and shows surprisingly interesting results. This paper …

Cited by 176 · 7 versions

[PDF] stanford.edu

NetLLM: Adapting large language models for networking

D Wu, X Wang, Y Qiao, Z Wang, J Jiang, S Cui… - Proceedings of the …, 2024 - dl.acm.org

Many networking tasks now employ deep learning (DL) to solve complex prediction and
optimization problems. However, current design philosophy of DL-based algorithms entails …

Cited by 24 · 3 versions

[PDF] sagepub.com

FMB: A functional manipulation benchmark for generalizable robotic learning

J Luo, C Xu, F Liu, L Tan, Z Lin, J Wu… - … Journal of Robotics …, 2023 - journals.sagepub.com

In this paper, we propose a real-world benchmark for studying robotic learning in the context
of functional manipulation: a robot needs to accomplish complex long-horizon behaviors by …

Cited by 25 · 4 versions

RT-2: Vision-language-action models transfer web knowledge to robotic control

Survey on large language model-enhanced reinforcement learning: Concept, taxonomy, and methods