AI alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

On transforming reinforcement learning with transformers: The development trajectory

S Hu, L Shen, Y Zhang, Y Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Transformers, originally devised for natural language processing (NLP), have also produced
significant successes in computer vision (CV). Due to their strong expression power …

VoxPoser: Composable 3D value maps for robotic manipulation with language models

W Huang, C Wang, R Zhang, Y Li, J Wu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) are shown to possess a wealth of actionable knowledge that
can be extracted for robot manipulation in the form of reasoning and planning. Despite the …

A survey on multimodal large language models for autonomous driving

C Cui, Y Ma, X Cao, W Ye, Y Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …

Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

ChatGPT for robotics: Design principles and model abilities

SH Vemprala, R Bonatti, A Bucker, A Kapoor - IEEE Access, 2024 - ieeexplore.ieee.org
This paper presents an experimental study regarding the use of OpenAI's ChatGPT for
robotics applications. We outline a strategy that combines design principles for prompt …

Language to rewards for robotic skill synthesis

W Yu, N Gileadi, C Fu, S Kirmani, KH Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated exciting progress in acquiring diverse
new capabilities through in-context learning, ranging from logical reasoning to code-writing …

Code as policies: Language model programs for embodied control

J Liang, W Huang, F **a, P Xu… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Large language models (LLMs) trained on code-completion have been shown to be capable
of synthesizing simple Python programs from docstrings [1]. We find that these code-writing …

Ignore previous prompt: Attack techniques for language models

F Perez, I Ribeiro - arXiv preprint arXiv:2211.09527, 2022 - arxiv.org
Transformer-based large language models (LLMs) provide a powerful foundation for natural
language tasks in large-scale customer-facing applications. However, studies that explore …

LanguageMPC: Large language models as decision makers for autonomous driving

H Sha, Y Mu, Y Jiang, L Chen, C Xu, P Luo… - arXiv preprint arXiv …, 2023 - arxiv.org
Existing learning-based autonomous driving (AD) systems face challenges in
comprehending high-level information, generalizing to rare events, and providing …