- Academic Search

Reflexion: Language agents with verbal reinforcement learning

N Shinn, F Cassano, A Gopinath… - Advances in …, 2024 - proceedings.neurips.cc

Large language models (LLMs) have been increasingly used to interact with external
environments (e.g., games, compilers, APIs) as goal-driven agents. However, it remains …

Cited by 1074

[PDF] qub.ac.uk

[PDF] DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.

B Wang, W Chen, H Pei, C …

… large generative language models to supervised tasks may
fail to sufficiently probe models' novel capabilities. Using GPT-3 as a case study, we show …

Cited by 911

[PDF] jair.org

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org

The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

Cited by 410

[PDF] mlr.press

Do the rewards justify the means? Measuring trade-offs between rewards and ethical behavior in the MACHIAVELLI benchmark

A Pan, JS Chan, A Zou, N Li, S Basart… - International …, 2023 - proceedings.mlr.press

Artificial agents have traditionally been trained to maximize reward, which may incentivize
power-seeking and deception, analogous to how next-token prediction in language models …

Cited by 132

[PDF] openreview.net

AgentBench: Evaluating LLMs as agents

X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) are becoming increasingly smart and autonomous,
targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has …

Cited by 257

[PDF] mlr.press

Grounding large language models in interactive environments with online reinforcement learning

T Carta, C Romac, T Wolf, S Lamprier… - International …, 2023 - proceedings.mlr.press

Recent works successfully leveraged Large Language Models' (LLM) abilities to capture
abstract knowledge about world's physics to solve decision-making problems. Yet, the …

Cited by 161

Textworld: A learning environment for text-based games

The rise and potential of large language model based agents: A survey

Reflexion: Language agents with verbal reinforcement learning