Google Академик

Сачувај Цитирај 368 пута наведен Сродни чланци Све верзије (7) HTML верзија

Webshop: Towards scalable real-world web interaction with grounded language agents

S Yao, H Chen, J Yang… - Advances in Neural …, 2022 - proceedings.neurips.cc

Most existing benchmarks for grounding language in interactive environments either lack
realistic linguistic elements, or prove difficult to scale up due to substantial human …

Сачувај Цитирај 40 пута наведен Сродни чланци Све верзије (9) HTML верзија

When do transformers shine in rl? decoupling memory from credit assignment

T Ni, M Ma, B Eysenbach… - Advances in Neural …, 2023 - proceedings.neurips.cc

Reinforcement learning (RL) algorithms face two distinct challenges: learning effective
representations of past and present observations, and determining how actions influence …

Сачувај Цитирај 24 пута наведен Сродни чланци Све верзије (6) HTML верзија

Facing off world model backbones: Rnns, transformers, and s4

F Deng, J Park, S Ahn - Advances in Neural Information …, 2023 - proceedings.neurips.cc

World models are a fundamental component in model-based reinforcement learning
(MBRL). To perform temporally extended and consistent simulations of the future in partially …

Сачувај Цитирај 1 пута наведен Сродни чланци Све верзије (4)

CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community

Y Liu, B Guo, N Li, Y Ding, Z Zhang… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org

Artificial Intelligence of Things (AIoT) is an emerging frontier based on the deep fusion of
Internet of Things (IoT) and Artificial Intelligence (AI) technologies. The fundamental goal of …

Сачувај Цитирај 36 пута наведен Сродни чланци Све верзије (5) HTML верзија

On the link between conscious function and general intelligence in humans and machines

A Juliani, K Arulkumaran, S Sasai, R Kanai - arxiv preprint arxiv …, 2022 - arxiv.org

In popular media, there is often a connection drawn between the advent of awareness in
artificial agents and those same agents simultaneously achieving human or superhuman …

Сачувај Цитирај 26 пута наведен Сродни чланци Све верзије (8) HTML верзија

Large-scale retrieval for reinforcement learning

P Humphreys, A Guez, O Tieleman… - Advances in …, 2022 - proceedings.neurips.cc

Effective decision making involves flexibly relating past experiences and relevant contextual
information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm …

Сачувај Цитирај 23 пута наведен Сродни чланци Све верзије (2) HTML верзија

[PDF] openreview.net

Memory gym: Partially observable challenges to memory-based agents

M Pleines, M Pallasch, F Zimmer… - … conference on learning …, 2023 - openreview.net

Memory Gym is a novel benchmark for challenging Deep Reinforcement Learning agents to
memorize events across long sequences, be robust to noise, and generalize. It consists of …

Сачувај Цитирај 19 пута наведен Сродни чланци Све верзије (11)

Large language models can segment narrative events similarly to humans

S Michelmann, M Kumar, KA Norman… - Behavior Research …, 2025 - Springer

Humans perceive discrete events such as “restaurant visits” and “train rides” in their
continuous experience. One important prerequisite for studying human event perception is …