Performance errors during rodent learning reflect a dynamic choice strategy

Z Zhu, KV Kuchibhotla - Current Biology, 2024 - cell.com
Humans, even as infants, use cognitive strategies, such as exploration and hypothesis
testing, to learn about causal interactions in the environment. In animal learning studies …

UniAP: towards universal animal perception in vision via few-shot learning

M Sun, Z Zhao, W Chai, H Luo, S Cao… - Proceedings of the …, 2024 - ojs.aaai.org
Animal visual perception is an important technique for automatically monitoring animal
health, understanding animal behaviors, and assisting animal-related research. However, it …

Harnessing the flexibility of neural networks to predict dynamic theoretical parameters underlying human choice behavior

Y Ger, E Nachmani, L Wolf… - PLoS Computational …, 2024 - journals.plos.org
Reinforcement learning (RL) models are used extensively to study human behavior. These
rely on normative models of behavior and stress interpretability over predictive capabilities …

Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations

K Fujii, K Tsutsui, A Scott, H Nakahara… - arxiv preprint arxiv …, 2023 - arxiv.org
Modeling of real-world biological multi-agents is a fundamental problem in various scientific
and engineering fields. Reinforcement learning (RL) is a powerful framework to generate …

Integrating inverse reinforcement learning into data-driven mechanistic computational models: a novel paradigm to decode cancer cell heterogeneity

PC Kinnunen, KKY Ho, S Srivastava… - Frontiers in Systems …, 2024 - frontiersin.org
Cellular heterogeneity is a ubiquitous aspect of biology and a major obstacle to successful
cancer treatment. Several techniques have emerged to quantify heterogeneity in live cells …

Discovering individual rewards in collective behavior through inverse multi-agent reinforcement learning

D Waelchli, P Weber, P Koumoutsakos - arxiv preprint arxiv:2305.10548, 2023 - arxiv.org
The discovery of individual objectives in collective behavior of complex dynamical systems
such as fish schools and bacteria colonies is a long-standing challenge. Inverse …

[HTML][HTML] Online estimation of objective function for continuous-time deterministic systems

HJ Asl, E Uchibe - Neural Networks, 2024 - Elsevier
We developed two online data-driven methods for estimating an objective function in
continuous-time linear and nonlinear deterministic systems. The primary focus addressed …

Efficient adaptation in mixed-motive environments via hierarchical opponent modeling and planning

Y Huang, A Liu, F Kong, Y Yang, SC Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite the recent successes of multi-agent reinforcement learning (MARL) algorithms,
efficiently adapting to co-players in mixed-motive environments remains a significant …

Learning true objectives: Linear algebraic characterizations of identifiability in inverse reinforcement learning

ML Shehab, A Aspeel, N Arechiga… - 6th Annual Learning …, 2024 - proceedings.mlr.press
Inverse reinforcement Learning (IRL) has emerged as a powerful paradigm for extracting
expert skills from observed behavior, with applications ranging from autonomous systems to …

Multi-intention inverse q-learning for interpretable behavior representation

H Zhu, B De La Crompe, G Kalweit, A Schneider… - arxiv preprint arxiv …, 2023 - arxiv.org
In advancing the understanding of natural decision-making processes, inverse
reinforcement learning (IRL) methods have proven instrumental in reconstructing animal's …