Sledovat
Pedro A. Ortega
Pedro A. Ortega
Artificial Intelligence & Machine Learning
E-mailová adresa ověřena na: adaptiveagents.org - Domovská stránka
Název
Citace
Citace
Rok
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
5712019
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
3622017
Thermodynamics as a theory of decision-making with information-processing costs
PA Ortega, DA Braun
Proceedings of the Royal Society A: Mathematical, Physical and Engineering …, 2013
3072013
A Medical Claim Fraud/Abuse Detection System based on Data Mining: A Case Study in Chile.
PA Ortega, CJ Figueroa, GA Ruz
DMIN 6, 26-29, 2006
1682006
Meta reinforcement learning as task inference
J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess
arXiv preprint arXiv:1905.06424, 2019
1482019
Nash equilibria in multi-agent motor interactions
DA Braun, PA Ortega, DM Wolpert
PLoS computational biology 5 (8), e1000468, 2009
1452009
Neural networks and the chomsky hierarchy
G Delétang, A Ruoss, J Grau-Moya, T Genewein, LK Wenliang, E Catt, ...
arXiv preprint arXiv:2207.02098, 2022
1442022
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
1342019
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
1002019
From poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
982021
Information, utility and bounded rationality
DA Ortega, PA Braun
Artificial General Intelligence: 4th International Conference, AGI 2011 …, 2011
872011
A minimum relative entropy principle for learning and acting
PA Ortega, DA Braun
Journal of Artificial Intelligence Research 38, 475-511, 2010
862010
Action and perception as divergence minimization
D Hafner, PA Ortega, J Ba, T Parr, K Friston, N Heess
arXiv preprint arXiv:2009.01791, 2020
722020
Shaking the foundations: delusions in sequence models for interaction and control
PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ...
arXiv preprint arXiv:2110.10819, 2021
682021
Path integral control and bounded rationality
DA Braun, PA Ortega, E Theodorou, S Schaal
2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011
652011
Agent incentives: A causal perspective
T Everitt, R Carey, ED Langlois, PA Ortega, S Legg
Proceedings of the AAAI Conference on Artificial Intelligence 35 (13), 11487 …, 2021
602021
Intrinsic social motivation via causal influence in multi-agent RL
N Jaques, A Lazaridou, E Hughes, C Gulcehre, PA Ortega, DJ Strouse, ...
542018
Generalized Thompson sampling for sequential decision-making and causal inference
PA Ortega, DA Braun
Complex Adaptive Systems Modeling 2 (2), 2014
542014
Information-Theoretic Bounded Rationality
PA Ortega, DA Braun, JS Dyer, KE Kim, N Tishby
arXiv preprint arXiv:1512.06789, 2015
492015
Meta-trained agents implement bayes-optimal agents
V Mikulik, G Delétang, T McGrath, T Genewein, M Martic, S Legg, ...
Advances in neural information processing systems 33, 18691-18703, 2020
472020
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–20