Seguir
Pedro A. Ortega
Pedro A. Ortega
Artificial Intelligence & Machine Learning
Dirección de correo verificada de adaptiveagents.org - Página principal
Título
Citado por
Citado por
Año
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
4562019
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
3222017
Thermodynamics as a theory of decision-making with information-processing costs
PA Ortega, DA Braun
Proceedings of the Royal Society A: Mathematical, Physical and Engineering …, 2013
2782013
A Medical Claim Fraud/Abuse Detection System based on Data Mining: A Case Study in Chile.
PA Ortega, CJ Figueroa, GA Ruz
DMIN 6, 26-29, 2006
1572006
Nash equilibria in multi-agent motor interactions
DA Braun, PA Ortega, DM Wolpert
PLoS computational biology 5 (8), e1000468, 2009
1322009
Meta reinforcement learning as task inference
J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess
arXiv preprint arXiv:1905.06424, 2019
1302019
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
1212019
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
822019
Information, utility and bounded rationality
DA Ortega, PA Braun
Artificial General Intelligence: 4th International Conference, AGI 2011 …, 2011
812011
A minimum relative entropy principle for learning and acting
PA Ortega, DA Braun
Journal of Artificial Intelligence Research 38, 475-511, 2010
792010
Neural networks and the chomsky hierarchy
G Delétang, A Ruoss, J Grau-Moya, T Genewein, LK Wenliang, E Catt, ...
arXiv preprint arXiv:2207.02098, 2022
752022
From poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
752021
Path integral control and bounded rationality
DA Braun, PA Ortega, E Theodorou, S Schaal
2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011
652011
Action and perception as divergence minimization
D Hafner, PA Ortega, J Ba, T Parr, K Friston, N Heess
arXiv preprint arXiv:2009.01791, 2020
552020
Intrinsic social motivation via causal influence in multi-agent RL
N Jaques, A Lazaridou, E Hughes, C Gulcehre, PA Ortega, DJ Strouse, ...
532018
Generalized Thompson sampling for sequential decision-making and causal inference
PA Ortega, DA Braun
Complex Adaptive Systems Modeling 2 (2), 2014
492014
Laser processing of Al2O3/a‐SiCx:H stacks: a feasible solution for the rear surface of high‐efficiency p‐type c‐Si solar cells
I Martín, P Ortega, M Colina, A Orpella, G López, R Alcubilla
Progress in Photovoltaics: Research and Applications 21 (5), 1171-1175, 2013
472013
Shaking the foundations: delusions in sequence models for interaction and control
PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ...
arXiv preprint arXiv:2110.10819, 2021
432021
Human decision-making under limited time
PA Ortega, AA Stocker
Advances in Neural Information Processing Systems 29, 2016
402016
Agent incentives: A causal perspective
T Everitt, R Carey, ED Langlois, PA Ortega, S Legg
Proceedings of the AAAI Conference on Artificial Intelligence 35 (13), 11487 …, 2021
392021
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20