Seguir
Marlos C. Machado
Marlos C. Machado
DeepMind, Amii, and University of Alberta
Email confirmado em google.com - Página inicial
Título
Citado por
Citado por
Ano
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
MC Machado, MG Bellemare, E Talvitie, J Veness, M Hausknecht, ...
Journal of Artificial Intelligence Research 61, 523-562, 2018
3712018
A laplacian framework for option discovery in reinforcement learning
MC Machado, MG Bellemare, M Bowling
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1922017
State of the art control of atari games using shallow reinforcement learning
Y Liang, MC Machado, E Talvitie, M Bowling
Proceedings of the 2016 International Conference on Autonomous Agents …, 2016
1132016
Generalization and Regularization in DQN
J Farebrother, MC Machado, M Bowling
arXiv preprint arXiv:1810.00123, 2018
962018
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
942020
True online temporal-difference learning
H Van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton
The Journal of Machine Learning Research 17 (1), 5057-5096, 2016
912016
Eigenoption Discovery through the Deep Successor Representation
MC Machado, C Rosenbaum, X Guo, M Liu, G Tesauro, M Campbell
arXiv preprint arXiv:1710.11089, 2017
892017
Count-based exploration with the successor representation
MC Machado, MG Bellemare, M Bowling
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5125-5133, 2020
882020
On Bonus Based Exploration Methods In The Arcade Learning Environment
AA Taiga, W Fedus, MC Machado, A Courville, MG Bellemare
58*2020
Player modeling: Towards a common taxonomy
MC Machado, EPC Fantini, L Chaimowicz
2011 16th international conference on computer games (CGAMES), 50-57, 2011
522011
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
R Agarwal, MC Machado, PS Castro, MG Bellemare
arXiv preprint arXiv:2101.05265, 2021
492021
Learning Purposeful Behaviour in the Absence of Rewards
MC Machado, M Bowling
arXiv preprint arXiv:1605.07700, 2016
262016
Domain-independent optimistic initialization for reinforcement learning
MC Machado, S Srinivasan, M Bowling
Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
222015
Exploration in reinforcement learning with deep covering options
Y Jinnai, JW Park, MC Machado, G Konidaris
International Conference on Learning Representations, 2020
212020
The Eigenoption-Critic Framework
M Liu, MC Machado, G Tesauro, M Campbell
arXiv preprint arXiv:1712.04065, 2017
132017
An operator view of policy gradient methods
D Ghosh, M C Machado, N Le Roux
Advances in Neural Information Processing Systems 33, 2020
102020
Introspective agents: Confidence measures for general value functions
C Sherstan, A White, MC Machado, PM Pilarski
International Conference on Artificial General Intelligence, 258-261, 2016
102016
A binary classification approach for automatic preference modeling of virtual agents in Civilization IV
MC Machado, GL Pappa, L Chaimowicz
2012 IEEE Conference on Computational Intelligence and Games (CIG), 155-162, 2012
102012
Combining metaheuristics and csp algorithms to solve sudoku
MC Machado, L Chaimowicz
2011 Brazilian Symposium on Games and Digital Entertainment, 124-131, 2011
102011
Beyond variance reduction: Understanding the true impact of baselines on policy optimization
W Chung, V Thomas, MC Machado, N Le Roux
International Conference on Machine Learning, 1999-2009, 2021
92021
O sistema não pode efectuar a operação agora. Tente novamente mais tarde.
Artigos 1–20