Nantas Nardelli
Citado por
Citado por
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
International conference on machine learning, 1146-1155, 2017
The starcraft multi-agent challenge
M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
arXiv preprint arXiv:1902.04043, 2019
Torchcraft: a library for machine learning research on real-time strategy games
G Synnaeve, N Nardelli, A Auvolat, S Chintala, T Lacroix, Z Lin, F Richoux, ...
arXiv preprint arXiv:1611.00625, 2016
A survey of reinforcement learning informed by natural language
J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ...
arXiv preprint arXiv:1906.03926, 2019
Playing doom with slam-augmented deep reinforcement learning
S Bhatti, A Desmaison, O Miksik, N Nardelli, N Siddharth, PHS Torr
arXiv preprint arXiv:1612.00380, 2016
Counterfactual reasoning about intent for interactive navigation in dynamic environments
A Bordallo, F Previtali, N Nardelli, S Ramamoorthy
2015 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2015
The nethack learning environment
H Küttler, N Nardelli, AH Miller, R Raileanu, M Selvatici, E Grefenstette, ...
arXiv preprint arXiv:2006.13760, 2020
Value propagation networks
N Nardelli, G Synnaeve, Z Lin, P Kohli, PHS Torr, N Usunier
arXiv preprint arXiv:1805.11199, 2018
Torchbeast: A pytorch platform for distributed rl
H Küttler, N Nardelli, T Lavril, M Selvatici, V Sivakumar, T Rocktäschel, ...
arXiv preprint arXiv:1910.03552, 2019
Mvfst-rl: An asynchronous rl framework for congestion control with delayed actions
V Sivakumar, T Rocktäschel, AH Miller, H Küttler, N Nardelli, M Rabbat, ...
arXiv preprint arXiv:1910.04054, 2019
Multitask soft option learning
M Igl, A Gambardella, J He, N Nardelli, N Siddharth, W Böhmer, ...
Conference on Uncertainty in Artificial Intelligence, 969-978, 2020
WordCraft: An Environment for Benchmarking Commonsense Agents
M Jiang, J Luketina, N Nardelli, P Minervini, PHS Torr, S Whiteson, ...
arXiv preprint arXiv:2007.09185, 2020
Simulation-Based Inference for Global Health Decisions
CS de Witt, B Gram-Hansen, N Nardelli, A Gambardella, R Zinkov, ...
arXiv preprint arXiv:2005.07062, 2020
Lessons from reinforcement learning for biological representations of space
A Muryy, N Siddharth, N Nardelli, A Glennerster, PHS Torr
Vision Research 174, 79-93, 2020
Inference and Distillation for Option Learning
M Igl, W Boehmer, A Gambardella, PHS Torr, N Nardelli, N Siddharth, ...
Workshop on Probabilistic Reinforcement Learning and Structured Control …, 2018
Team Edinferno Description Paper for RoboCup 2013 SPL
A Valtazanos, E Vafeias, AB Mico, D Mankowitz, N Nardelli, ...
Team Edinferno Description Paper for RoboCup 2014 SPL
AB Micó, N Nardelli, S Penkov, E Vafeias, G Schropp, S Manilov, ...
O sistema não pode efectuar a operação agora. Tente novamente mais tarde.
Artigos 1–18