Seguir
Abbas Abdolmaleki
Abbas Abdolmaleki
Deepmind
Email confirmado em google.com
Título
Citado por
Citado por
Ano
Maximum a posteriori policy optimisation
A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ...
arXiv preprint arXiv:1806.06920, 2018
3282018
Deepmind control suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ...
arXiv preprint arXiv:1801.00690, 2018
3122018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning
NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ...
arXiv preprint arXiv:2002.08396, 2020
1702020
Magnetic control of tokamak plasmas through deep reinforcement learning
J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ...
Nature 602 (7897), 414-419, 2022
1332022
Acme: A research framework for distributed reinforcement learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
1282020
Model-based relative entropy stochastic search
A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann
Advances in Neural Information Processing Systems 28, 2015
752015
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control
HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ...
arXiv preprint arXiv:1909.12238, 2019
702019
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ...
arXiv preprint arXiv:1906.07516, 2019
692019
Continuous-discrete reinforcement learning for hybrid control in robotics
M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ...
Conference on Robot Learning, 735-751, 2020
502020
Relative entropy regularized policy iteration
A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ...
arXiv preprint arXiv:1812.02256, 2018
502018
Model-free trajectory optimization for reinforcement learning
R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki
International Conference on Machine Learning, 2961-2970, 2016
442016
Value constrained model-free continuous control
S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell
arXiv preprint arXiv:1902.04623, 2019
412019
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing
N Shafii, A Khorsandian, A Abdolmaleki, B Jozi
2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009
392009
Deriving and improving CMA-ES with information geometric trust regions
A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann
Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017
352017
A distributional view on multi-objective policy optimization
A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ...
International Conference on Machine Learning, 11-22, 2020
332020
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models
A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ...
Conference on Robot Learning, 566-589, 2020
332020
Omnidirectional walking and active balance for soccer humanoid robot
N Shafii, A Abdolmaleki, R Ferreira, N Lau, LP Reis
Portuguese Conference on Artificial Intelligence, 283-294, 2013
312013
From motor control to team play in simulated humanoid football
S Liu, G Lever, Z Wang, J Merel, SM Eslami, D Hennes, WM Czarnecki, ...
arXiv preprint arXiv:2105.12196, 2021
302021
Learning a humanoid kick with controlled distance
A Abdolmaleki, D Simões, N Lau, LP Reis, G Neumann
Robot World Cup, 45-57, 2016
282016
Beyond pick-and-place: Tackling robotic stacking of diverse shapes
AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ...
5th Annual Conference on Robot Learning, 2021
252021
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20