Maximum a posteriori policy optimisation A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ... arXiv preprint arXiv:1806.06920, 2018 | 177 | 2018 |
Deepmind control suite Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ... arXiv preprint arXiv:1801.00690, 2018 | 139 | 2018 |
Model-based relative entropy stochastic search A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann Advances in Neural Information Processing Systems 28, 3537-3545, 2015 | 50 | 2015 |
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing N Shafii, A Khorsandian, A Abdolmaleki, B Jozi 2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009 | 37 | 2009 |
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ... arXiv preprint arXiv:2002.08396, 2020 | 36 | 2020 |
Model-free trajectory optimization for reinforcement learning R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki International Conference on Machine Learning, 2961-2970, 2016 | 33 | 2016 |
V-MPO: On-policy maximum a posteriori policy optimization for discrete and continuous control HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ... arXiv preprint arXiv:1909.12238, 2019 | 26 | 2019 |
Omnidirectional walking and active balance for soccer humanoid robot N Shafii, A Abdolmaleki, R Ferreira, N Lau, LP Reis Portuguese Conference on Artificial Intelligence, 283-294, 2013 | 26 | 2013 |
Acme: A research framework for distributed reinforcement learning M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ... arXiv preprint arXiv:2006.00979, 2020 | 24 | 2020 |
Relative entropy regularized policy iteration A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ... arXiv preprint arXiv:1812.02256, 2018 | 24 | 2018 |
Deriving and improving CMA-ES with information geometric trust regions A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017 | 23 | 2017 |
Learning a humanoid kick with controlled distance A Abdolmaleki, D Simões, N Lau, LP Reis, G Neumann Robot World Cup, 45-57, 2016 | 22 | 2016 |
Robust reinforcement learning for continuous control with model misspecification DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019 | 19 | 2019 |
Regularized covariance estimation for weighted maximum likelihood policy search methods A Abdolmaleki, N Lau, LP Reis, G Neumann 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids …, 2015 | 19 | 2015 |
Simultaneously learning vision and feature-based control policies for real-world ball-in-a-cup D Schwab, T Springenberg, MF Martins, T Lampe, M Neunert, ... arXiv preprint arXiv:1902.04706, 2019 | 17 | 2019 |
Value constrained model-free continuous control S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell arXiv preprint arXiv:1902.04623, 2019 | 16 | 2019 |
Regularized hierarchical policies for compositional transfer in robotics M Wulfmeier, A Abdolmaleki, R Hafner, JT Springenberg, M Neunert, ... arXiv preprint arXiv:1906.11228, 2019 | 15 | 2019 |
Guide actor-critic for continuous control V Tangkaratt, A Abdolmaleki, M Sugiyama arXiv preprint arXiv:1705.07606, 2017 | 14 | 2017 |
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ... Conference on Robot Learning, 566-589, 2020 | 12 | 2020 |
Model-free trajectory-based policy optimization with monotonic improvement R Akrour, A Abdolmaleki, H Abdulsamad, J Peters, G Neumann The Journal of Machine Learning Research 19 (1), 565-589, 2018 | 12 | 2018 |