Abbas Abdolmaleki

Cited by

	All	Since 2019
Citations	3746	3502
h-index	27	25
i10-index	43	36

Citations per year

Year	Citations
2013	10
2014	14
2015	28
2016	27
2017	57
2018	83
2019	175
2020	384
2021	538
2022	893
2023	1114
2024	389

Public access

12 articles available
0 articles not available

Based on funding mandates

Co-authors

Martin Riedmiller, DeepMind (verified email at google.com)
Nicolas Heess, DeepMind (verified email at google.com)
Michael Neunert, Google DeepMind (verified email at google.com)
Luis Paulo Reis, Associate Professor, University of Porto (verified email at fe.up.pt)
Nuno Lau, Universidade de Aveiro (verified email at ua.pt)
Thomas Lampe, DeepMind (verified email at google.com)
Yuval Tassa, Senior Research Scientist, Google DeepMind (verified email at google.com)
Roland Hafner, DeepMind (verified email at google.com)
Gerhard Neumann, Professor, Karlsruhe Institute of Technology (KIT) (verified email at robot-learning.de)
Noah Y. Siegel, DeepMind (verified email at google.com)
Josh Merel (verified email at google.com)
Steven Bohez, Google DeepMind (verified email at google.com)
Nima Shafii, NVIDIA (verified email at nvidia.com)
Jan Peters, Professor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKI (verified email at ias.tu-darmstadt.de)
Rudolf Lioutikov, TT-Professor, Intuitive Robots Lab, Karlsruhe Institute of Technology (verified email at kit.edu)
Jost Tobias Springenberg, Google DeepMind


Abbas Abdolmaleki

DeepMind

Verified email at google.com

Artificial Intelligence, Reinforcement Learning, Robotics


Title	Cited by	Year
Magnetic control of tokamak plasmas through deep reinforcement learning. J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022	614	2022
DeepMind control suite. Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ... arXiv preprint arXiv:1801.00690, 2018	545	2018
Maximum a posteriori policy optimisation. A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ... arXiv preprint arXiv:1806.06920, 2018	482	2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning. NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ... arXiv preprint arXiv:2002.08396, 2020	275	2020
Acme: A research framework for distributed reinforcement learning. MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	231	2020
Robust reinforcement learning for continuous control with model misspecification. DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019	109	2019
V-MPO: On-policy maximum a posteriori policy optimization for discrete and continuous control. HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ... arXiv preprint arXiv:1909.12238, 2019	105	2019
From motor control to team play in simulated humanoid football. S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022	102	2022
Model-based relative entropy stochastic search. A Abdolmaleki, R Lioutikov, JR Peters, N Lau, LP Reis, G Neumann. Advances in Neural Information Processing Systems 28, 2015	87	2015
Continuous-discrete reinforcement learning for hybrid control in robotics. M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ... Conference on Robot Learning, 735-751, 2020	85	2020
Beyond pick-and-place: Tackling robotic stacking of diverse shapes. AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ... 5th Annual Conference on Robot Learning, 2021	78	2021
A distributional view on multi-objective policy optimization. A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ... International Conference on Machine Learning, 11-22, 2020	71	2020
Relative entropy regularized policy iteration. A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ... arXiv preprint arXiv:1812.02256, 2018	65	2018
Value constrained model-free continuous control. S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell. arXiv preprint arXiv:1902.04623, 2019	64	2019
Model-free trajectory optimization for reinforcement learning. R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki. International Conference on Machine Learning, 2961-2970, 2016	48	2016
Imagined value gradients: Model-based policy optimization with transferable latent dynamics models. A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ... Conference on Robot Learning, 566-589, 2020	42	2020
Data-efficient hindsight off-policy option learning. M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ... International Conference on Machine Learning, 11340-11350, 2021	41	2021
An optimized gait generator based on Fourier series towards fast and robust biped locomotion involving arms swing. N Shafii, A Khorsandian, A Abdolmaleki, B Jozi. 2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009	40	2009
Deriving and improving CMA-ES with information geometric trust regions. A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann. Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017	39	2017
Omnidirectional walking and active balance for soccer humanoid robot. N Shafii, A Abdolmaleki, R Ferreira, N Lau, LP Reis. Progress in Artificial Intelligence: 16th Portuguese Conference on …, 2013	38	2013

