Bei Peng

Citado por

	Todos	Desde 2019
Citações	2058	1876
Índice h	16	15
Índice i10	19	16

640

320

160

480

201520162017201820192020202120222023202416 36 43 77 56 139 313 519 628 218

Acesso público

Ver tudo

14 artigos

0 artigos

disponível

não disponível

Com base em autorizações de financiamento

Coautores

Matthew E. TaylorAssociate Professor, University of AlbertaEmail confirmado em ualberta.ca
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoEmail confirmado em cs.ox.ac.uk
David L. RobertsAssociate Professor, Assistant Director Undergraduate Programs, Interim Director Digital GamesEmail confirmado em csc.ncsu.edu
Michael LittmanBrown UniversityEmail confirmado em brown.edu
Robert LoftinLecturer, University of SheffieldEmail confirmado em sheffield.ac.uk
James MacGlashanSony AIEmail confirmado em sony.com
Wendelin BöhmerSequential Decision Making Group, Delft University of TechnologyEmail confirmado em tudelft.nl
Tabish RashidMicrosoft ResearchEmail confirmado em microsoft.com
Christian Schroeder de WittUniversity of OxfordEmail confirmado em robots.ox.ac.uk
Tarun GuptaUniversity of Oxford, Microsoft ResearchEmail confirmado em microsoft.com
Jeff HuangBrown UniversityEmail confirmado em jeffhuang.com
Philip TorrProfessor, University of OxfordEmail confirmado em eng.ox.ac.uk
Anuj MahajanAmazonEmail confirmado em cs.ox.ac.uk
Sanmit NarvekarResearch Scientist, WaymoEmail confirmado em cs.utexas.edu
Peter StoneProfessor of Computer Science, The University of Texas at AustinEmail confirmado em cs.utexas.edu
Jivko SinapovAssistant Professor, Tufts UniversityEmail confirmado em cs.tufts.edu
Matteo LeonettiDepartment of Informatics, King's College LondonEmail confirmado em kcl.ac.uk
Gregory FarquharDeepMindEmail confirmado em google.com
Tonghan WangEcon CS group, Harvard UniversityEmail confirmado em g.harvard.edu
Shariq IqbalResearch Scientist, DeepmindEmail confirmado em deepmind.com

Seguir

Bei Peng

Lecturer (Assistant Professor), University of Liverpool

Email confirmado em liverpool.ac.uk - Página inicial

Machine Learning Reinforcement Learning Interactive Learning Multi-Agent Systems Curriculum Learning


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey S Narvekar, B Peng, M Leonetti, J Sinapov, ME Taylor, P Stone Journal of Machine Learning Research (JMLR 2020) 21, 1-50, 2020	445	2020
Weighted QMIX: Expanding Monotonic Value Function Factorisation T Rashid, G Farquhar, B Peng, S Whiteson Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020), 2020	314*	2020
Interactive learning from policy-dependent human feedback J MacGlashan, MK Ho, R Loftin, B Peng, G Wang, DL Roberts, ME Taylor, ... 34th International Conference on Machine Learning (ICML 2017), 2285-2294, 2017	301	2017
RODE: Learning Roles to Decompose Multi-Agent Tasks T Wang, T Gupta, A Mahajan, B Peng, S Whiteson, C Zhang International Conference on Learning Representations (ICLR 2021), 2020	181	2020
FACMAC: Factored Multi-Agent Centralised Policy Gradients B Peng, T Rashid, CAS de Witt, PA Kamienny, PHS Torr, W Böhmer, ... 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021	169	2021
Learning behaviors via human-delivered discrete feedback: modeling implicit feedback strategies to speed up learning R Loftin, B Peng, J MacGlashan, ML Littman, ME Taylor, J Huang, ... Autonomous agents and multi-agent systems (JAAMAS 2016) 30 (1), 30-59, 2016	121	2016
A strategy-aware technique for learning behaviors from discrete human feedback RT Loftin, J MacGlashan, B Peng, ME Taylor, ML Littman, J Huang, ... Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2014), 2014	79	2014
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning S Iqbal, CAS de Witt, B Peng, W Böhmer, S Whiteson, F Sha 38th International Conference on Machine Learning (ICML 2021), 2021	73*	2021
Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control CS de Witt, B Peng (equal contribution), PA Kamienny, P Torr, W Böhmer, ... arXiv preprint arXiv:2003.06709, 2020	72	2020
A need for speed: Adapting agent action speed to improve task learning from non-expert humans B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor Autonomous Agents and Multiagent Systems (AAMAS 2016), 2016	56	2016
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning T Gupta, A Mahajan, B Peng, W Böhmer, S Whiteson 38th International Conference on Machine Learning (ICML 2021), 2021	48	2021
Optimistic Exploration even with a Pessimistic Initialisation T Rashid, B Peng, W Böhmer, S Whiteson International Conference on Learning Representations (ICLR 2020), 2020	46	2020
Learning something from nothing: Leveraging implicit human feedback strategies R Loftin, B Peng, J MacGlashan, ML Littman, ME Taylor, J Huang, ... The 23rd IEEE international symposium on robot and human interactive …, 2014	30	2014
Regularized Softmax Deep Multi-Agent Q-Learning L Pan, T Rashid, B Peng, L Huang, S Whiteson 35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021	26*	2021
Training an agent to ground commands with reward and punishment J MacGlashan, M Littman, R Loftin, B Peng, D Roberts, M Taylor Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014	25	2014
Curriculum Design for Machine Learners in Sequential Decision Tasks B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor IEEE Transactions on Emerging Topics in Computational Intelligence 2 (4 …, 2018	18	2018
An empirical study of non-expert curriculum design for machine learners B Peng, J MacGlashan, R Loftin, ML Littman, DL Roberts, ME Taylor Proceedings of the IJCAI Interactive Machine Learning Workshop, 2016	14	2016
Convergent Actor Critic by Humans J MacGlashan, ML Littman, DL Roberts, R Loftin, B Peng, ME Taylor International Conference on Intelligent Robots and Systems (IROS 2016), 2016	12	2016
Towards integrating real-time crowd advice with reinforcement learning GV de la Cruz, B Peng, WS Lasecki, ME Taylor Proceedings of the 20th International Conference on Intelligent User …, 2015	10	2015
Generating real-time crowd advice to improve reinforcement learning agents GV de la Cruz, B Peng, WS Lasecki, ME Taylor Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015	4	2015

O sistema não pode efectuar a operação agora. Tente mais tarde.

Artigos 1–20

Citações por ano

Citações duplicadas

Citações unidas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores