Distributional reinforcement learning for multi-dimensional reward functions P Zhang, X Chen, L Zhao, W Xiong, T Qin, TY Liu Advances in Neural Information Processing Systems 34, 1519-1529, 2021 | 17 | 2021 |
Demonstration actor critic G Liu, L Zhao, P Zhang, J Bian, T Qin, N Yu, TY Liu Neurocomputing 434, 194-202, 2021 | 10 | 2021 |
An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context X Chen, X Zhu, Y Zheng, P Zhang, L Zhao, W Cheng, P Cheng, Y Xiong, ... arXiv preprint arXiv:2212.12735, 2022 | 7 | 2022 |
Distributional Pareto-Optimal Multi-Objective Reinforcement Learning XQ Cai, P Zhang, L Zhao, J Bian, M Sugiyama, A Llorens Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Asking before action: Gather information in embodied decision making with language models X Chen, S Zhang, P Zhang, L Zhao, J Chen arXiv preprint arXiv:2305.15695, 2023 | 4 | 2023 |
Independence-aware Advantage Estimation LZ Pushi Zhang, G Liu, J Bian, M Huang, T Qin, TY Liu Proceedings of the Thirtieth International Joint Conference on Artificial …, 2021 | 4* | 2021 |
IG-Net: Image-Goal Network for Offline Visual Navigation on A Large-Scale Game Map P Zhang, B Zhu, XQ Cai, L Zhao, M Sugiyama, J Bian | | 2023 |
Preference-conditioned Pixel-based AI Agent For Game Testing S Abdelfattah, A Brown, P Zhang 2023 IEEE Conference on Games (CoG), 1-8, 2023 | | 2023 |