Behavior proximal policy optimization Z Zhuang, K Lei, J Liu, D Wang, Y Guo arXiv preprint arXiv:2302.11312, 2023 | 22 | 2023 |
CEIL: Generalized contextual imitation learning J Liu, L He, Y Kang, Z Zhuang, D Wang, H Xu Advances in Neural Information Processing Systems 36, 2024 | 7 | 2024 |
HINFShot: A challenge dataset for few-shot node classification in heterogeneous information network Z Zhuang, X Xiang, S Huang, D Wang Proceedings of the 2021 International Conference on Multimedia Retrieval …, 2021 | 6 | 2021 |
Beyond OOD state actions: Supported cross-domain offline reinforcement learning J Liu, Z Zhang, Z Wei, Z Zhuang, Y Kang, S Gai, D Wang Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13945 …, 2024 | 5 | 2024 |
Design from policies: Conservative test-time adaptation for offline policy optimization J Liu, H Zhang, Z Zhuang, Y Kang, D Wang, B Wang Advances in Neural Information Processing Systems 36, 2024 | 5 | 2024 |
RotoGBML: Towards out-of-distribution generalization for gradient-based meta-learning M Zhang, Z Zhuang, Z Wang, D Wang, W Li arXiv preprint arXiv:2303.06679, 2023 | 5 | 2023 |
Homogenization with explicit semantics preservation for heterogeneous information network T Huang, Z Zhuang, S Zhang, D Wang Proceedings of the 29th ACM International Conference on Information …, 2020 | 3 | 2020 |
Reinformer: Max-Return Sequence Modeling for offline RL Z Zhuang, D Peng, Z Zhang, D Wang arXiv preprint arXiv:2405.08740, 2024 | | 2024 |
Context-Former: Stitching via Latent Conditioned Sequence Modeling Z Zhang, J Xu, Z Zhuang, J Liu arXiv preprint arXiv:2401.16452, 2024 | | 2024 |
RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph H Zhang, D Shi, Z Zhuang, H Zhao, Z Wei, F Zhao, S Gai, S Lyu, D Wang arXiv preprint arXiv:2311.06015, 2023 | | 2023 |
SERA: Sample Efficient Reward Augmentation in offline-to-online Reinforcement Learning Z Zhang, X Xiong, Z Zhuang, J Liu, D Wang arXiv preprint arXiv:2310.19805, 2023 | | 2023 |
STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization Y Kang, L He, J Liu, Z Zhuang, D Wang arXiv preprint arXiv:2307.09692, 2023 | | 2023 |