Seguir
Zihan Qiu
Título
Citado por
Citado por
Ano
Supported policy optimization for offline reinforcement learning
J Wu, H Wu, Z Qiu, J Wang, M Long
Advances in Neural Information Processing Systems 35, 31278-31291, 2022
382022
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
Z Qiu, Z Huang, J Fu
arXiv preprint arXiv:2310.10908, 2023
32023
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
H Zhao, Z Qiu, H Wu, Z Wang, Z He, J Fu
arXiv preprint arXiv:2402.12656, 2024
2024
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers
Z Qiu, Z Huang, Y Huang, J Fu
Tiny Paper @ ICLR 2024, 2024
2024
Heterogenous Memory Augmented Neural Networks
Z Qiu, Z Liu, S Yan, S Zhang, J Fu
arXiv preprint arXiv:2310.10909, 2023
2023
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–5