Swin Transformer V2: Scaling up capacity and resolution Z Liu, H Hu, Y Lin, Z Yao, Z Xie, Y Wei, J Ning, Y Cao, Z Zhang, L Dong, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 2098 | 2022 |
SimMIM: A simple framework for masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, J Bao, Z Yao, Q Dai, H Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1482 | 2022 |
Local relation networks for image recognition H Hu, Z Zhang, Z Xie, S Lin Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 663 | 2019 |
Propagate yourself: Exploring pixel-level consistency for unsupervised visual representation learning Z Xie, Y Lin, Z Zhang, Y Cao, S Lin, H Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 485 | 2021 |
DeepSeek-Coder: When the Large Language Model Meets Programming--The Rise of Code Intelligence D Guo, Q Zhu, D Yang, Z Xie, K Dong, W Zhang, G Chen, X Bi, Y Wu, ... arXiv preprint arXiv:2401.14196, 2024 | 462 | 2024 |
Self-supervised learning with Swin Transformers Z Xie, Y Lin, Z Yao, Z Zhang, Q Dai, Y Cao, H Hu arXiv preprint arXiv:2105.04553, 2021 | 212 | 2021 |
DeepSeek-VL: Towards real-world vision-language understanding H Lu, W Liu, B Zhang, B Wang, K Dong, B Liu, J Sun, T Ren, Z Li, H Yang, ... arXiv preprint arXiv:2403.05525, 2024 | 192 | 2024 |
Contrastive learning rivals masked image modeling in fine-tuning via feature distillation Y Wei, H Hu, Z Xie, Z Zhang, Y Cao, J Bao, D Chen, B Guo arXiv preprint arXiv:2205.14141, 2022 | 144 | 2022 |
Revealing the dark secrets of masked image modeling Z Xie, Z Geng, J Hu, Z Zhang, H Hu, Y Cao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 142 | 2023 |
DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... arXiv preprint arXiv:2401.06066, 2024 | 138 | 2024 |
Spatially adaptive inference with stochastic feature sampling and interpolation Z Xie, Z Zhang, X Zhu, G Huang, S Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 116 | 2020 |
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ... arXiv preprint arXiv:2406.11931, 2024 | 106 | 2024 |
DeepSeek-V2: A strong, economical, and efficient mixture-of-experts language model A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Deng, C Ruan, D Dai, ... arXiv preprint arXiv:2405.04434, 2024 | 105 | 2024 |
DreamCraft3D: Hierarchical 3D generation with bootstrapped diffusion prior J Sun, B Zhang, R Shao, L Wang, W Liu, Z Xie, Y Liu arXiv preprint arXiv:2310.16818, 2023 | 95 | 2023 |
DeepSeek LLM: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024 | 72 | 2024 |
Parametric instance classification for unsupervised visual feature learning Y Cao, Z Xie, B Liu, Y Lin, Z Zhang, H Hu Advances in neural information processing systems 33, 15614-15624, 2020 | 66 | 2020 |
On data scaling in masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, Y Wei, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 63 | 2023 |
DeepSeek-V3 technical report A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ... arXiv preprint arXiv:2412.19437, 2024 | 28 | 2024 |
Janus: Decoupling visual encoding for unified multimodal understanding and generation C Wu, X Chen, Z Wu, Y Ma, X Liu, Z Pan, W Liu, Z Xie, X Yu, C Ruan, ... arXiv preprint arXiv:2410.13848, 2024 | 22 | 2024 |
DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning D Guo, D Yang, H Zhang, J Song, R Zhang, R Xu, Q Zhu, S Ma, P Wang, ... arXiv preprint arXiv:2501.12948, 2025 | 21 | 2025 |