Swin transformer v2: Scaling up capacity and resolution Z Liu, H Hu, Y Lin, Z Yao, Z Xie, Y Wei, J Ning, Y Cao, Z Zhang, L Dong, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1710 | 2022 |
Simmim: A simple framework for masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, J Bao, Z Yao, Q Dai, H Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1259 | 2022 |
Local relation networks for image recognition H Hu, Z Zhang, Z Xie, S Lin Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 614 | 2019 |
Propagate yourself: Exploring pixel-level consistency for unsupervised visual representation learning Z Xie, Y Lin, Z Zhang, Y Cao, S Lin, H Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 448 | 2021 |
DeepSeek-Coder: When the Large Language Model Meets Programming--The Rise of Code Intelligence D Guo, Q Zhu, D Yang, Z Xie, K Dong, W Zhang, G Chen, X Bi, Y Wu, ... arXiv preprint arXiv:2401.14196, 2024 | 226 | 2024 |
Self-supervised learning with swin transformers Z Xie, Y Lin, Z Yao, Z Zhang, Q Dai, Y Cao, H Hu arXiv preprint arXiv:2105.04553, 2021 | 191 | 2021 |
Contrastive learning rivals masked image modeling in fine-tuning via feature distillation Y Wei, H Hu, Z Xie, Z Zhang, Y Cao, J Bao, D Chen, B Guo arXiv preprint arXiv:2205.14141, 2022 | 136 | 2022 |
Revealing the dark secrets of masked image modeling Z Xie, Z Geng, J Hu, Z Zhang, H Hu, Y Cao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 117 | 2023 |
Spatially adaptive inference with stochastic feature sampling and interpolation Z Xie, Z Zhang, X Zhu, G Huang, S Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 111 | 2020 |
Deepseek-vl: towards real-world vision-language understanding H Lu, W Liu, B Zhang, B Wang, K Dong, B Liu, J Sun, T Ren, Z Li, Y Sun, ... arXiv preprint arXiv:2403.05525, 2024 | 70 | 2024 |
Parametric instance classification for unsupervised visual feature learning Y Cao, Z Xie, B Liu, Y Lin, Z Zhang, H Hu Advances in neural information processing systems 33, 15614-15624, 2020 | 64 | 2020 |
Dreamcraft3d: Hierarchical 3d generation with bootstrapped diffusion prior J Sun, B Zhang, R Shao, L Wang, W Liu, Z Xie, Y Liu arXiv preprint arXiv:2310.16818, 2023 | 63 | 2023 |
Deepseekmoe: Towards ultimate expert specialization in mixture-of-experts language models D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... arXiv preprint arXiv:2401.06066, 2024 | 57 | 2024 |
On data scaling in masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, Y Wei, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 51 | 2023 |
Deepseek llm: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024 | 43 | 2024 |
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ... arXiv preprint arXiv:2406.11931, 2024 | 17 | 2024 |
Improving clip fine-tuning performance Y Wei, H Hu, Z Xie, Z Liu, Z Zhang, Y Cao, J Bao, D Chen, B Guo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 10 | 2023 |
Deepseek-v2: A strong, economical, and efficient mixture-of-experts language model A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, C Ruan, D Dai, ... arXiv preprint arXiv:2405.04434, 2024 | 9 | 2024 |
icar: Bridging image classification and image-text alignment for visual recognition Y Wei, Y Cao, Z Zhang, Z Yao, Z Xie, H Hu, B Guo arXiv preprint arXiv:2204.10760, 2022 | 8 | 2022 |
Breaking shortcut: Exploring fully convolutional cycle-consistency for video correspondence learning Y Tang, Z Jiang, Z Xie, Y Cao, Z Zhang, PHS Torr, H Hu arXiv preprint arXiv:2105.05838, 2021 | 7 | 2021 |