Scaling up visual and vision-language representation learning with noisy text supervision C Jia, Y Yang, Y Xia, YT Chen, Z Parekh, H Pham, Q Le, YH Sung, Z Li, ... International conference on machine learning, 4904-4916, 2021 | 3562 | 2021 |
The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale A Kuznetsova, H Rom, N Alldrin, J Uijlings, I Krasin, J Pont-Tuset, ... International journal of computer vision 128 (7), 1956-1981, 2020 | 2875 | 2020 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2138 | 2023 |
Openimages: A public dataset for large-scale multi-label and multi-class image classification I Krasin, T Duerig, N Alldrin, V Ferrari, S Abu-El-Haija, A Kuznetsova, ... Dataset available from https://github. com/openimages 2 (3), 18, 2017 | 874 | 2017 |
The unreasonable effectiveness of noisy data for fine-grained recognition J Krause, B Sapp, A Howard, H Zhou, A Toshev, T Duerig, J Philbin, ... Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016 | 437 | 2016 |
Blockout: Dynamic model selection for hierarchical deep networks C Murdock, Z Li, H Zhou, T Duerig Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 71 | 2016 |
Graph-rise: Graph-regularized image semantic embedding DC Juan, CT Lu, Z Li, F Peng, A Timofeev, YT Chen, Y Gao, T Duerig, ... arXiv preprint arXiv:1902.10814, 2019 | 39 | 2019 |
The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale. arXiv 2018 A Kuznetsova, H Rom, N Alldrin, J Uijlings, I Krasin, J Pont-Tuset, ... arXiv preprint arXiv:1811.00982, 2018 | 33 | 2018 |
Ultra fine-grained image semantic embedding DC Juan, CT Lu, Z Li, F Peng, A Timofeev, YT Chen, Y Gao, T Duerig, ... Proceedings of the 13th international conference on web search and data …, 2020 | 18 | 2020 |
Unifying Specialist Image Embedding into Universal Image Embedding Y Feng, F Peng, X Zhang, W Zhu, S Zhang, H Zhou, Z Li, T Duerig, ... arXiv preprint arXiv:2003.03701, 2020 | 4 | 2020 |
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use IE Toubal, A Avinash, NG Alldrin, J Dlabal, W Zhou, E Luo, O Stretcu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |
Graph-RISE: Graph-Regularized Image Semantic Embedding A Timofeev, A Tomkins, CT Lu, DC Juan, F Peng, K Viswanathan, L Gao, ... ACM WSDM, 2020 | 3 | 2020 |
Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use I Eddine Toubal, A Avinash, NG Alldrin, J Dlabal, W Zhou, E Luo, ... arXiv e-prints, arXiv: 2403.02626, 2024 | | 2024 |
CARLS: Cross-platform Asynchronous Representation Learning System CT Lu, Y Zeng, DC Juan, Y Fan, Z Li, J Dlabal, YT Chen, A Gopalan, ... arXiv preprint arXiv:2105.12849, 2021 | | 2021 |