Vivit: A video vision transformer A Arnab, M Dehghani, G Heigold, C Sun, M Lučić, C Schmid Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 757 | 2021 |
Higher order conditional random fields in deep neural networks A Arnab, S Jayasumana, S Zheng, PHS Torr Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016 | 286* | 2016 |
Pixelwise instance segmentation with a dynamically instantiated network A Arnab, PHS Torr Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 259 | 2017 |
On the robustness of semantic segmentation models to adversarial attacks A Arnab, O Miksik, PHS Torr IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 888-897, 2018 | 253 | 2018 |
Exploiting temporal context for 3D human pose estimation in the wild A Arnab, C Doersch, A Zisserman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 189 | 2019 |
Weakly-and Semi-Supervised Panoptic Segmentation Q Li, A Arnab, PHS Torr Proceedings of the European Conference on Computer Vision (ECCV), 102-118, 2018 | 168 | 2018 |
Attention bottlenecks for multimodal fusion A Nagrani, S Yang, A Arnab, A Jansen, C Schmid, C Sun Advances in Neural Information Processing Systems 34, 14200-14213, 2021 | 164 | 2021 |
Dual graph convolutional network for semantic segmentation L Zhang, X Li, A Arnab, K Yang, Y Tong, PHS Torr arXiv preprint arXiv:1909.06121, 2019 | 160 | 2019 |
Conditional random fields meet deep neural networks for semantic segmentation: Combining probabilistic graphical models with deep learning for structured prediction A Arnab, S Zheng, S Jayasumana, B Romera-Paredes, M Larsson, ... IEEE Signal Processing Magazine 35 (1), 37-52, 2018 | 137 | 2018 |
Dynamic graph message passing networks L Zhang, D Xu, A Arnab, PHS Torr Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 104 | 2020 |
Multiview transformers for video recognition S Yan, X Xiong, A Arnab, Z Lu, M Zhang, C Sun, C Schmid Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 66 | 2022 |
Bottom-up instance segmentation using deep higher-order crfs A Arnab, PHS Torr Proceedings of the British Machine Vision Conference (BMVC), 2016 | 66 | 2016 |
Holistic, Instance-Level Human Parsing Q Li, A Arnab, PHS Torr Proceedings of the British Machine Vision Conference (BMVC), 2017 | 59 | 2017 |
Tokenlearner: What can 8 learned tokens do for images and videos? MS Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova arXiv preprint arXiv:2106.11297, 2021 | 54 | 2021 |
Tokenlearner: Adaptive space-time tokenization for videos M Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova Advances in Neural Information Processing Systems 34, 12786-12797, 2021 | 46 | 2021 |
The efficiency misnomer M Dehghani, A Arnab, L Beyer, A Vaswani, Y Tay arXiv preprint arXiv:2110.12894, 2021 | 35 | 2021 |
A projected gradient descent method for CRF inference allowing end-to-end training of arbitrary pairwise potentials M Larsson, A Arnab, F Kahl, S Zheng, P Torr International Conference on Energy Minimization Methods in Computer Vision …, 2017 | 35* | 2017 |
Simple open-vocabulary object detection with vision transformers M Minderer, A Gritsenko, A Stone, M Neumann, D Weissenborn, ... arXiv preprint arXiv:2205.06230, 2022 | 34 | 2022 |
End-to-end generative pretraining for multimodal video captioning PH Seo, A Nagrani, A Arnab, C Schmid Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 29 | 2022 |
Polyvit: Co-training vision transformers on images, videos and audio V Likhosherstov, A Arnab, K Choromanski, M Lucic, Y Tay, A Weller, ... arXiv preprint arXiv:2111.12993, 2021 | 29 | 2021 |