Is space-time attention all you need for video understanding? G Bertasius, H Wang, L Torresani ICML 2 (3), 4, 2021 | 924 | 2021 |
Deepedge: A multi-scale bifurcated deep network for top-down contour detection G Bertasius, J Shi, L Torresani Proceedings of the IEEE conference on computer vision and pattern …, 2015 | 496 | 2015 |
Object detection in video with spatiotemporal sampling networks G Bertasius, L Torresani, J Shi Proceedings of the European Conference on Computer Vision (ECCV), 331-346, 2018 | 237 | 2018 |
Semantic segmentation with boundary neural fields G Bertasius, J Shi, L Torresani Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 190 | 2016 |
High-for-low and low-for-high: Efficient boundary detection from deep object features and its applications to high-level vision G Bertasius, J Shi, L Torresani Proceedings of the IEEE international conference on computer vision, 504-512, 2015 | 184 | 2015 |
Classifying, segmenting, and tracking object instances in video with mask propagation G Bertasius, L Torresani Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 151 | 2020 |
Convolutional random walk networks for semantic image segmentation G Bertasius, L Torresani, SX Yu, J Shi Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 140 | 2017 |
Learning temporal pose estimation from sparsely-labeled videos G Bertasius, C Feichtenhofer, D Tran, J Shi, L Torresani Advances in neural information processing systems 32, 2019 | 72* | 2019 |
Am I a baller? basketball performance assessment from first-person videos G Bertasius, H Soo Park, SX Yu, J Shi Proceedings of the IEEE international conference on computer vision, 2177-2185, 2017 | 68 | 2017 |
Automatic lymph node cluster segmentation using holistically-nested neural networks and structured optimization in CT images I Nogues, L Lu, X Wang, H Roth, G Bertasius, N Lay, J Shi, Y Tsehay, ... Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016: 19th …, 2016 | 63 | 2016 |
First person action-object detection with egonet G Bertasius, HS Park, SX Yu, J Shi arXiv preprint arXiv:1603.04908, 2016 | 53 | 2016 |
Vx2text: End-to-end learning of video-based text generation from multimodal inputs X Lin, G Bertasius, J Wang, SF Chang, D Parikh, L Torresani Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 40 | 2021 |
Long-short temporal contrastive learning of video transformers J Wang, G Bertasius, D Tran, L Torresani Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 30 | 2022 |
Unsupervised learning of important objects from first-person videos G Bertasius, H Soo Park, SX Yu, J Shi Proceedings of the IEEE International Conference on Computer Vision, 1956-1964, 2017 | 28* | 2017 |
Egocentric basketball motion planning from a single first-person image G Bertasius, A Chan, J Shi Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 26 | 2018 |
Learning to recognize procedural activities with distant supervision X Lin, F Petroni, G Bertasius, M Rohrbach, SF Chang, L Torresani Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 24 | 2022 |
TallFormer: Temporal Action Localization with a Long-Memory Transformer F Cheng, G Bertasius Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 18 | 2022 |
Long movie clip classification with state-space video models MM Islam, G Bertasius Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 14 | 2022 |
Cobe: Contextualized object embeddings from narrated instructional video G Bertasius, L Torresani Advances in Neural Information Processing Systems 33, 15133-15145, 2020 | 13 | 2020 |
EclipSE: Efficient Long-Range Video Retrieval Using Sight and Sound YB Lin, J Lei, M Bansal, G Bertasius Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 11 | 2022 |