Attention clusters: Purely attention based local feature integration for video classification X Long, C Gan, G De Melo, J Wu, X Liu, S Wen Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 182 | 2018 |
PP-YOLO: An effective and efficient implementation of object detector X Long, K Deng, G Wang, Y Zhang, Q Dang, Y Gao, H Shen, J Ren, ... arXiv preprint arXiv:2007.12099, 2020 | 77 | 2020 |
Multimodal keyless attention fusion for video classification X Long, C Gan, G De Melo, X Liu, Y Li, F Li, S Wen Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 77 | 2018 |
Video captioning with multi-faceted attention X Long, C Gan, G De Melo Transactions of the Association for Computational Linguistics 6, 173-184, 2018 | 76 | 2018 |
Multi-Label Classification with Label Graph Superimposing Y Wang, D He, F Li, X Long, Z Zhou, J Ma, S Wen AAAI, 2020 | 64 | 2020 |
Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification R You, Z Guo, L Cui, X Long, Y Bao, S Wen AAAI, 2020 | 61 | 2020 |
Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding F Li, C Gan, X Liu, Y Bian, X Long, Y Li, Z Li, J Zhou, S Wen CVPR Workshop, 2017 | 54 | 2017 |
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification Y Bian, C Gan, L Xiao, F Li, X Long, Y Li, H Qi, J Zhou, S Wen, Y Lin CVPR Workshop, 2017 | 52 | 2017 |
RSPNet: Relative speed perception for unsupervised video representation learning P Chen, D Huang, D He, X Long, R Zeng, S Wen, M Tan, C Gan AAAI Conference on Artificial Intelligence 1 (3), 5, 2021 | 33 | 2021 |
Graph-PCNN: Two stage human pose estimation with graph pose refinement J Wang, X Long, Y Gao, E Ding, S Wen European Conference on Computer Vision, 492-508, 2020 | 21 | 2020 |
Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition D He, F Li, Q Zhao, X Long, Y Fu, S Wen CVPR Workshop, 2018 | 21 | 2018 |
PP-YOLOv2: A practical object detector X Huang, X Wang, W Lv, X Bai, X Long, K Deng, Q Dang, S Han, Q Liu, ... arXiv preprint arXiv:2104.10419, 2021 | 16 | 2021 |
Deep concept-wise temporal convolutional networks for action localization X Li, T Lin, X Liu, W Zuo, C Li, X Long, D He, F Li, S Wen, C Gan Proceedings of the 28th ACM International Conference on Multimedia, 4004-4012, 2020 | 16 | 2020 |
Purely attention based local feature integration for video classification X Long, G De Melo, D He, F Li, Z Chi, S Wen, C Gan IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020 | 6 | 2020 |
Method and apparatus for classifying video X Long, D He, F Li, CHI Zhizhen, Z Zhichao, X Zhao, P Wang, H Sun, ... US Patent 11,256,920, 2022 | 1 | 2022 |
Multi-modal fusion network based on relation-aware pyramid network for temporal action localization J Gao, T Lin, X Long, D He, F Li, X Li, S Wen, E Ding Shanghai Jiao Tong Univ., Shanghai, China. [Online]. Available: http://hacs … | 1 | |
Method, device, apparatus for predicting video coding complexity and storage medium Z Zhichao, D He, F Li, X Zhao, X Li, CHI Zhizhen, X Long, H Sun US Patent 11,259,029, 2022 | | 2022 |
Image processing method and apparatus, device, and storage medium J Wang, X Long, H Sun, Z Jin, D Errui US Patent App. 17/505,889, 2022 | | 2022 |
Method and apparatus for recognizing image, electronic device and storage medium Y Peng, X Long, H Zheng, Z Jia, B Zhang, W Xiaodi, Y Xin, Y Gu, Y Wang, ... US Patent App. 17/504,188, 2022 | | 2022 |
Method for Training Object Detection Model, Object Detection Method and Related Apparatus W Xiaodi, S Han, Y Feng, Y Xin, B Zhang, X Long, H Zheng, Y Peng, Z Jia US Patent App. 17/489,991, 2022 | | 2022 |