Botian Shi
Shanghai Artificial Intelligence Laboratory
Verified email at pjlab.org.cn
Title
Cited by
Year
UniVL: A unified video and language pre-training model for multimodal understanding and generation
H Luo, L Ji, B Shi, H Huang, N Duan, T Li, J Li, T Bharti, M Zhou
arXiv preprint arXiv:2002.06353, 2020
Cited by: 387; Year: 2020
Multi-modal sensor fusion for auto driving perception: A survey
K Huang, B Shi, X Li, X Li, S Huang, Y Li
arXiv preprint arXiv:2202.02703, 2022
Cited by: 82; Year: 2022
Knowledge Aware Semantic Concept Expansion for Image-Text Matching
B Shi, L Ji, P Lu, Z Niu, N Duan
Proceedings of the Twenty-Eighth International Joint Conference on …, 2019
Cited by: 72; Year: 2019
Dense procedure captioning in narrated instructional videos
B Shi, L Ji, Y Liang, N Duan, P Chen, Z Niu, M Zhou
Proceedings of the 57th annual meeting of the association for computational …, 2019
Cited by: 72; Year: 2019
Microsoft concept graph: Mining semantic concepts for short text understanding
L Ji, Y Wang, B Shi, D Zhang, Z Wang, J Yan
Data Intelligence 1 (3), 238-270, 2019
Cited by: 49; Year: 2019
Drive like a human: Rethinking autonomous driving with large language models
D Fu, X Li, L Wen, M Dou, P Cai, B Shi, Y Qiao
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Cited by: 48; Year: 2024
LoGoNet: Towards accurate 3D object detection with local-to-global cross-modal fusion
X Li, T Ma, Y Hou, B Shi, Y Yang, Y Liu, X Wu, Q Chen, Y Li, Y Qiao, L He
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 44; Year: 2023
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
X Li, B Shi, Y Hou, X Wu, T Ma, Y Li, L He
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Cited by: 35; Year: 2022
DiLu: A knowledge-driven approach to autonomous driving with large language models
L Wen, D Fu, X Li, X Cai, T Ma, P Cai, M Dou, B Shi, L He, Y Qiao
arXiv preprint arXiv:2309.16292, 2023
Cited by: 29; Year: 2023
On the road with GPT-4V(ision): Early explorations of visual-language model on autonomous driving
L Wen, X Yang, D Fu, X Wang, P Cai, X Li, T Ma, Y Li, L Xu, D Shang, ...
arXiv preprint arXiv:2311.05332, 2023
Cited by: 25; Year: 2023
StreetSurf: Extending multi-view implicit surface reconstruction to street views
J Guo, N Deng, X Li, Y Bai, B Shi, C Wang, C Ding, D Wang, Y Li
arXiv preprint arXiv:2306.04988, 2023
Cited by: 23; Year: 2023
Learning semantic concepts and temporal alignment for narrated video procedural captioning
B Shi, L Ji, Z Niu, N Duan, M Zhou, X Chen
Proceedings of the 28th ACM international conference on multimedia, 4355-4363, 2020
Cited by: 21; Year: 2020
A benchmark for structured procedural knowledge extraction from cooking videos
FF Xu, L Ji, B Shi, J Du, G Neubig, Y Bisk, N Duan
arXiv preprint arXiv:2005.00706, 2020
Cited by: 21; Year: 2020
Uni3D: A unified baseline for multi-dataset 3D object detection
B Zhang, J Yuan, B Shi, T Chen, Y Li, Y Qiao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 17; Year: 2023
Bi3D: Bi-domain active learning for cross-domain 3D object detection
J Yuan, B Zhang, X Yan, T Chen, B Shi, Y Li, Y Qiao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 14; Year: 2023
Learning cross-image object semantic relation in transformer for few-shot fine-grained image classification
B Zhang, J Yuan, B Li, T Chen, J Fan, B Shi
Proceedings of the 30th ACM International Conference on Multimedia, 2135-2144, 2022
Cited by: 11; Year: 2022
AD-PT: Autonomous driving pre-training with large-scale point cloud dataset
J Yuan, B Zhang, X Yan, B Shi, T Chen, Y Li, Y Qiao
Advances in Neural Information Processing Systems 36, 2024
Cited by: 10; Year: 2024
Towards knowledge-driven autonomous driving
X Li, Y Bai, P Cai, L Wen, D Fu, B Zhang, X Yang, X Cai, T Ma, J Guo, ...
arXiv preprint arXiv:2312.04316, 2023
Cited by: 9; Year: 2023
DetZero: Rethinking offboard 3D object detection with long-term sequential point clouds
T Ma, X Yang, H Zhou, X Li, B Shi, J Liu, Y Yang, Z Liu, L He, Y Qiao, Y Li, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 9; Year: 2023
Hashing based efficient inference for image-text matching
RC Tu, L Ji, H Luo, B Shi, HY Huang, N Duan, XL Mao
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
Cited by: 9; Year: 2021