Botian Shi

Cited by

	All	Since 2019
Citations	1036	1032
h-index	14	14
i10-index	17	17

Citations per year

Year	Citations
2019	3
2020	42
2021	85
2022	198
2023	464
2024	240

Public access

Available	6 articles
Not available	0 articles

Based on funding mandates

Co-authors

Nan Duan	Senior Principal Research Manager, Microsoft Research	Verified email at microsoft.com
Ming Zhou (周明)	Chief Scientist at Sinovation, ACL president (2019), VP of CCF (2020-2024)	Verified email at chuangxin.com
Huaishao Luo	JD AI Research	Verified email at jd.com
Pan Lu	University of California, Los Angeles	Verified email at cs.ucla.edu
Yaobo Liang	microsoft.com	Verified email at microsoft.com
Zhongyuan Wang	BAAI	Verified email at baai.ac.cn
Yujing Wang	Peking University, Microsoft Research	Verified email at microsoft.com
Graham Neubig	Carnegie Mellon University	Verified email at cs.cmu.edu
Junyi Du	University of Southern California	Verified email at usc.edu
Fangzheng (Frank) Xu	Carnegie Mellon University	Verified email at cs.cmu.edu
Rong-Cheng Tu	Beijing Institute of Technology	Verified email at bit.edu.cn

Botian Shi

Shanghai Artificial Intelligence Laboratory

Verified email at pjlab.org.cn

Autonomous Driving


Title	Cited by	Year
Univl: A unified video and language pre-training model for multimodal understanding and generation. H Luo, L Ji, B Shi, H Huang, N Duan, T Li, J Li, T Bharti, M Zhou. arXiv preprint arXiv:2002.06353, 2020	387	2020
Multi-modal sensor fusion for auto driving perception: A survey. K Huang, B Shi, X Li, X Li, S Huang, Y Li. arXiv preprint arXiv:2202.02703, 2022	82	2022
Knowledge Aware Semantic Concept Expansion for Image-Text Matching. B Shi, L Ji, P Lu, Z Niu, N Duan. Proceedings of the Twenty-Eighth International Joint Conference on …, 2019	72	2019
Dense procedure captioning in narrated instructional videos. B Shi, L Ji, Y Liang, N Duan, P Chen, Z Niu, M Zhou. Proceedings of the 57th annual meeting of the association for computational …, 2019	72	2019
Microsoft concept graph: Mining semantic concepts for short text understanding. L Ji, Y Wang, B Shi, D Zhang, Z Wang, J Yan. Data Intelligence 1 (3), 238-270, 2019	49	2019
Drive like a human: Rethinking autonomous driving with large language models. D Fu, X Li, L Wen, M Dou, P Cai, B Shi, Y Qiao. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024	48	2024
Logonet: Towards accurate 3d object detection with local-to-global cross-modal fusion. X Li, T Ma, Y Hou, B Shi, Y Yang, Y Liu, X Wu, Q Chen, Y Li, Y Qiao, L He. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	44	2023
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection. X Li, B Shi, Y Hou, X Wu, T Ma, Y Li, L He. Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022	35	2022
Dilu: A knowledge-driven approach to autonomous driving with large language models. L Wen, D Fu, X Li, X Cai, T Ma, P Cai, M Dou, B Shi, L He, Y Qiao. arXiv preprint arXiv:2309.16292, 2023	29	2023
On the road with GPT-4V(ision): Early explorations of visual-language model on autonomous driving. L Wen, X Yang, D Fu, X Wang, P Cai, X Li, T Ma, Y Li, L Xu, D Shang, ... arXiv preprint arXiv:2311.05332, 2023	25	2023
Streetsurf: Extending multi-view implicit surface reconstruction to street views. J Guo, N Deng, X Li, Y Bai, B Shi, C Wang, C Ding, D Wang, Y Li. arXiv preprint arXiv:2306.04988, 2023	23	2023
Learning semantic concepts and temporal alignment for narrated video procedural captioning. B Shi, L Ji, Z Niu, N Duan, M Zhou, X Chen. Proceedings of the 28th ACM international conference on multimedia, 4355-4363, 2020	21	2020
A benchmark for structured procedural knowledge extraction from cooking videos. FF Xu, L Ji, B Shi, J Du, G Neubig, Y Bisk, N Duan. arXiv preprint arXiv:2005.00706, 2020	21	2020
Uni3d: A unified baseline for multi-dataset 3d object detection. B Zhang, J Yuan, B Shi, T Chen, Y Li, Y Qiao. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	17	2023
Bi3d: Bi-domain active learning for cross-domain 3d object detection. J Yuan, B Zhang, X Yan, T Chen, B Shi, Y Li, Y Qiao. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	14	2023
Learning cross-image object semantic relation in transformer for few-shot fine-grained image classification. B Zhang, J Yuan, B Li, T Chen, J Fan, B Shi. Proceedings of the 30th ACM International Conference on Multimedia, 2135-2144, 2022	11	2022
Ad-pt: Autonomous driving pre-training with large-scale point cloud dataset. J Yuan, B Zhang, X Yan, B Shi, T Chen, Y Li, Y Qiao. Advances in Neural Information Processing Systems 36, 2024	10	2024
Towards knowledge-driven autonomous driving. X Li, Y Bai, P Cai, L Wen, D Fu, B Zhang, X Yang, X Cai, T Ma, J Guo, ... arXiv preprint arXiv:2312.04316, 2023	9	2023
Detzero: Rethinking offboard 3d object detection with long-term sequential point clouds. T Ma, X Yang, H Zhou, X Li, B Shi, J Liu, Y Yang, Z Liu, L He, Y Qiao, Y Li, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	9	2023
Hashing based efficient inference for image-text matching. RC Tu, L Ji, H Luo, B Shi, HY Huang, N Duan, XL Mao. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021	9	2021
