Yang Jin
Title
Cited by
Year
Beyond short-term snippet: Video relation detection with spatio-temporal global context
C Liu, Y Jin, K Xu, G Gong, Y Mu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
Cited by: 79; Year: 2020
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Y Jin, K Xu, L Chen, C Liao, J Tan, B Chen, C Lei, A Liu, C Song, X Lei, ...
ICLR 2024, 2023
Cited by: 48; Year: 2023
Learning to effectively estimate the travel time for fastest route recommendation
N Wu, J Wang, WX Zhao, Y Jin
Proceedings of the 28th ACM International Conference on Information and …, 2019
Cited by: 28; Year: 2019
Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding
Y Jin, Y Li, Z Yuan, Y Mu
Advances in Neural Information Processing Systems 35, 29192-29204, 2022
Cited by: 24; Year: 2022
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Y Jin, Z Sun, K Xu, L Chen, H Jiang, Q Huang, C Song, Y Liu, D Zhang, ...
ICML 2024, 2024
Cited by: 23; Year: 2024
Learning instance-level representation for large-scale multi-modal pretraining in e-commerce
Y Jin, Y Li, Z Yuan, Y Mu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 12; Year: 2023
Complex video action reasoning via learnable markov logic network
Y Jin, L Zhu, Y Mu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 10; Year: 2022
Zero-shot video event detection with high-order semantic concept discovery and matching
Y Jin, W Jiang, Y Yang, Y Mu
IEEE Transactions on Multimedia 24, 1896-1908, 2021
Cited by: 9; Year: 2021
Pyramidal flow matching for efficient video generative modeling
Y Jin, Z Sun, N Li, K Xu, H Jiang, N Zhuang, Q Huang, Y Song, Y Mu, ...
arXiv preprint arXiv:2410.05954, 2024
Cited by: 8; Year: 2024
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization. arXiv 2024
Y Jin, K Xu, K Xu, L Chen, C Liao, J Tan, Q Huang, B Chen, C Lei, A Liu
arXiv preprint arXiv:2309.04669
Cited by: 5
Harder Tasks Need More Experts: Dynamic Routing in MoE Models
Q Huang, Z An, N Zhuang, M Tao, C Zhang, Y Jin, K Xu, L Chen, S Huang, ...
ACL 2024, 2024
Cited by: 4; Year: 2024
Video action segmentation via contextually refined temporal keypoints
B Jiang, Y Jin, Z Tan, Y Mu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 4; Year: 2023
Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment
Y Jin, Y Mu
ECCV 2024, 2024
Cited by: 1; Year: 2024
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Z Sun, Z Yang, Y Jin, H Chi, K Xu, L Chen, H Jiang, Y Song, K Gai, Y Mu
arXiv preprint arXiv:2405.14677, 2024
Year: 2024