Zineng Tang

Cited by

	All	Since 2019
Citations	344	344
h-index	8	8
i10-index	8	8

180

135

202020212022202320244 19 43 110 166

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Mohit BansalParker Distinguished Professor, Computer Science, UNC Chapel HillVerified email at cs.unc.edu
Chenguang ZhuHead of Zoom GenAI ScienceVerified email at zoom.us
Ziyi YangPrincipal Researcher, MicrosoftVerified email at stanford.edu
Jaemin ChoPhD Student at UNC Chapel HillVerified email at cs.unc.edu
Yang LiuMicrosoftVerified email at microsoft.com
Jie Lei 雷杰Research Scientist, Meta AIVerified email at fb.com
Hyounghun KimUlsan National Institute of Science and Technology (UNIST)Verified email at unist.ac.kr
Yixin NieMeta, UNC Chapel HillVerified email at meta.com
Guoxin WangMicrosoftVerified email at microsoft.com
Yuwei FangResearch Scientist, Snap IncVerified email at snapchat.com
Hao TanAdobe ResearchVerified email at adobe.com
Shiyue ZhangBloomberg AIVerified email at cs.unc.edu

Zineng Tang

UC Berkeley

Verified email at cs.unc.edu - Homepage

NLP Multi-modal Grounded Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Any-to-any generation via composable diffusion Z Tang, Z Yang, C Zhu, M Zeng, M Bansal Advances in Neural Information Processing Systems 36, 2024	86	2024
Unifying vision, text, and layout for universal document processing Z Tang, Z Yang, G Wang, Y Fang, Y Liu, C Zhu, M Zeng, C Zhang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	63	2022
Decembert: Learning from noisy instructional videos via dense captions and entropy minimization Z Tang, J Lei, M Bansal Proceedings of the 2021 Conference of the North American Chapter of the …, 2021	59	2021
Dense-caption matching and frame-selection gating for temporal localization in VideoQA H Kim, Z Tang, M Bansal arXiv preprint arXiv:2005.06409, 2020	38	2020
TVLT: Textless vision-language transformer Z Tang, J Cho, Y Nie, M Bansal Advances in neural information processing systems 35, 9617-9632, 2022	30	2022
Vidlankd: Improving language understanding via video-distilled knowledge transfer Z Tang, J Cho, H Tan, M Bansal Advances in Neural Information Processing Systems 34, 24468-24481, 2021	28	2021
Paxion: Patching action knowledge in video-language foundation models Z Wang, A Blume, S Li, G Liu, J Cho, Z Tang, M Bansal, H Ji Advances in Neural Information Processing Systems 36, 2024	13	2024
CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation Z Tang, Z Yang, M Khademi, Y Liu, C Zhu, M Bansal Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	12	2024
Perceiver-vl: Efficient vision-and-language modeling with iterative latent attention Z Tang, J Cho, J Lei, M Bansal Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023	7	2023
Continuous language generative flow Z Tang, S Zhang, H Kim, M Bansal Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021	6	2021
Deep colorization by variation Z Tang Proceedings of the 28th ACM International Conference on Information and …, 2019	2	2019
Supplementary Materials for TVLT: Textless Vision-Language Transformer Z Tang, J Cho, Y Nie, M Bansal
Supplementary Material for PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention Z Tang, J Cho, JLM Bansal

The system can't perform the operation now. Try again later.

Articles 1–13

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors