Follow
Zineng Tang
Zineng Tang
UC Berkeley
Verified email at cs.unc.edu - Homepage
Title
Cited by
Cited by
Year
Any-to-any generation via composable diffusion
Z Tang, Z Yang, C Zhu, M Zeng, M Bansal
Advances in Neural Information Processing Systems 36, 2024
862024
Unifying vision, text, and layout for universal document processing
Z Tang, Z Yang, G Wang, Y Fang, Y Liu, C Zhu, M Zeng, C Zhang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
632022
Decembert: Learning from noisy instructional videos via dense captions and entropy minimization
Z Tang, J Lei, M Bansal
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
592021
Dense-caption matching and frame-selection gating for temporal localization in VideoQA
H Kim, Z Tang, M Bansal
arXiv preprint arXiv:2005.06409, 2020
382020
TVLT: Textless vision-language transformer
Z Tang, J Cho, Y Nie, M Bansal
Advances in neural information processing systems 35, 9617-9632, 2022
302022
Vidlankd: Improving language understanding via video-distilled knowledge transfer
Z Tang, J Cho, H Tan, M Bansal
Advances in Neural Information Processing Systems 34, 24468-24481, 2021
282021
Paxion: Patching action knowledge in video-language foundation models
Z Wang, A Blume, S Li, G Liu, J Cho, Z Tang, M Bansal, H Ji
Advances in Neural Information Processing Systems 36, 2024
132024
CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation
Z Tang, Z Yang, M Khademi, Y Liu, C Zhu, M Bansal
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
122024
Perceiver-vl: Efficient vision-and-language modeling with iterative latent attention
Z Tang, J Cho, J Lei, M Bansal
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023
72023
Continuous language generative flow
Z Tang, S Zhang, H Kim, M Bansal
Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021
62021
Deep colorization by variation
Z Tang
Proceedings of the 28th ACM International Conference on Information and …, 2019
22019
Supplementary Materials for TVLT: Textless Vision-Language Transformer
Z Tang, J Cho, Y Nie, M Bansal
Supplementary Material for PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Z Tang, J Cho, JLM Bansal
The system can't perform the operation now. Try again later.
Articles 1–13