Jaemin Cho

Cited by

	All	Since 2019
Citations	1757	1748
h-index	16	16
i10-index	17	17

Citations per year
Year	2018	2019	2020	2021	2022	2023	2024
Citations	6	33	45	93	262	686	627

Public access

12 articles available
0 articles not available

Based on funding mandates

Co-authors

Mohit Bansal - Parker Distinguished Professor, Computer Science, UNC Chapel Hill - Verified email at cs.unc.edu
Abhay Zala - University of North Carolina at Chapel Hill - Verified email at cs.unc.edu
Yi-Lin Sung - UNC Chapel Hill - Verified email at cs.unc.edu
Hao Tan - Adobe Research - Verified email at adobe.com
Jie Lei 雷杰 - Research Scientist, Meta AI - Verified email at fb.com
Zineng Tang - UC Berkeley - Verified email at cs.unc.edu
Hannaneh Hajishirzi - University of Washington; Allen AI - Verified email at cs.washington.edu
Han Lin - PhD Student, UNC NLP Group - Verified email at cs.unc.edu
Seunghyun Yoon - Adobe Research - Verified email at adobe.com
Trung H. Bui - Senior Research Scientist & Research Manager, Adobe Research - Verified email at adobe.com
Gunhee Kim - Professor, Seoul National University - Verified email at snu.ac.kr
Yookoon Park - Columbia University - Verified email at columbia.edu
Jiasen Lu - Research Scientist, Apple - Verified email at apple.com
Aniruddha Kembhavi - Senior Director of Computer Vision, Allen Institute of Artificial Intelligence - Verified email at allenai.org
Jordi Pont-Tuset - Research Scientist at Google Deepmind - Verified email at google.com
Jason Baldridge - Research Scientist, Google - Verified email at google.com
Su Wang - Google, Senior Research Engineer - Verified email at google.com
Heng Ji - Professor of Computer Science, University of Illinois Urbana-Champaign, Amazon Scholar - Verified email at illinois.edu
Shoubin Yu - UNC, Chapel Hill - Verified email at cs.unc.edu
Prateek Yadav - PhD, University of North Carolina Chapel Hill - Verified email at cs.unc.edu

Jaemin Cho

PhD Student at UNC Chapel Hill

Verified email at cs.unc.edu

Multimodal Learning, Natural Language Processing, Machine Learning


Title	Cited by	Year
Unifying Vision-and-Language Tasks via Text Generation - J Cho, J Lei, H Tan, M Bansal - ICML, 2021	480	2021
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks - YL Sung, J Cho, M Bansal - CVPR, 2022	281	2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models - J Cho, A Zala, M Bansal - ICCV, 2023	183*	2023
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning - YL Sung, J Cho, M Bansal - NeurIPS, 2022	152	2022
A Hierarchical Latent Structure for Variational Conversation Modeling - Y Park, J Cho, G Kim - NAACL, 2018	129	2018
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers - J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi - EMNLP, 2020	104	2020
Self-Chained Image-Language Model for Video Localization and Question Answering - S Yu, J Cho, P Yadav, M Bansal - NeurIPS, 2023	67	2023
Fine-grained Image Captioning with CLIP Reward - J Cho, S Yoon, A Kale, F Dernoncourt, T Bui, M Bansal - Findings of NAACL, 2022	66	2022
Mixture Content Selection for Diverse Sequence Generation - J Cho, M Seo, H Hajishirzi - EMNLP, 2019	66	2019
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation - J Cho, A Zala, M Bansal - NeurIPS, 2023	37	2023
TVLT: Textless Vision-Language Transformer - Z Tang, J Cho, Y Nie, M Bansal - NeurIPS, 2022	30	2022
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer - Z Tang, J Cho, H Tan, M Bansal - NeurIPS, 2021	28	2021
VideoDirectorGPT: Consistent Multi-Scene Video Generation via LLM-Guided Planning - H Lin, A Zala, J Cho, M Bansal - arXiv preprint arXiv:2309.15091, 2023	27	2023
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation - J Cho, Y Hu, R Garg, P Anderson, R Krishna, J Baldridge, M Bansal, ... - ICLR, 2024	26	2024
Hierarchical Video-Moment Retrieval and Step-Captioning - A Zala, J Cho, S Kottur, X Chen, B Oğuz, Y Mehdad, M Bansal - CVPR, 2023	23	2023
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding - RG Reddy, X Rui, M Li, X Lin, H Wen, J Cho, L Huang, M Bansal, A Sil, ... - AAAI, 2022	17	2022
Paxion: Patching Action Knowledge in Video-Language Foundation Models - Z Wang, A Blume, S Li, G Liu, J Cho, Z Tang, M Bansal, H Ji - NeurIPS, 2023	13	2023
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention - Z Tang, J Cho, J Lei, M Bansal - WACV, 2023	7	2023
DOCCI: Descriptions of Connected and Contrasting Images - Y Onoe, S Rane, Z Berger, Y Bitton, J Cho, R Garg, A Ku, Z Parekh, ... - arXiv preprint arXiv:2404.19753, 2024	6	2024
Contrastive region guidance: Improving grounding in vision-language models without training - D Wan, J Cho, E Stengel-Eskin, M Bansal - arXiv preprint arXiv:2403.02325, 2024	5	2024
