Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Gemini Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 857 | 2024 |
Explore, establish, exploit: Red teaming language models from scratch S Casper, J Lin, J Kwon, G Culp, D Hadfield-Menell arXiv preprint arXiv:2306.09442, 2023 | 80 | 2023 |
ScreenAI: A Vision-Language Model for UI and Infographics Understanding G Baechler, S Sunkara, M Wang, F Zubach, H Mansoor, V Etter, ... The 33rd International Joint Conference on Artificial Intelligence (IJCAI), 2024 | 40 | 2024 |
ScreenQA: Large-scale question-answer pairs over mobile app screenshots YC Hsiao, F Zubach, G Baechler, V Carbune, J Lin, M Wang, S Sunkara, ... arXiv preprint arXiv:2209.08199, 2022 | 22 | 2022 |
Zorro: the masked multimodal transformer A Recasens, J Lin, J Carreira, D Jaegle, L Wang, J Alayrac, P Luc, ... arXiv preprint arXiv:2301.09595, 2023 | 21 | 2023 |
Interactive classification for deep learning interpretation ÁA Cabrera, F Hohman, J Lin, DH Chau arXiv preprint arXiv:1806.05660, 2018 | 10 | 2018 |
IOTA: A cryptographic perspective B Baek, J Lin Harvard University: Cambridge, MA, USA, 2019 | 8 | 2019 |
WebQuest: A benchmark for multimodal QA on web page sequences M Wang, S Sunkara, G Baechler, J Lin, Y Zhu, F Zubach, L Shu, J Chen arXiv preprint arXiv:2409.13711, 2024 | 2 | 2024 |
Diffusion Models as Visual Reasoners J Lin, M Srikanth AAAI 23 Workshop on Creative AI Across Modalities, 2023 | 1 | 2023 |
Quantum computing and its effects on deciphering public key encryptions J Lin Ethical Hacking and Systems Defense, 2015 | 1 | 2015 |
A Distribution-Aware Approach to Dense Retrieval J Lin, J Young, S Arora Stanford University, 2022 | | 2022 |
Data Augmentation with LSTM and Proximal Policy Optimization J Lin, Y Emam | | 2019 |
AdVis: Visualizing and Attributing ML Attacks to Adversarial Examples J Lin, D Soylu | | 2018 |