Vedanuj Goswami
Llama Team, Research Engineer, Meta AI
Verified email at meta.com
Title
Cited by
Year
Llama 2: Open foundation and fine-tuned chat models
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 2023
Cited by: 11935; Year: 2023
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
Cited by: 1906; Year: 2024
No language left behind: Scaling human-centered machine translation
MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ...
arXiv preprint arXiv:2207.04672, 2022
Cited by: 800; Year: 2022
Flava: A foundational language and vision alignment model
A Singh*, R Hu*, V Goswami*, G Couairon, W Galuba, M Rohrbach, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 718; Year: 2022
The hateful memes challenge: Detecting hate speech in multimodal memes
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, P Ringshia, ...
Advances in neural information processing systems 33, 2611-2624, 2020
Cited by: 638; Year: 2020
12-in-1: Multi-task vision and language representation learning
J Lu*, V Goswami*, M Rohrbach, D Parikh, S Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 560; Year: 2020
MMF: A multimodal framework for vision and language research
A Singh, V Goswami, V Natarajan, Y Jiang, X Chen, M Shah, M Rohrbach, ...
URL: https://github.com/facebookresearch/mmf
378*
Only time can tell: Discovering temporal data for temporal modeling
L Sevilla-Lara, S Zha, Z Yan, V Goswami, M Feiszli, L Torresani
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021
Cited by: 91; Year: 2021
Creative sketch generation
S Ge, V Goswami, CL Zitnick, D Parikh
arXiv preprint arXiv:2011.10039, 2020
Cited by: 87; Year: 2020
The hateful memes challenge: Competition report
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ...
NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021
Cited by: 74; Year: 2021
Human-adversarial visual question answering
S Sheng, A Singh, V Goswami, J Magana, T Thrush, W Galuba, D Parikh, ...
Advances in Neural Information Processing Systems 34, 20346-20359, 2021
Cited by: 67; Year: 2021
Are we pretraining it right? digging deeper into visio-linguistic pretraining
A Singh, V Goswami, D Parikh
arXiv preprint arXiv:2004.08744, 2020
Cited by: 50; Year: 2020
Movie: Revisiting modulated convolutions for visual counting and beyond
DK Nguyen, V Goswami, X Chen
arXiv preprint arXiv:2004.11883, 2020
Cited by: 37; Year: 2020
Speechmatrix: A large-scale mined corpus of multilingual speech-to-speech translations
PA Duquenne, H Gong, N Dong, J Du, A Lee, V Goswami, C Wang, J Pino, ...
arXiv preprint arXiv:2211.04508, 2022
Cited by: 33; Year: 2022
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation
M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang
arXiv preprint arXiv:2303.00628, 2023
Cited by: 31; Year: 2023
The llama 3 herd of models
A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ...
arXiv e-prints, arXiv:2407.21783, 2024
Cited by: 29; Year: 2024
Small data, big impact: Leveraging minimal data for effective machine translation
J Maillard, C Gao, E Kalbassi, KR Sadagopan, V Goswami, P Koehn, ...
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
Cited by: 27; Year: 2023
Scaling neural machine translation to 200 languages
NLLB Team
Nature 630 (8018), 841, 2024
Cited by: 25; Year: 2024
Revisiting machine translation for cross-lingual classification
M Artetxe, V Goswami, S Bhosale, A Fan, L Zettlemoyer
arXiv preprint arXiv:2305.14240, 2023
Cited by: 23; Year: 2023
Tricks for training sparse translation models
D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan
arXiv preprint arXiv:2110.08246, 2021
Cited by: 22; Year: 2021