Palm: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research 24 (240), 1-113, 2023 | 6001 | 2023 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3600 | 2023 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1789 | 2023 |
Program synthesis with large language models J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ... arXiv preprint arXiv:2108.07732, 2021 | 1703 | 2021 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1428 | 2024 |
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 1149 | 2024 |
Structured denoising diffusion models in discrete state-spaces J Austin, DD Johnson, J Ho, D Tarlow, R Van Den Berg Advances in neural information processing systems 34, 17981-17993, 2021 | 939 | 2021 |
Show your work: Scratchpads for intermediate computation with language models M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ... | 685 | 2021 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 164 | 2023 |
Palm: Scaling language modeling with pathways, 2022 A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 140 | 2022 |
Language model cascades D Dohan, W Xu, A Lewkowycz, J Austin, D Bieber, RG Lopes, Y Wu, ... arXiv preprint arXiv:2207.10342, 2022 | 90 | 2022 |
Measuring the impact of programming language distribution G Orlanski, K Xiao, X Garcia, J Hui, J Howland, J Malmaud, J Austin, ... International Conference on Machine Learning, 26619-26645, 2023 | 28 | 2023 |
Titan: A parallel asynchronous library for multi-agent and soft-body robotics using nvidia cuda J Austin, R Corrales-Fatou, S Wyetzner, H Lipson 2020 IEEE International Conference on Robotics and Automation (ICRA), 7754-7760, 2020 | 27 | 2020 |
Beyond in-place corruption: Insertion and deletion in denoising probabilistic models DD Johnson, J Austin, R Berg, D Tarlow arXiv preprint arXiv:2107.07675, 2021 | 19 | 2021 |
Resolving code review comments with machine learning A Frömmgen, J Austin, P Choy, N Ghelani, L Kharatyan, G Surita, ... Proceedings of the 46th International Conference on Software Engineering …, 2024 | 18 | 2024 |
Large vacuum flux surfaces generated by tilted planar coils JL Li, J Austin, KC Hammond, BY Israeli, FA Volpe Plasma physics and controlled fusion 61 (7), 075005, 2019 | 3 | 2019 |
Tilted Planar Interlinked Coils as a Means of Generating Rotational Transform-Modelling and Experiment. SF Mazhar, F Volpe, R Diaz-Pacheco, K Hammond, B Israeli, J Li, J Mann, ... APS April Meeting Abstracts 2018, F01. 013, 2018 | 1 | 2018 |
Image Analysis by Prompting of Machine-Learned Models Using Chain of Thought JW Wei, D Zhou, X Wang, DE Schuurmans, QV Le, MP Bosma, EH Chi, ... US Patent App. 18/967,327, 2025 | | 2025 |
How to Scale Your Model J Austin, S Douglas, R Frostig, A Levskaya, C Chen, S Vikram, F Lebron, ... Google DeepMind, 2025 | | 2025 |
Machine-learned models for generating code snippets with predicted placeholders for optimizing software development DDW Johnson, DS Tarlow, M Tabachnyk, MH Rasi, J Austin, ... US Patent App. 18/618,371, 2024 | | 2024 |