Compute trends across three eras of machine learning J Sevilla, L Heim, A Ho, T Besiroglu, M Hobbhahn, P Villalobos 2022 International Joint Conference on Neural Networks (IJCNN), 1-8, 2022 | 218 | 2022 |
Will we run out of data? an analysis of the limits of scaling datasets in machine learning P Villalobos, J Sevilla, L Heim, T Besiroglu, M Hobbhahn, A Ho arXiv preprint arXiv:2211.04325, 2022 | 94 | 2022 |
Machine learning model sizes and the parameter gap P Villalobos, J Sevilla, T Besiroglu, L Heim, A Ho, M Hobbhahn arXiv preprint arXiv:2207.02852, 2022 | 39 | 2022 |
Forecasting timelines of quantum computing J Sevilla, CJ Riedel arXiv preprint arXiv:2009.05045, 2020 | 38 | 2020 |
Parameter, compute and data trends in machine learning J Sevilla, P Villalobos, JF Cerón, M Burtell, L Heim, AB Nanjajjar, A Ho, ... 2022-05-30]. https://docs.google.com/spreadsheets/d/1AAIebj …, 2021 | 15 | 2021 |
Estimating training compute of deep learning models J Sevilla, L Heim, M Hobbhahn, T Besiroglu, A Ho, P Villalobos Epoch, January 20, 2022 | 13 | 2022 |
Parameter counts in machine learning J Sevilla, P Villalobos, J Cerón AI Alignment Forum, 2021 | 13 | 2021 |
Compute trends across three eras of machine learning. arXiv J Sevilla, L Heim, A Ho, T Besiroglu, M Hobbhahn, P Villalobos arXiv preprint arXiv:2202.05924, 2022 | 8 | 2022 |
Compute Trends Across Three Eras of Machine Learning. (2022) J Sevilla, L Heim, A Ho, T Besiroglu, M Hobbhahn, P Villalobos URL: https://arxiv.org/abs/2202.05924. doi 10, 2022 | 7 | 2022 |
Explaining data using causal Bayesian Networks J Sevilla 2nd Workshop on Interactive Natural Language Technology for Explainable …, 2020 | 7 | 2020 |
Algorithmic progress in language models A Ho, T Besiroglu, E Erdil, D Owen, R Rahman, ZC Guo, D Atkinson, ... arXiv preprint arXiv:2403.05812, 2024 | 3 | 2024 |
Finding, scoring and explaining arguments in Bayesian networks J Sevilla arXiv preprint arXiv:2112.00799, 2021 | 3 | 2021 |
Please Report Your Compute J Sevilla, A Ho, T Besiroglu Communications of the ACM 66 (5), 30-32, 2023 | 2 | 2023 |
A Bayesian model of records J Sevilla, J Lindbloom Authorea Preprints, 2022 | 1 | 2022 |
Implications of Quantum Computing for Artificial Intelligence alignment research J Sevilla, P Moreno arXiv preprint arXiv:1908.07613, 2019 | 1 | 2019 |
ZTF Observations of the Candidate Fast Blue Optical Transient AT2023vth J Sevilla, ML Li, AYQ Ho Transient Name Server AstroNote 297, 1, 2023 | | 2023 |
Power Law Trends in Speedrunning and Machine Learning E Erdil, J Sevilla arXiv preprint arXiv:2304.10004, 2023 | | 2023 |
Key trends and figures in Machine Learning Epoch https://epochai.org/trends, 2023 | | 2023 |
Modelling a Time Series of Records with PyMC3 J Sevilla, J Lindbloom Authorea Preprints, 2021 | | 2021 |
A conditional independence test for causality in econometrics J Sevilla, A Mayn arXiv preprint arXiv:2107.09765, 2021 | | 2021 |