Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 568 | 2021 |
Mind the Gap: Assessing Temporal Generalization in Neural Language Models A Lazaridou, A Kuncoro, E Gribovskaya, D Agrawal, A Liska, T Terzi, ... arXiv preprint arXiv:2102.01951, 2021 | 146* | 2021 |
Streamingqa: A benchmark for adaptation to new knowledge over time in question answering models A Liska, T Kocisky, E Gribovskaya, T Terzi, E Sezener, D Agrawal, ... International Conference on Machine Learning, 13604-13622, 2022 | 14 | 2022 |
Detecting semi-plausible response patterns T Terzi London School of Economics and Political Science, 2017 | 3 | 2017 |