LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis Z Shen, R Zhang, M Dell, BCG Lee, J Carlson, W Li Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021 | 113 | 2021 |
A large dataset of historical Japanese documents with complex layouts Z Shen, K Zhang, M Dell Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 46 | 2020 |
The semantic scholar open data platform R Kinney, C Anastasiades, R Authur, I Beltagy, J Bragg, A Buraczynski, ... arXiv preprint arXiv:2301.10140, 2023 | 42 | 2023 |
Deep learning based framework for automatic damage detection in aircraft engine borescope inspection Z Shen, X Wan, F Ye, X Guan, S Liu 2019 International Conference on Computing, Networking and Communications …, 2019 | 36 | 2019 |
Multi-lexsum: Real-world summaries of civil rights lawsuits at multiple granularities Z Shen, K Lo, L Yu, N Dahlberg, M Schlanger, D Downey Advances in Neural Information Processing Systems 35, 13158-13173, 2022 | 34 | 2022 |
VILA: Improving structured content extraction from scientific PDFs using visual layout groups Z Shen, K Lo, LL Wang, B Kuehl, DS Weld, D Downey Transactions of the Association for Computational Linguistics 10, 376-392, 2022 | 30* | 2022 |
Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search D King*, Z Shen*, N Subramani, DS Weld, I Beltagy, D Downey arXiv preprint arXiv:2203.08436, 2022 | 24 | 2022 |
PAWLS: PDF annotation with labels and structure M Neumann, Z Shen, S Skjonsberg arXiv preprint arXiv:2101.10281, 2021 | 15 | 2021 |
Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... arXiv preprint arXiv:2402.00159, 2024 | 13 | 2024 |
OLALA: Object-level active learning for efficient document layout annotation Z Shen, J Zhao, M Dell, Y Yu, W Li arXiv preprint arXiv:2010.01762, 2020 | 13* | 2020 |
Beyond summarization: Designing ai support for real-world expository writing tasks Z Shen, T August, P Siangliulue, K Lo, J Bragg, J Hammerbacher, ... arXiv preprint arXiv:2304.02623, 2023 | 11 | 2023 |
The semantic reader project: Augmenting scholarly documents through ai-powered interactive reading interfaces K Lo, JC Chang, A Head, J Bragg, AX Zhang, C Trier, C Anastasiades, ... arXiv preprint arXiv:2303.14334, 2023 | 10 | 2023 |
American stories: A large-scale structured text dataset of historical us newspapers M Dell, J Carlson, T Bryan, E Silcock, A Arora, Z Shen, L D'Amico-Wong, ... Advances in Neural Information Processing Systems 36, 2024 | 7 | 2024 |
Information Extraction from Text Regions with Complex Tabular Structure. K Zhang, Z Shen, J Zhou, M Dell Conference on Neural Information Processing Systems, 2019 | 5 | 2019 |
A Design Space for Intelligent and Interactive Writing Assistants M Lee, KI Gero, JJY Chung, SB Shum, V Raheja, H Shen, S Venugopalan, ... arXiv preprint arXiv:2403.14117, 2024 | 2 | 2024 |
Conceptualizing machine learning for dynamic information retrieval of electronic health record notes S Jiang, S Shen, M Agrawal, B Lam, N Kurtzman, S Horng, DR Karger, ... Machine Learning for Healthcare Conference, 343-359, 2023 | 2 | 2023 |
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents K Lo, Z Shen, B Newman, JZ Chang, R Authur, E Bransom, S Candra, ... EMNLP 2023 : System Demonstrations (🏆 Best Paper Demo Award 🏆 ), 495-507, 2023 | 1 | 2023 |
Towards Verifiable Text Generation with Symbolic References LT Hennigen*, S Shen*, A Nrusimha, B Gapp, D Sontag, Y Kim arXiv preprint arXiv:2311.09188, 2023 | 1 | 2023 |
Are layout-infused language models robust to layout distribution shifts? a case study with scientific documents C Chen, Z Shen, D Klein, G Stanovsky, D Downey, K Lo arXiv preprint arXiv:2306.01058, 2023 | 1 | 2023 |
Generating object stamps YA Mejjati, Z Shen, M Snower, A Gokaslan, O Wang, J Tompkin, KI Kim arXiv preprint arXiv:2001.02595, 2020 | 1 | 2020 |