Stance detection in web and social media: a comparative study S Ghosh, P Singhania, S Singh, K Rudra, S Ghosh Experimental IR Meets Multilinguality, Multimodality, and Interaction: 10th …, 2019 | 92 | 2019 |
AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning S Singh, A Bhatele 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 8 | 2022 |
Inducing Cooperation in Multi-Agent Games Through Status-Quo Loss P Badjatiya, M Sarkar, A Sinha, S Singh, N Puri, B Krishnamurthy arXiv preprint, 2020 | 8* | 2020 |
A survey and empirical evaluation of parallel deep learning frameworks D Nichols, S Singh, SH Lin, A Bhatele arXiv preprint arXiv:2111.04949, 2021 | 6* | 2021 |
Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training S Singh, A Bhatele 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023 | 3 | 2023 |
A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training S Singh, O Ruwase, AA Awan, S Rajbhandari, Y He, A Bhatele Proceedings of the 37th International Conference on Supercomputing, 203-214, 2023 | 2 | 2023 |
PySchedCL: Leveraging Concurrency in Heterogeneous Data-Parallel Systems A Ghose, S Singh, V Kulaharia, L Dokara, S Maity, S Dey IEEE Transactions on Computers 71 (9), 2234-2247, 2021 | 2 | 2021 |
A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs S Singh, Z Sating, A Bhatele arXiv preprint arXiv:2305.13525, 2023 | 1* | 2023 |
Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization S Singh, Z Sating, A Bhatele arXiv preprint arXiv:2310.12298, 2023 | | 2023 |