Follow
Saurabh Pujar
Title
Cited by
Cited by
Year
Codenet: A large-scale ai for code dataset for learning a diversity of coding tasks
R Puri, DS Kung, G Janssen, W Zhang, G Domeniconi, V Zolotov, J Dolby, ...
arXiv preprint arXiv:2105.12655, 2021
2112021
D2a: A dataset built for ai-based vulnerability detection methods using differential analysis
Y Zheng, S Pujar, B Lewis, L Buratti, E Epstein, B Yang, J Laredo, ...
2021 IEEE/ACM 43rd International Conference on Software Engineering …, 2021
1012021
Exploring software naturalness through neural language models
L Buratti, S Pujar, M Bornea, S McCarley, Y Zheng, G Rossiello, A Morari, ...
arXiv preprint arXiv:2006.12641, 2020
892020
Towards Learning (Dis)-Similarity of Source Code from Program Contrasts
Y Ding, L Buratti, S Pujar, A Morari, B Ray, S Chakraborty
arXiv preprint arXiv:2110.03868, 2021
52*2021
The techqa dataset
V Castelli, R Chakravarti, S Dana, A Ferritto, R Florian, M Franz, D Garg, ...
arXiv preprint arXiv:1911.02984, 2019
36*2019
Beyond accuracy: Evaluating self-consistency of code large language models with identitychain
MJ Min, Y Ding, L Buratti, S Pujar, G Kaiser, S Jana, B Ray
arXiv preprint arXiv:2310.14053, 2023
5*2023
Automated code generation for information technology tasks in yaml through large language models
S Pujar, L Buratti, X Guo, N Dupuis, B Lewis, S Suneja, A Sood, ...
2023 60th ACM/IEEE Design Automation Conference (DAC), 1-4, 2023
52023
Can Large Language Models Identify And Reason About Security Vulnerabilities? Not Yet
S Ullah, M Han, S Pujar, H Pearce, A Coskun, G Stringhini
arXiv preprint arXiv:2312.12575, 2023
42023
CONCORD: Clone-Aware Contrastive Learning for Source Code
Y Ding, S Chakraborty, L Buratti, S Pujar, A Morari, G Kaiser, B Ray
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software …, 2023
32023
Building pre-trained contextual embeddings for programming languages using specialized vocabulary
S Pujar, L Buratti, A Morari, JA Laredo, AM Gliozzo, G Rossiello
US Patent 11,429,352, 2022
32022
System and method to share and utilize healthcare data
A Malvankar, S Pujar, EA Epstein, L Degenaro, B Lewis
US Patent 11,250,937, 2022
32022
Vulnerability analysis using contextual embeddings
S Pujar, L Buratti, A Morari, JA Laredo, AM Gliozzo, G Rossiello
US Patent App. 16/917,962, 2022
32022
Varangian: a git bot for augmented static analysis
S Pujar, Y Zheng, L Buratti, B Lewis, A Morari, J Laredo, K Postlethwait, ...
Proceedings of the 19th International Conference on Mining Software …, 2022
22022
Learning Transfers over Several Programming Languages
R Baltaji, S Pujar, L Mandel, M Hirzel, L Buratti, L Varshney
arXiv preprint arXiv:2310.16937, 2023
12023
Distributed QA System
S Pujar, B Priyaa, K Sethia
Research Report, New York University, USA, 2015
12015
Analyzing source code vulnerabilities in the D2A dataset with ML ensembles and C-BERT
S Pujar, Y Zheng, L Buratti, B Lewis, Y Chen, J Laredo, A Morari, ...
Empirical Software Engineering 29 (2), 48, 2024
2024
Ansible Lightspeed: A Code Generation Service for IT Automation
P Sahoo, S Pujar, G Nalawade, R Gebhardt, L Mandel, L Buratti
arXiv preprint arXiv:2402.17442, 2024
2024
Contextual embeddings for improving static analyzer output
S Pujar, L Buratti, A Morari, JA Laredo, MA Bornea, JS McCarley, Y Zheng
US Patent 11,765,193, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–18