Seguir
Sanyam Mehta
Sanyam Mehta
Senior Software Engineer, HPE
Email confirmado em hpe.com
Título
Citado por
Citado por
Ano
Real-time moving object detection algorithm on high-resolution videos using GPUs
P Kumar, A Singhal, S Mehta, A Mittal
Journal of Real-Time Image Processing 11 (1), 93-109, 2013
1142013
Tile Size Selection Revisited
S Mehta, G Beeraka, PC Yew
Transactions on Architecture and Code Optimization, 2014
422014
Revisiting Loop Fusion in the Polyhedral Framework
S Mehta, PH Lin, PC Yew
ACM SIGPLAN PPoPP, 2014
342014
Multi-Stage Coordinated Prefetching for Present-day Processors
S Mehta, Z Fang, A Zhai, PC Yew
Interational Conference on Supercomputing (ICS), 2014
312014
Improving compiler scalability: optimizing large programs at small price
S Mehta, PC Yew
ACM SIGPLAN PLDI, 143-152, 2015
262015
Measuring Micro-architectural Details of Multi- and Many-core Memory Systems Through Micro-benchmarking
Z Fang, S Mehta
Transactions on Architecture and Code Optimization, 2015
252015
TurboTiling: Leveraging Prefetching to Boost Performance of Tiled Codes
S Mehta, PC Yew
International Conference on Supercomputing (ICS), 2016
192016
Parallel Implementation of Video Surveillance Algorithms on GPU Architecture using CUDA
S Mehta, A Mishra, A Singhal, P Kumar, A Mittal, K Palaniappan
Advanced Computing and Communications, 2009
152009
A high-performance parallel implementation of sum of absolute differences algorithm for motion estimation using CUDA
S Mehta, A Misra, A Singhal, P Kumar, A Mittal
HiPC Conf 2 (4), 6, 2010
92010
Variable liberalization
S Mehta, PC Yew
ACM Transactions on Architecture and Code Optimization (TACO) 13 (3), 1-25, 2016
62016
WearCore: A core for wearable workloads
S Mehta, J Torrellas
Parallel Architecture and Compilation Techniques (PACT), 2016 International …, 2016
52016
Variable-sized blocks for locality-aware SpMV
N Namashivavam, S Mehta, PC Yew
2021 IEEE/ACM International Symposium on Code Generation and Optimization …, 2021
42021
Scalable Compiler Optimizations for Improving the Memory System Performance in Multi-and Many-core Processors
S Mehta
University of Minnesota, 2014
32014
High-bandwidth prefetcher for high-bandwidth memory
S Mehta, JR Kohn, DJ Ernst, HL Poxon, L DeRose
US Patent 9,946,654, 2018
22018
Software pre-execution for irregular memory accesses in the HBM era
S Mehta, G Elsesser, T Greyzck
Proceedings of the 31st ACM SIGPLAN International Conference on Compiler …, 2022
12022
Systems and methods for increased bandwidth utilization regarding irregular memory accesses using software pre-execution
S Mehta, GW Elsesser, TD Greyzck
US Patent 11,403,082, 2022
2022
Performance Analysis and Optimization with Little’s Law
S Mehta
2022 IEEE International Symposium on Performance Analysis of Systems and …, 2022
2022
Method and Apparatus for Front End Gather/Scatter Memory Coalescing
HW Cain III, RA Sugumar, NB Lakshminarayana, DJ Ernst, S Mehta
US Patent App. 16/944,141, 2022
2022
Method and Apparatus for Back End Gather/Scatter Memory Coalescing
HW Cain III, NB Lakshminarayana, DJ Ernst, S Mehta
US Patent App. 16/944,146, 2022
2022
Memory allocation system for multi-tier memory
HL Poxon, W Homer, DW Oehmke, L DeRose, CD Andreasen, S Mehta
US Patent 10,698,813, 2020
2020
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20