Follow
Joshua Hursey
Title
Cited by
Cited by
Year
Why it’s worth the hassle: The value of in-situ studies when designing ubicomp
Y Rogers, K Connelly, L Tedesco, W Hazlewood, A Kurtz, RE Hall, ...
International conference on ubiquitous computing, 336-353, 2007
2512007
The design and implementation of checkpoint/restart process fault tolerance for Open MPI
J Hursey, JM Squyres, TI Mattox, A Lumsdaine
2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007
2502007
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
European MPI Users' Group Meeting, 193-203, 2012
1342012
Interconnect agnostic checkpoint/restart in Open MPI
J Hursey, TI Mattox, A Lumsdaine
Proceedings of the 18th ACM international symposium on High Performance …, 2009
842009
Run-through stabilization: An MPI proposal for process fault tolerance
J Hursey, RL Graham, G Bronevetsky, D Buntinas, H Pritchard, DG Solt
European MPI Users' Group Meeting, 329-332, 2011
662011
PMIx: Process management for exascale environments
RH Castain, J Hursey, A Bouteiller, D Solt
Parallel Computing 79, 9-29, 2018
612018
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
Computing 95 (12), 1171-1184, 2013
502013
A log-scaling fault tolerant agreement algorithm for a fault tolerant MPI
J Hursey, T Naughton, G Vallee, RL Graham
European MPI Users' Group Meeting, 255-263, 2011
432011
Coordinated checkpoint/restart process fault tolerance for MPI applications on HPC systems
J Hursey
Indiana University, 2010
392010
Locality-aware parallel process mapping for multi-core HPC systems
J Hursey, JM Squyres, T Dontje
2011 IEEE international conference on cluster computing, 527-531, 2011
362011
A checkpoint and restart service specification for Open MPI
J Hursey, JM Squyres, A Lumsdaine
Indiana University, Bloomington, Indiana, USA, Tech. Rep. TR635, 2006
322006
Netloc: Towards a comprehensive view of the HPC system topology
B Goglin, J Hursey, JM Squyres
2014 43rd International Conference on Parallel Processing Workshops, 216-225, 2014
272014
Building a fault tolerant MPI application: A ring communication example
J Hursey, RL Graham
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
272011
A composable runtime recovery policy framework supporting resilient HPC applications
J Hursey, A Lumsdaine
Indiana University, Bloomington, Indiana, USA, Tech. Rep. TR686, 2010
182010
Preserving collective performance across process failure for a fault tolerant MPI
J Hursey, RL Graham
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
172011
Checkpoint/restart-enabled parallel debugging
J Hursey, C January, M O’Connor, PH Hargrove, D Lecomber, ...
European MPI Users' Group Meeting, 219-228, 2010
172010
Advancing application process affinity experimentation: Open MPI's LAMA-based affinity interface
J Hursey, JM Squyres
Proceedings of the 20th European MPI Users' Group Meeting, 163-168, 2013
142013
Representing unit test data for large scale software development
JA Cottam, J Hursey, A Lumsdaine
Proceedings of the 4th ACM symposium on Software visualization, 57-66, 2008
142008
An extensible framework for distributed testing of mpi implementations
J Hursey, E Mallove, JM Squyres, A Lumsdaine
European Parallel Virtual Machine/Message Passing Interface Users’ Group …, 2007
142007
A performance analysis and optimization of PMIx-based HPC software stacks
AY Polyakov, BI Karasev, J Hursey, J Ladd, M Brinskii, E Shipunova
Proceedings of the 26th European MPI Users' Group Meeting, 1-10, 2019
102019
The system can't perform the operation now. Try again later.
Articles 1–20