Xiang Ni
Xiang Ni
Research Staff Member, IBM Thomas J. Watson Research Center
Verified email at ibm.com
Title
Cited by
Cited by
Year
Parallel programming with migratable objects: Charm++ in practice
B Acun, A Gupta, N Jain, A Langer, H Menon, E Mikida, X Ni, M Robson, ...
SC'14: Proceedings of the International Conference for High Performance …, 2014
1662014
A scalable double in-memory checkpoint and restart scheme towards exascale
G Zheng, X Ni, LV Kalé
IEEE/IFIP International Conference on Dependable Systems and Networks …, 2012
1332012
ACR: Automatic checkpoint/restart for soft and hard error protection
X Ni, E Meneses, N Jain, LV Kalé
Proceedings of the International Conference on High Performance Computing …, 2013
1032013
Maximizing throughput on a dragonfly network
N Jain, A Bhatele, X Ni, NJ Wright, LV Kale
SC'14: Proceedings of the International Conference for High Performance …, 2014
742014
Hiding checkpoint overhead in HPC applications with a semi-blocking algorithm
X Ni, E Meneses, LV Kalé
2012 IEEE International Conference on Cluster Computing, 364-372, 2012
532012
Using migratable objects to enhance fault tolerance schemes in supercomputers
E Meneses, X Ni, G Zheng, CL Mendes, LV Kale
IEEE transactions on parallel and distributed systems 26 (7), 2061-2074, 2014
362014
Migratable objects+ active messages+ adaptive runtime= productivity+ performance a submission to 2012 HPC class II challenge
L Kale, A Arya, N Jain, A Langer, J Lifflander, H Menon, X Ni, Y Sun, ...
Parallel Programming Laboratory, Tech. Rep, 12-47, 2012
312012
A message-logging protocol for multicore systems
E Meneses, X Ni, LV Kalé
IEEE/IFIP International Conference on Dependable Systems and Networks …, 2012
242012
Analyzing the interplay of failures and workload on a leadership-class supercomputer
E Meneses, X Ni, T Jones, D Maxwell
computing 2 (3), 4, 2015
192015
Partitioning low-diameter networks to eliminate inter-job interference
N Jain, A Bhatele, X Ni, T Gamblin, LV Kale
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
172017
FlipBack: automatic targeted protection against silent data corruption
X Ni, LV Kale
SC'16: Proceedings of the International Conference for High Performance …, 2016
152016
Lossy compression for checkpointing: Fallible or feasible?
X Ni, T Islam, K Mohror, A Moody, LV Kale
Proceedings of the International Conference For High Performance Computing …, 2014
112014
Mitigation of failures in high performance computing via runtime techniques
X Ni
University of Illinois at Urbana-Champaign, 2016
92016
A memory heterogeneity-aware runtime system for bandwidth-sensitive HPC applications
K Chandrasekar, X Ni, LV Kale
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
82017
Scalable asynchronous contact mechanics using Charm++
X Ni, LV Kale, R Tamstorf
2015 IEEE International Parallel and Distributed Processing Symposium, 677-686, 2015
82015
Design and analysis of a message logging protocol for fault tolerant multicore systems
E Meneses, X Ni, LV Kalé
Parallel Programming Laboratory, Department of Computer Science, University …, 2011
72011
Runtime Techniques for Programming with Fast and Slow Memory
X Ni, N Jain, K Chandrasekar, LV Kale
2017 IEEE International Conference on Cluster Computing (CLUSTER), 147-151, 2017
22017
Automated multidimensional elasticity for streaming application runtimes
X Ni, S Schneider, KL Wu
US Patent App. 16/426,644, 2020
2020
Adaptive locking in elastic threading systems
XR Guérin, S Schneider, X Ni
US Patent 10,831,500, 2020
2020
Generalizable Resource Allocation in Stream Processing via Deep Reinforcement Learning
X Ni, J Li, M Yu, W Zhou, KL Wu
Proceedings of the AAAI Conference on Artificial Intelligence 34 (01), 857-864, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–20