Xiang Ni
Xiang Ni
Research Staff Member, IBM Thomas J. Watson Research Center
Verified email at ibm.com
Title
Cited by
Cited by
Year
Parallel programming with migratable objects: Charm++ in practice
B Acun, A Gupta, N Jain, A Langer, H Menon, E Mikida, X Ni, M Robson, ...
SC'14: Proceedings of the International Conference for High Performance …, 2014
1512014
A scalable double in-memory checkpoint and restart scheme towards exascale
G Zheng, X Ni, LV Kalé
IEEE/IFIP International Conference on Dependable Systems and Networks …, 2012
1242012
ACR: Automatic checkpoint/restart for soft and hard error protection
X Ni, E Meneses, N Jain, LV Kalé
Proceedings of the International Conference on High Performance Computing …, 2013
922013
Maximizing throughput on a dragonfly network
N Jain, A Bhatele, X Ni, NJ Wright, LV Kale
SC'14: Proceedings of the International Conference for High Performance …, 2014
652014
Hiding checkpoint overhead in HPC applications with a semi-blocking algorithm
X Ni, E Meneses, LV Kalé
2012 IEEE International Conference on Cluster Computing, 364-372, 2012
492012
Using migratable objects to enhance fault tolerance schemes in supercomputers
E Meneses, X Ni, G Zheng, CL Mendes, LV Kale
IEEE transactions on parallel and distributed systems 26 (7), 2061-2074, 2014
332014
Migratable objects+ active messages+ adaptive runtime= productivity+ performance a submission to 2012 HPC class II challenge
L Kale, A Arya, N Jain, A Langer, J Lifflander, H Menon, X Ni, Y Sun, ...
Parallel Programming Laboratory, Tech. Rep, 12-47, 2012
302012
A message-logging protocol for multicore systems
E Meneses, X Ni, LV Kalé
IEEE/IFIP International Conference on Dependable Systems and Networks …, 2012
232012
Analyzing the interplay of failures and workload on a leadership-class supercomputer
E Meneses, X Ni, T Jones, D Maxwell
computing 2 (3), 4, 2015
172015
Partitioning low-diameter networks to eliminate inter-job interference
N Jain, A Bhatele, X Ni, T Gamblin, LV Kale
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
152017
FlipBack: automatic targeted protection against silent data corruption
X Ni, LV Kale
SC'16: Proceedings of the International Conference for High Performance …, 2016
122016
Lossy compression for checkpointing: Fallible or feasible?
X Ni, T Islam, K Mohror, A Moody, LV Kale
International Conference for High Performance Computing, Networking, Storage …, 2014
102014
A memory heterogeneity-aware runtime system for bandwidth-sensitive HPC applications
K Chandrasekar, X Ni, LV Kale
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
82017
Mitigation of failures in high performance computing via runtime techniques
X Ni
University of Illinois at Urbana-Champaign, 2016
82016
Scalable asynchronous contact mechanics using Charm++
X Ni, LV Kale, R Tamstorf
2015 IEEE International Parallel and Distributed Processing Symposium, 677-686, 2015
72015
Design and analysis of a message logging protocol for fault tolerant multicore systems
E Meneses, X Ni, LV Kale
Parallel Programming Laboratory, Department of Computer Science, University …, 2011
72011
Runtime Techniques for Programming with Fast and Slow Memory
X Ni, N Jain, K Chandrasekar, LV Kale
2017 IEEE International Conference on Cluster Computing (CLUSTER), 147-151, 2017
12017
Generalizable Resource Allocation in Stream Processing via Deep Reinforcement Learning
X Ni, J Li, M Yu, W Zhou, KL Wu
Proceedings of the AAAI Conference on Artificial Intelligence 34 (01), 857-864, 2020
2020
Adaptive locking in elastic threading systems
XR Guérin, S Schneider, X Ni
US Patent App. 16/004,412, 2019
2019
Automating Multi-level Performance Elastic Components for IBM Streams
X Ni, S Schneider, R Pavuluri, J Kaus, KL Wu
Proceedings of the 20th International Middleware Conference, 163-175, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–20