AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 243 | 2020 |
FPGAs in the network and novel communicator support accelerate MPI collectives P Haghi, A Guo, Q Xiong, R Patel, C Yang, T Geng, JT Broaddus, ... 2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-10, 2020 | 24 | 2020 |
FP-AMG: FPGA-based acceleration framework for algebraic multigrid solvers P Haghi, T Geng, A Guo, T Wang, M Herbordt 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 21 | 2020 |
Accelerating MPI collectives with FPGAs in the network and novel communicator support Q Xiong, C Yang, P Haghi, A Skjellum, M Herbordt 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 18 | 2020 |
A survey: Handling irregularities in neural network acceleration with fpgas T Geng, C Wu, C Tan, C Xie, A Guo, P Haghi, SY He, J Li, M Herbordt, ... 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 14 | 2021 |
Workload imbalance in hpc applications: Effect on performance of in-network processing P Haghi, A Guo, T Geng, A Skjellum, MC Herbordt 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 13 | 2021 |
A reconfigurable compute-in-the-network fpga assistant for high-level collective support with distributed matrix multiply case study P Haghi, A Guo, T Geng, J Broaddus, D Schafer, A Skjellum, M Herbordt 2020 International Conference on Field-Programmable Technology (ICFPT), 159-164, 2020 | 13 | 2020 |
Reconfigurable switches for high performance and flexible MPI collectives P Haghi, A Guo, Q Xiong, C Yang, T Geng, JT Broaddus, R Marshall, ... Concurrency and Computation: Practice and Experience 34 (6), e6769, 2022 | 12 | 2022 |
A framework for neural network inference on fpga-centric smartnics A Guo, T Geng, Y Zhang, P Haghi, C Wu, C Tan, Y Lin, A Li, M Herbordt 2022 32nd International Conference on Field-Programmable Logic and …, 2022 | 10 | 2022 |
Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si, D Tao, A Li, M Herbordt, ... Proceedings of the 37th International Conference on Supercomputing, 336-347, 2023 | 8 | 2023 |
Flash: FPGA-accelerated smart switches with GCN case study P Haghi, W Krska, C Tan, T Geng, PH Chen, C Greenwood, A Guo, ... Proceedings of the 37th International Conference on Supercomputing, 450-462, 2023 | 6 | 2023 |
Distributed hardware accelerated secure joint computation on the copa framework R Patel, P Haghi, S Jain, A Kot, V Krishnan, M Varia, M Herbord 2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2022 | 5 | 2022 |
& Geng, T.(2023, June). Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si Proceedings of the 37th International Conference on Supercomputing, 336-347, 0 | 5 | |
FASDA: An FPGA-Aided, Scalable and Distributed Accelerator for Range-Limited Molecular Dynamics C Wu, T Geng, A Guo, S Bandara, P Haghi, C Liu, A Li, M Herbordt Proceedings of the International Conference for High Performance Computing …, 2023 | 4 | 2023 |
Optimized Mappings for Symmetric Range-Limited Molecular Force Calculations on FPGAs C Wu, S Bandara, T Geng, A Guo, P Haghi, V Sachdeva, W Sherman, ... 2022 32nd International Conference on Field-Programmable Logic and …, 2022 | 4 | 2022 |
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks A Guo, T Geng, Y Zhang, P Haghi, C Wu, C Tan, Y Lin, A Li, M Herbordt 2022 IEEE 30th Annual International Symposium on Field-Programmable Custom …, 2022 | 4 | 2022 |
O⁴-DNN: A Hybrid DSP-LUT-Based Processing Unit With Operation Packing and Out-of-Order Execution for Efficient Realization of Convolutional Neural Networks on FPGA Devices P Haghi, M Kamal, A Afzali-Kusha, M Pedram IEEE Transactions on Circuits and Systems I: Regular Papers 67 (9), 3056-3069, 2020 | 4 | 2020 |
Copa use case: Distributed secure joint computation R Patel, P Haghi, S Jain, A Kot, V Krishnan, M Varia, M Herbordt 2022 IEEE 30th Annual International Symposium on Field-Programmable Custom …, 2022 | 3 | 2022 |
The Viability of Using Online Prediction to Perform Extra Work while Executing BSP Applications PH Chen, P Haghi, JY Chung, T Geng, R West, A Skjellum, MC Herbordt 2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2022 | 2 | 2022 |
A Survey of Potential MPI Complex Collectives: Large-Scale Mining and Analysis of HPC Applications P Haghi, R Marshall, PH Chen, A Skjellum, M Herbordt arXiv preprint arXiv:2305.19946, 2023 | 1 | 2023 |