GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference A Hadi Zadeh, I Edo, OM Awad, A Moshovos 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020 | 165 | 2020 |
Tensordash: Exploiting sparsity to accelerate deep neural network training M Mahmoud, I Edo, AH Zadeh, OM Awad, G Pekhimenko, J Albericio, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 81 | 2020 |
A matrix-inversion technique for FPGA-based real-time EMT simulation of power converters A Hadizadeh, M Hashemi, M Labbaf, M Parniani IEEE transactions on industrial electronics 66 (2), 1224-1234, 2018 | 48 | 2018 |
FPRaker: A processing element for accelerating neural network training OM Awad, M Mahmoud, I Edo, AH Zadeh, C Bannon, A Jayarajan, ... MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 18 | 2021 |
Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models AH Zadeh, M Mahmoud, A Abdelhadi, A Moshovos Proceedings of the 49th Annual International Symposium on Computer …, 2022 | 17 | 2022 |
Deep Learning Language Modeling Workloads: Where Time Goes on Graphics Processors A Hadi Zadeh, Z Poulos, A Moshovos 2019 IEEE International Symposium on Workload Characterization (IISWC), 131-142, 2019 | 13 | 2019 |
Parallel processor architecture with a new algorithm for simultaneous processing of mips-based series instructions A Hadizadeh, E Tanghatari Emerging Science Journal 1 (4), 226-232, 2017 | 8 | 2017 |
Schrodinger's FP Training Neural Networks with Dynamic Floating-Point Containers M Nikolic, E Torres Sanchez, J Wang, A Hadi Zadeh, M Mahmoud, ... Proceedings of Machine Learning and Systems 6, 60-73, 2024 | 2* | 2024 |
Atalanta: A Bit is Worth a “Thousand” Tensor Values AD Lascorz, M Mahmoud, AH Zadeh, M Nikolic, K Ibrahim, C Giannoula, ... Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 2 | 2024 |
Quantization for neural network computation A Moshovos, AH Zadeh, IE Vivancos, OM Awad US Patent App. 17/130,690, 2022 | 1 | 2022 |
Quantization for neural network computation A Moshovos, AH Zadeh, IE Vivancos, OM Awad US Patent App. 18/026,927, 2023 | | 2023 |
Fast and Energy-Efficient Inference for Attention-Based Natural Language Processing Models A Hadi Zadeh | | 2023 |
A Novel Algorithm for Design and Hardware Implementation of FPGA-Based Real-Time Simulator for Electrical Machines in HIL Applications A Hadizadeh, M Hashemi, M Parniani Journal of Iranian Association of Electrical and Electronics Engineers 16 …, 2019 | | 2019 |