Fleetrec: Large-scale recommendation inference on hybrid gpu-fpga clusters W Jiang, Z He, S Zhang, K Zeng, L Feng, J Zhang, T Liu, Y Li, J Zhou, ... Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021 | 33 | 2021 |
MicroRec: efficient recommendation inference by hardware and data structure solutions W Jiang, Z He, S Zhang, TB Preußer, K Zeng, L Feng, J Zhang, T Liu, Y Li, ... Proceedings of Machine Learning and Systems 3, 845-859, 2021 | 29 | 2021 |
Microrec: accelerating deep recommendation systems to microseconds by hardware and data structure solutions W Jiang, Z He, S Zhang, TB Preußer, K Zeng, L Feng, J Zhang, T Liu, Y Li, ... arXiv preprint arXiv:2010.05894, 2020 | 5 | 2020 |
AtRec: Accelerating Recommendation Model Training on CPUs S Wang, T Feng, H Yang, X You, B Chen, T Liu, Z Luan, D Qian IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |
Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding S Wang, H Yang, X Wang, T Liu, P Wang, X Liang, K Ma, T Feng, X You, ... arXiv preprint arXiv:2402.15678, 2024 | | 2024 |
PRmalloc: Leveraging Predictability for Deep Learning Memory Allocation W Xiao, S Ren, T Liu, Y Li | | 2019 |