Adaptdiffuser: Diffusion models as adaptive self-evolving planners Z Liang, Y Mu, M Ding, F Ni, M Tomizuka, P Luo arXiv preprint arXiv:2302.01877, 2023 | 38 | 2023 |
A multi-graph attributed reinforcement learning based optimization algorithm for large-scale hybrid flow shop scheduling problem F Ni, J Hao, J Lu, X Tong, M Yuan, J Duan, Y Ma, K He Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021 | 36 | 2021 |
Adaptive large neighborhood search for solving the circle bin packing problem K He, K Tole, F Ni, Y Yuan, L Liao Computers & Operations Research 127, 105140, 2021 | 35 | 2021 |
Adaptive simulated annealing with greedy search for the circle bin packing problem Y Yuan, K Tole, F Ni, K He, Z Xiong, J Liu Computers & Operations Research 144, 105826, 2022 | 24 | 2022 |
Euclid: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, J Liu, Y Chen, C Fan arXiv preprint arXiv:2210.00498, 2022 | 11 | 2022 |
Metadiffuser: Diffusion model as conditional planner for offline meta-rl F Ni, J Hao, Y Mu, Y Yuan, Y Zheng, B Wang, Z Liang International Conference on Machine Learning, 26087-26105, 2023 | 10 | 2023 |
A data-driven column generation algorithm for bin packing problem in manufacturing industry J Duan, X Tong, F Ni, Z He, L Chen, M Yuan arXiv preprint arXiv:2202.12466, 2022 | 8 | 2022 |
Domino: Decomposed mutual information optimization for generalized context in meta-reinforcement learning Y Mu, Y Zhuang, F Ni, B Wang, J Chen, J Hao, P Luo Advances in Neural Information Processing Systems 35, 27563-27575, 2022 | 5 | 2022 |
SplitNet: a reinforcement learning based sequence splitting method for the MinMax multiple travelling salesman problem H Liang, Y Ma, Z Cao, T Liu, F Ni, Z Li, J Hao Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8720-8727, 2023 | 4 | 2023 |
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Z Dong, Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, T Lv, C Fan, Z Hu arXiv preprint arXiv:2310.02054, 2023 | 3 | 2023 |
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models J Liu, Y Yuan, J Hao, F Ni, L Fu, Y Chen, Y Zheng arXiv preprint arXiv:2402.14245, 2024 | | 2024 |
DiffuserLite: Towards Real-time Diffusion Planning Z Dong, J Hao, Y Yuan, F Ni, Y Wang, P Li, Y Zheng arXiv preprint arXiv:2401.15443, 2024 | | 2024 |