Aligning Text-to-Image Models using Human Feedback K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ... arXiv preprint arXiv:2302.12192, 2023 | 266 | 2023 |
Behavior From the Void: Unsupervised Active Pre-Training H Liu, P Abbeel Advances in Neural Information Processing Systems, 2021, 2021 | 253 | 2021 |
Koala: A dialogue model for academic research X Geng*, A Gudibande*, H Liu*, E Wallace*, P Abbeel†, S Levine†, ... Blog post, April 1, 2023 | 236 | 2023 |
Reinforcement learning for fine-tuning text-to-image diffusion models Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ... Advances in Neural Information Processing Systems 36, 2024 | 207* | 2024 |
Chain of Hindsight Aligns Language Models with Feedback H Liu, C Sferrazza, P Abbeel International Conference on Learning Representations (ICLR), 2024, 2023 | 189* | 2023 |
The false promise of imitating proprietary LLMs A Gudibande, E Wallace, C Snell, X Geng, H Liu, P Abbeel, S Levine, ... The Twelfth International Conference on Learning Representations (ICLR 2024), 2023 | 182 | 2023 |
OpenLLaMA: An open reproduction of LLaMA X Geng*, H Liu* URL: https://github.com/openlm-research/open_llama, 2023 | 176 | 2023 |
RingAttention with Blockwise Transformers for Near-Infinite Context H Liu, M Zaharia, P Abbeel The Twelfth International Conference on Learning Representations (ICLR 2024), 2024 | 175 | 2024 |
URLB: Unsupervised Reinforcement Learning Benchmark M Laskin*, D Yarats*, H Liu, K Lee, A Zhan, K Lu, C Cang, L Pinto, ... arXiv preprint arXiv:2110.15191, 2021 | 172 | 2021 |
APS: Active Pretraining with Successor Features H Liu, P Abbeel International Conference on Machine Learning, 6736-6747, 2021 | 162 | 2021 |
Masked world models for visual control Y Seo, D Hafner, H Liu, F Liu, S James, K Lee, P Abbeel Conference on Robot Learning, 1332-1344, 2023 | 149 | 2023 |
World model on million-length video and language with RingAttention H Liu, W Yan, M Zaharia, P Abbeel arXiv e-prints, arXiv:2402.08268, 2024 | 136* | 2024 |
Multimodal Masked Autoencoders Learn Transferable Representations X Geng*, H Liu*, L Lee, D Schuurmans, S Levine, P Abbeel arXiv preprint arXiv:2205.14204, 2022 | 112 | 2022 |
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery M Laskin, H Liu, XB Peng, D Yarats, A Rajeswaran, P Abbeel Advances in Neural Information Processing Systems, 2022, 2022 | 106* | 2022 |
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning D Yarats*, D Brandfonbrener*, H Liu, M Laskin, P Abbeel, A Lazaric, ... arXiv preprint arXiv:2201.13425, 2022 | 106 | 2022 |
Taming MAML: Efficient unbiased meta-reinforcement learning H Liu, R Socher, C Xiong International Conference on Machine Learning (ICML), 4061-4071, 2019 | 105 | 2019 |
Action-dependent Control Variates for Policy Optimization via Stein's Identity H Liu, Y Feng, Y Mao, D Zhou, J Peng, Q Liu International Conference on Learning Representations, 2017 | 103 | 2017 |
Competitive Experience Replay H Liu, A Trott, R Socher, C Xiong International Conference on Learning Representations (ICLR) 2019, 2019 | 70 | 2019 |
Variational inference with tail-adaptive f-divergence D Wang, H Liu, Q Liu Advances in Neural Information Processing Systems 31, 2018 | 70 | 2018 |
Instruction-Following Agents with Multimodal Transformer H Liu, L Lee, K Lee, P Abbeel arXiv preprint arXiv:2210.13431, 2022 | 53* | 2022 |