Qiyang Li
Offline reinforcement learning as one big sequence modeling problem
M Janner, Q Li, S Levine
Advances in Neural Information Processing Systems 34, 1273-1286, 2021
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) pipeline for musical timbre transfer
S Huang, Q Li, C Anil, X Bao, S Oore, RB Grosse
arXiv preprint arXiv:1811.09620, 2018
Preventing gradient attenuation in Lipschitz constrained convolutional networks
Q Li, S Haque, C Anil, J Lucas, RB Grosse, JH Jacobsen
Advances in Neural Information Processing Systems 32, 2019
Deep neural networks for improved, impromptu trajectory tracking of quadrotors
Q Li, J Qian, Z Zhu, X Bao, MK Helwa, AP Schoellig
2017 IEEE International Conference on Robotics and Automation (ICRA), 5183-5189, 2017
Efficient deep reinforcement learning requires regulating overfitting
Q Li, A Kumar, I Kostrikov, S Levine
arXiv preprint arXiv:2304.10466, 2023
Building a winning self-driving car in six months
K Burnett, A Schimpe, S Samavi, M Gridseth, CW Liu, Q Li, Z Kroeze, ...
2019 International Conference on Robotics and Automation (ICRA), 9583-9589, 2019
Understanding the complexity gains of single-task RL with a curriculum
Q Li, Y Zhai, Y Ma, S Levine
International Conference on Machine Learning, 20412-20451, 2023
Learning of coordination policies for robotic swarms
Q Li, X Du, Y Huang, Q Sykora, AP Schoellig
arXiv preprint arXiv:1709.06620, 2017
Learning Visuotactile Skills with Two Multifingered Hands
T Lin, Y Zhang, Q Li, H Qi, B Yi, S Levine, J Malik
arXiv preprint arXiv:2404.16823, 2024
REFACTOR: Learning to Extract Theorems from Proofs
JP Zhou, Y Wu, Q Li, R Grosse
arXiv preprint arXiv:2402.17032, 2024
Accelerating exploration with unlabeled prior data
Q Li, J Zhang, D Ghosh, A Zhang, S Levine
Advances in Neural Information Processing Systems 36, 2024
AdaCat: Adaptive categorical discretization for autoregressive models
Q Li, A Jain, P Abbeel
Uncertainty in Artificial Intelligence, 1188-1198, 2022
R-LAtte: Attention Module for Visual Control via Reinforcement Learning
M Zhao, Q Li, A Srinivas, I Clavera, K Lee, P Abbeel