Neural combinatorial optimization with reinforcement learning I Bello, H Pham, QV Le, M Norouzi, S Bengio arXiv preprint arXiv:1611.09940, 2016 | 1416 | 2016 |
Stand-alone self-attention in vision models P Ramachandran, N Parmar, A Vaswani, I Bello, A Levskaya, J Shlens Advances in neural information processing systems 32, 2019 | 1133* | 2019 |
Attention augmented convolutional networks I Bello, B Zoph, A Vaswani, J Shlens, QV Le Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 1059 | 2019 |
Neural optimizer search with reinforcement learning I Bello, B Zoph, V Vasudevan, QV Le International Conference on Machine Learning, 459-468, 2017 | 386 | 2017 |
Revisiting resnets: Improved training and scaling strategies I Bello, W Fedus, X Du, ED Cubuk, A Srinivas, TY Lin, J Shlens, B Zoph Advances in Neural Information Processing Systems 34, 22614-22627, 2021 | 245 | 2021 |
Lambdanetworks: Modeling long-range interactions without attention I Bello arXiv preprint arXiv:2102.08602, 2021 | 162 | 2021 |
Seq2Slate: Re-ranking and slate optimization with RNNs I Bello, S Kulkarni, S Jain, C Boutilier, E Chi, E Eban, X Luo, A Mackey, ... arXiv preprint arXiv:1810.02019, 2018 | 73 | 2018 |
Designing effective sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906 2, 2022 | 50 | 2022 |
Global self-attention networks for image recognition Z Shen, I Bello, R Vemulapalli, X Jia, CH Chen arXiv preprint arXiv:2010.03019, 2020 | 27 | 2020 |
St-moe: Designing stable and transferable sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906, 2022 | 17 | 2022 |
Backprop evolution M Alber, I Bello, B Zoph, PJ Kindermans, P Ramachandran, Q Le arXiv preprint arXiv:1808.02822, 2018 | 17 | 2018 |
Revisiting 3D ResNets for video recognition X Du, Y Li, Y Cui, R Qian, J Li, I Bello arXiv preprint arXiv:2109.01696, 2021 | 12 | 2021 |
Neural network optimizer search I Bello, B Zoph, V Vasudevan, QV Le US Patent App. 17/145,524, 2021 | 5 | 2021 |
Systems and Methods for Slate Optimization with Recurrent Neural Networks OP Meshi, I Bello, S Kulkarni, S Jain US Patent App. 16/415,854, 2019 | 3 | 2019 |
Fully attentional computer vision J Shlens, AT Vaswani, NJ Parmar, P Ramachandran, AC Levskaya, ... US Patent App. 17/606,976, 2022 | 1 | 2022 |
Neural network optimizer search I Bello, B Zoph, V Vasudevan, QV Le US Patent 10,922,611, 2021 | 1 | 2021 |
Learning Control Policies from High-Dimensional Visual Inputs I Bello, Y Tkachenko Stanford CS231N, 2015 | 1 | 2015 |
Modeling of Long-Range Interactions with Reduced Feature Materialization via Lambda Functions I Bello US Patent App. 18/011,636, 2023 | | 2023 |
Revisiting ResNets: Improved Training Methodologies and Scaling Principles I Bello, LB Fedus, X Du, ED Cubuk, A Srinivas, TY Lin, J Shlens, BR Zoph | | 2021 |
GLOBAL SELF-ATTENTION NETWORKS FOR IMAGE RECOGNITION S Zhuoran, I Bello, R Vemulapalli, X Jia, CH Chen arXiv preprint arXiv:2010.03019, 2020 | | 2020 |