VATT: Transformers for multimodal self-supervised learning from raw video, audio and text H Akbari, L Yuan, R Qian, WH Chuang, SF Chang, Y Cui, B Gong Advances in Neural Information Processing Systems 34, 24206-24221, 2021 | 679 | 2021 |
Unsupervised event-based learning of optical flow, depth, and egomotion AZ Zhu, L Yuan, K Chaney, K Daniilidis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 597 | 2019 |
EV-FlowNet: Self-supervised optical flow estimation for event-based cameras AZ Zhu, L Yuan, K Chaney, K Daniilidis Robotics: Science and Systems (RSS), 2018 | 517 | 2018 |
MoViNets: Mobile video networks for efficient video recognition D Kondratyuk, L Yuan, Y Li, L Zhang, M Tan, M Brown, B Gong Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 293 | 2021 |
Surrogate gap minimization improves sharpness-aware training J Zhuang, B Gong, L Yuan, Y Cui, H Adam, N Dvornek, S Tatikonda, ... International Conference on Learning Representations (ICLR), 2022 | 164 | 2022 |
Unsupervised event-based optical flow using motion compensation AZ Zhu, L Yuan, K Chaney, K Daniilidis Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018 | 92 | 2018 |
DeepLab2: A TensorFlow library for deep labeling M Weber, H Wang, S Qiao, J Xie, MD Collins, Y Zhu, L Yuan, D Kim, Q Yu, ... arXiv preprint arXiv:2106.09748, 2021 | 65 | 2021 |
Human gaze-driven spatial tasking of an autonomous MAV L Yuan, C Reardon, G Warnell, G Loianno IEEE Robotics and Automation Letters 4 (2), 1343-1350, 2019 | 51 | 2019 |
Learning view-disentangled human pose representation by contrastive cross-view mutual information maximization L Zhao, Y Wang, J Zhao, L Yuan, JJ Sun, F Schroff, H Adam, X Peng, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 37 | 2021 |
Zoom-in-to-check: Boosting video interpolation via instance-level discrimination L Yuan, Y Chen, H Liu, T Kong, J Shi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 33 | 2019 |
Contextualized spatio-temporal contrastive learning with self-supervision L Yuan, R Qian, Y Cui, B Gong, F Schroff, MH Yang, H Adam, T Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 26 | 2022 |
VideoPrism: A foundational visual encoder for video understanding L Zhao, NB Gundavarapu, L Yuan, H Zhou, S Yan, JJ Sun, L Friedman, ... International Conference on Machine Learning (ICML), 2024 | 25 | 2024 |
View-invariant, occlusion-robust probabilistic embedding for human pose T Liu, JJ Sun, L Zhao, J Zhao, L Yuan, Y Wang, LC Chen, F Schroff, ... International Journal of Computer Vision 130 (1), 111-135, 2022 | 20 | 2022 |
VideoGLUE: Video general understanding evaluation of foundation models L Yuan, NB Gundavarapu, L Zhao, H Zhou, Y Cui, L Jiang, X Yang, M Jia, ... Transactions on Machine Learning Research (TMLR), 2024 | 14 | 2024 |
Unified visual relationship detection with vision and language models L Zhao, L Yuan, B Gong, Y Cui, F Schroff, MH Yang, H Adam, T Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 14 | 2023 |
Distilling vision-language models on millions of videos Y Zhao, L Zhao, X Zhou, J Wu, CT Chu, H Miao, F Schroff, H Adam, T Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 13 | 2024 |
On Temporal Granularity in Self-Supervised Video Representation Learning R Qian, Y Li, L Yuan, B Gong, T Liu, M Brown, SJ Belongie, MH Yang, ... BMVC, 541, 2022 | 12* | 2022 |
PolyMax: General dense prediction with mask transformer X Yang, L Yuan, K Wilber, A Sharma, X Gu, S Qiao, S Debats, H Wang, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 11 | 2024 |
Learning from semantic alignment between unpaired multiviews for egocentric video recognition Q Wang, L Zhao, L Yuan, T Liu, X Peng Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 9 | 2023 |
Spatiotemporally discriminative video-language pre-training with text grounding Y Xiong, L Zhao, B Gong, MH Yang, F Schroff, T Liu, CJ Hsieh, L Yuan International Conference on Learning Representations (ICLR), 2024 | 4 | 2024 |