Follow
Pierre Sermanet
Pierre Sermanet
Research Scientist, Google
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Going deeper with convolutions
C Szegedy, W Liu, Y Jia, P Sermanet, S Reed, D Anguelov, D Erhan, ...
[CVPR 2015] Computer Vision and Pattern Recognition, 2015
636352015
Overfeat: Integrated recognition, localization and detection using convolutional networks
P Sermanet, D Eigen, X Zhang, M Mathieu, R Fergus, Y LeCun
[ICLR 2014] International Conference on Learning Representations, 16, 2013
78862013
Do as i can, not as i say: Grounding language in robotic affordances
M Ahn, A Brohan, N Brown, Y Chebotar, O Cortes, B David, C Finn, C Fu, ...
arXiv preprint arXiv:2204.01691, 2022
1641*2022
Palm-e: An embodied multimodal language model
D Driess, F Xia, MSM Sajjadi, C Lynch, A Chowdhery, B Ichter, A Wahid, ...
arXiv preprint arXiv:2303.03378, 2023
13262023
Pedestrian detection with unsupervised multi-stage feature learning
P Sermanet, K Kavukcuoglu, S Chintala, Y LeCun
Computer Vision and Pattern Recognition (CVPR 2013), 3626-3633, 2013
11162013
Traffic sign recognition with multi-scale convolutional networks
P Sermanet, Y LeCun
The 2011 international joint conference on neural networks, 2809-2813, 2011
10472011
Time-contrastive networks: Self-supervised learning from video
P Sermanet, C Lynch, Y Chebotar, J Hsu, E Jang, S Schaal, S Levine, ...
2018 IEEE international conference on robotics and automation (ICRA), 1134-1141, 2018
10322018
Convolutional Neural Networks Applied to House Numbers Digit Classification
P Sermanet, S Chintala, Y LeCun
21st International Conference on Pattern Recognition (ICPR 2012), 3288-3291, 2012
7762012
Inner monologue: Embodied reasoning through planning with language models
W Huang, F Xia, T Xiao, H Chan, J Liang, P Florence, A Zeng, J Tompson, ...
arXiv preprint arXiv:2207.05608, 2022
7572022
Learning convolutional feature hierarchies for visual recognition
K Kavukcuoglu, P Sermanet, YL Boureau, K Gregor, M Mathieu, Y Cun
Advances in neural information processing systems 23, 2010
7512010
Rt-2: Vision-language-action models transfer web knowledge to robotic control
A Brohan, N Brown, J Carbajal, Y Chebotar, X Chen, K Choromanski, ...
arXiv preprint arXiv:2307.15818, 2023
5842023
Learning long‐range vision for autonomous off‐road driving
R Hadsell, P Sermanet, J Ben, A Erkan, M Scoffier, K Kavukcuoglu, ...
Journal of Field Robotics 26 (2), 120-144, 2009
4862009
With a little help from my friends: Nearest-neighbor contrastive learning of visual representations
D Dwibedi, Y Aytar, J Tompson, P Sermanet, A Zisserman
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
4842021
Learning latent plans from play
C Lynch, M Khansari, T Xiao, V Kumar, J Tompson, S Levine, P Sermanet
Conference on robot learning, 1113-1132, 2020
4122020
Temporal cycle-consistency learning
D Dwibedi, Y Aytar, J Tompson, P Sermanet, A Zisserman
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
3292019
Language conditioned imitation learning over unstructured data
C Lynch, P Sermanet
Robotics: Science and Systems 2021, http://www.roboticsproceedings.org/rss17, 2020
301*2020
Open x-embodiment: Robotic learning datasets and rt-x models
A O'Neill, A Rehman, A Gupta, A Maddukuri, A Gupta, A Padalkar, A Lee, ...
arXiv preprint arXiv:2310.08864, 2023
272*2023
Attention for fine-grained categorization
P Sermanet, A Frome, E Real
[ICLR 2015] International Conference on Learning Representations Workshop, 2014
2092014
Unsupervised Perceptual Rewards for Imitation Learning
P Sermanet, K Xu, S Levine
[RSS 2017] Robotics: Science and Systems + Deep Learning for Action and …, 2016
1852016
Rt-2: Vision-language-action models transfer web knowledge to robotic control
B Zitkovich, T Yu, S Xu, P Xu, T Xiao, F Xia, J Wu, P Wohlhart, S Welker, ...
Conference on Robot Learning, 2165-2183, 2023
1392023
The system can't perform the operation now. Try again later.
Articles 1–20