Big Transfer (BiT): General visual representation learning A Kolesnikov, L Beyer, X Zhai, J Puigcerver, J Yung, S Gelly, N Houlsby Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 1430 | 2020 |
PaLI: A jointly-scaled multilingual language-image model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... arXiv preprint arXiv:2209.06794, 2022 | 571 | 2022 |
Scaling vision with sparse mixture of experts C Riquelme, J Puigcerver, B Mustafa, M Neumann, R Jenatton, ... Advances in Neural Information Processing Systems 34, 8583-8595, 2021 | 489 | 2021 |
Scaling vision transformers to 22 billion parameters M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ... International Conference on Machine Learning, 7480-7512, 2023 | 429 | 2023 |
A large-scale study of representation learning with the visual task adaptation benchmark X Zhai, J Puigcerver, A Kolesnikov, P Ruyssen, C Riquelme, M Lucic, ... arXiv preprint arXiv:1910.04867, 2019 | 415* | 2019 |
Are multidimensional recurrent layers really necessary for handwritten text recognition? J Puigcerver 2017 14th IAPR international conference on document analysis and recognition …, 2017 | 364 | 2017 |
Transforming scholarship in the archives through handwritten text recognition: Transkribus as a case study G Muehlberger, L Seaward, M Terras, SA Oliveira, V Bosch, M Bryan, ... Journal of Documentation 75 (5), 954-976, 2019 | 186 | 2019 |
On robustness and transferability of convolutional neural networks J Djolonga, J Yung, M Tschannen, R Romijnders, L Beyer, A Kolesnikov, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 151 | 2021 |
Multimodal contrastive learning with LIMoE: the language-image mixture of experts B Mustafa, C Riquelme, J Puigcerver, R Jenatton, N Houlsby Advances in Neural Information Processing Systems 35, 9564-9576, 2022 | 148 | 2022 |
From sparse to soft mixtures of experts J Puigcerver, C Riquelme, B Mustafa, N Houlsby arXiv preprint arXiv:2308.00951, 2023 | 83 | 2023 |
Sparse upcycling: Training mixture-of-experts from dense checkpoints A Komatsuzaki, J Puigcerver, J Lee-Thorp, CR Ruiz, B Mustafa, J Ainslie, ... arXiv preprint arXiv:2212.05055, 2022 | 78 | 2022 |
Scalable transfer learning with expert models J Puigcerver, C Riquelme, B Mustafa, C Renggli, AS Pinto, S Gelly, ... arXiv preprint arXiv:2009.13239, 2020 | 64 | 2020 |
Preparatory KWS experiments for large-scale indexing of a vast medieval manuscript collection in the HIMANIS project T Bluche, S Hamel, C Kermorvant, J Puigcerver, D Stutzmann, AH Toselli, ... 2017 14th IAPR international conference on document analysis and recognition …, 2017 | 64 | 2017 |
ICFHR2016 handwritten keyword spotting competition (H-KWS 2016) I Pratikakis, K Zagoris, B Gatos, J Puigcerver, AH Toselli, E Vidal 2016 15th International Conference on Frontiers in Handwriting Recognition …, 2016 | 62 | 2016 |
Learning to merge tokens in vision transformers C Renggli, AS Pinto, N Houlsby, B Mustafa, J Puigcerver, C Riquelme arXiv preprint arXiv:2202.12015, 2022 | 57 | 2022 |
ICDAR2015 competition on keyword spotting for handwritten documents J Puigcerver, AH Toselli, E Vidal 2015 13th International Conference on Document Analysis and Recognition …, 2015 | 52 | 2015 |
Patch n’ Pack: NaViT, a vision transformer for any aspect ratio and resolution M Dehghani, B Mustafa, J Djolonga, J Heek, M Minderer, M Caron, ... Advances in Neural Information Processing Systems 36, 2024 | 50 | 2024 |
A probabilistic formulation of keyword spotting J Puigcerver PhD thesis, 2018 | 47 | 2018 |
Probabilistic indexing and search for information extraction on handwritten German parish records E Lang, J Puigcerver, AH Toselli, E Vidal 2018 16th International Conference on Frontiers in Handwriting Recognition …, 2018 | 40 | 2018 |
PaliGemma: A versatile 3B VLM for transfer L Beyer, A Steiner, AS Pinto, A Kolesnikov, X Wang, D Salz, M Neumann, ... arXiv preprint arXiv:2407.07726, 2024 | 38 | 2024 |