High-resolution image synthesis with latent diffusion models R Rombach, A Blattmann, D Lorenz, P Esser, B Ommer Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 10470 | 2022 |
Taming transformers for high-resolution image synthesis P Esser, R Rombach, B Ommer Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 2188 | 2021 |
SDXL: Improving latent diffusion models for high-resolution image synthesis D Podell, Z English, K Lacey, A Blattmann, T Dockhorn, J Müller, J Penna, ... arXiv preprint arXiv:2307.01952, 2023 | 811 | 2023 |
Align your latents: High-resolution video synthesis with latent diffusion models A Blattmann, R Rombach, H Ling, T Dockhorn, SW Kim, S Fidler, K Kreis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 567 | 2023 |
Stable video diffusion: Scaling latent video diffusion models to large datasets A Blattmann, T Dockhorn, S Kulal, D Mendelevitch, M Kilian, D Lorenz, ... arXiv preprint arXiv:2311.15127, 2023 | 298 | 2023 |
On distillation of guided diffusion models C Meng, R Rombach, R Gao, D Kingma, S Ermon, J Ho, T Salimans Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 284 | 2023 |
High-resolution image synthesis with latent diffusion models. 2022 IEEE R Rombach, A Blattmann, D Lorenz, P Esser, B Ommer CVF Conference on Computer Vision and Pattern Recognition (CVPR) 1, 2021 | 188 | 2021 |
ImageBART: Bidirectional context with multinomial diffusion for autoregressive image synthesis P Esser, R Rombach, A Blattmann, B Ommer Advances in neural information processing systems 34, 3518-3532, 2021 | 144 | 2021 |
Adversarial diffusion distillation A Sauer, D Lorenz, A Blattmann, R Rombach arXiv preprint arXiv:2311.17042, 2023 | 130 | 2023 |
Scaling rectified flow transformers for high-resolution image synthesis P Esser, S Kulal, A Blattmann, R Entezari, J Müller, H Saini, Y Levi, ... Forty-first International Conference on Machine Learning, 2024 | 124 | 2024 |
Retrieval-augmented diffusion models A Blattmann, R Rombach, K Oktay, J Müller, B Ommer Advances in Neural Information Processing Systems 35, 15309-15324, 2022 | 112 | 2022 |
Geometry-free view synthesis: Transformers and no 3D priors R Rombach, P Esser, B Ommer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 85 | 2021 |
A disentangling invertible interpretation network for explaining latent representations P Esser, R Rombach, B Ommer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 79 | 2020 |
High-resolution image synthesis with latent diffusion models, 2021 R Rombach, A Blattmann, D Lorenz, P Esser, B Ommer | 70 | 2021 |
Text-guided synthesis of artistic images with retrieval-augmented diffusion models R Rombach, A Blattmann, B Ommer arXiv preprint arXiv:2207.13038, 2022 | 62 | 2022 |
Stochastic image-to-video synthesis using cINNs M Dorkenwald, T Milbich, A Blattmann, R Rombach, KG Derpanis, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 54 | 2021 |
Network-to-network translation with conditional invertible neural networks R Rombach, P Esser, B Ommer Advances in Neural Information Processing Systems 33, 2784-2797, 2020 | 48 | 2020 |
High-resolution image synthesis with latent diffusion models. arXiv 2021 R Rombach, A Blattmann, D Lorenz, P Esser, B Ommer arXiv preprint arXiv:2112.10752, 2021 | 46 | 2021 |
NeuralField-LDM: Scene generation with hierarchical latent diffusion models SW Kim, B Brown, K Yin, K Kreis, K Schwarz, D Li, R Rombach, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 45 | 2023 |
SV3D: Novel multi-view synthesis and 3D generation from a single image using latent video diffusion V Voleti, CH Yao, M Boss, A Letts, D Pankratz, D Tochilkin, C Laforte, ... arXiv preprint arXiv:2403.12008, 2024 | 37 | 2024 |