All are worth words: A vit backbone for diffusion models F Bao, S Nie, K Xue, Y Cao, C Li, H Su, J Zhu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 84 | 2023 |
One transformer fits all distributions in multi-modal diffusion at scale F Bao, S Nie, K Xue, C Li, S Pu, Y Wang, G Yue, Y Cao, H Su, J Zhu International Conference on Machine Learning, 1692-1717, 2023 | 67 | 2023 |