ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models J Choi, S Kim, Y Jeong, Y Gwon, S Yoon ICCV 2021 (arXiv preprint arXiv:2108.02938), 2021 | 450* | 2021 |
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search J Kim, S Kim, J Kong, S Yoon Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020 | 408 | 2020 |
FloWaveNet: A generative flow for raw audio S Kim, S Lee, J Song, J Kim, S Yoon Proceedings of the International Conference on Machine Learning (ICML), 2018 | 196 | 2018 |
Perception Prioritized Training of Diffusion Models J Choi, J Lee, C Shin, S Kim, H Kim, S Yoon CVPR 2022 (arXiv preprint arXiv:2204.00227), 2022 | 125 | 2022 |
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance H Kim, S Kim, S Yoon Proceedings of the International Conference on Machine Learning (ICML), 2021 | 61 | 2021 |
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data S Kim, H Kim, S Yoon arXiv preprint arXiv:2205.15370, 2022 | 34 | 2022 |
AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate J Song, S Kim, S Yoon EMNLP 2021 (arXiv preprint arXiv:2109.06481), 2021 | 31 | 2021 |
FICGAN: Facial Identity Controllable GAN for De-identification Y Jeong, J Choi, S Kim, Y Ro, TH Oh, D Kim, H Ha, S Yoon arXiv preprint arXiv:2110.00740, 2021 | 12 | 2021 |
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity S Lee, S Kim, S Yoon Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020 | 12 | 2020 |
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data H Kim, S Kim, J Yeom, S Yoon InterSpeech 2023, 2023 | 7 | 2023 |
P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting S Kim, K Shih, JF Santos, E Bakhturina, M Desta, R Valle, S Yoon, ... Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
Scaling NVIDIA's multi-speaker multi-lingual TTS systems with voice cloning to Indic Languages A Arora, R Badlani, S Kim, R Valle, B Catanzaro arXiv preprint arXiv:2401.13851, 2024 | | 2024 |