Jonathan Shen
Natural TTS synthesis by conditioning WaveNet on mel spectrogram predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Transfer learning from speaker verification to multispeaker text-to-speech synthesis
Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...
Advances in neural information processing systems 31, 2018
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
arXiv preprint arXiv:1810.07217, 2018
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
SATzilla2012: Improved algorithm selection based on cost-sensitive classification models
L Xu, F Hutter, J Shen, HH Hoos, K Leyton-Brown
Proceedings of SAT Challenge, 57-58, 2012
Parallel Tacotron: Non-autoregressive and controllable TTS
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Non-Attentive Tacotron: Robust and controllable neural TTS synthesis including unsupervised duration modeling
J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu
arXiv preprint arXiv:2010.04301, 2020
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS
Y Jia, H Zen, J Shen, Y Zhang, Y Wu
arXiv preprint arXiv:2103.15060, 2021
Neural program synthesis with priority queue training
DA Abolafia, M Norouzi, J Shen, R Zhao, QV Le
arXiv preprint arXiv:1801.03526, 2018
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling
I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu
arXiv preprint arXiv:2103.14574, 2021
Synthesizing speech from text using neural networks
Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ...
US Patent 10,971,170, 2021
In teacher we trust: Learning compressed models for pedestrian detection
J Shen, N Vesdapunt, VN Boddeti, KM Kitani
arXiv preprint arXiv:1612.00478, 2016
Examining scaling and transfer of language model architectures for machine translation
B Zhang, B Ghorbani, A Bapna, Y Cheng, X Garcia, J Shen, O Firat
International Conference on Machine Learning, 26176-26192, 2022
Synthesis of speech from text in a voice of a target speaker using neural networks
Y Jia, Z Chen, Y Wu, J Shen, R Pang, RJ Weiss, IL Moreno, F Ren, ...
US Patent 11,488,575, 2022
Training text-to-speech systems from synthetic data: A practical approach for accent transfer tasks
L Finkelstein, H Zen, N Casagrande, C Chan, Y Jia, T Kenter, A Petelin, ...
arXiv preprint arXiv:2208.13183, 2022
Parallel Tacotron non-autoregressive and controllable TTS
I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun
US Patent 11,908,448, 2024
Modelling intonation in spectrograms for neural vocoder based text-to-speech
V Wan, J Shen, H Silen, R Clark
Speech Prosody 2020, 2020
Text-to-speech using duration prediction
Y Zhang, I Elias, B Chun, Y Jia, Y Wu, M Chrzanowski, J Shen
US Patent App. 17/492,543, 2022
Phonemes and graphemes for neural text-to-speech
Y Jia, B Chun, Y Zhang, J Shen, Y Wu
US Patent 12,020,685, 2024
Parallel Tacotron Non-Autoregressive and Controllable TTS
I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun
US Patent App. 18/421,116, 2024