Xinjian Li
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by 699, 2024
Universal phone recognition with a multilingual allophone system
X Li, S Dalmia, J Li, M Lee, P Littell, J Yao, A Anastasopoulos, ...
ICASSP 2020, 2020
Cited by 141, 2020
Adversarial music: Real world audio adversary against wake-word detection system
J Li, S Qu, X Li, J Szurley, JZ Kolter, F Metze
NeurIPS 2019, 2019
Cited by 87, 2019
Machine Listening for Heart Status Monitoring: Introducing and Benchmarking HSS–the Heart Sounds Shenzhen Corpus
BS Fengquan Dong, Kun Qian, Ren Zhao, Alice Baird, Xinjian Li, Zhenyu Dai ...
IEEE Journal of Biomedical and Health Informatics, 1-13, 2019
Cited by 51*, 2019
Reproducing whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by 33, 2023
Towards Zero-shot Learning for Automatic Phonemic Transcription
X Li, S Dalmia, DR Mortensen, J Li, AW Black, F Metze
AAAI 2020, 2020
Cited by 33, 2020
Multilingual Speech Recognition with Corpus Relatedness Sampling
X Li, S Dalmia, AW Black, F Metze
20th Annual Conference of the International Speech Communication Association …, 2019
Cited by 25, 2019
ASR2K: Speech Recognition for Around 2000 Languages without Audio
X Li, F Metze, DR Mortensen, AW Black, S Watanabe
Interspeech 2022, 2022
Cited by 23, 2022
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
X Li, Y Jia, CC Chiu
ICASSP 2023, 2022
Cited by 22, 2022
Zero-shot learning for grapheme to phoneme conversion with language ensemble
X Li, F Metze, DR Mortensen, S Watanabe, AW Black
Findings of the Association for Computational Linguistics: ACL 2022, 2106-2115, 2022
Cited by 21, 2022
Domain robust feature extraction for rapid low resource asr development
S Dalmia, X Li, F Metze, AW Black
2018 IEEE Spoken Language Technology Workshop (SLT), 258-265, 2018
Cited by 19, 2018
Multilingual phonetic dataset for low resource speech recognition
X Li, DR Mortensen, F Metze, AW Black
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 16, 2021
Acoustics based intent recognition using discovered phonetic units for low resource languages
A Gupta, X Li, SK Rallabandi, AW Black
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 15, 2021
Hierarchical Phone Recognition with Compositional Phonetics
X Li, J Li, F Metze, AW Black
Proc. Interspeech 2021, 2461-2465, 2021
Cited by 15, 2021
A summary of the first workshop on language technology for language documentation and revitalization
G Neubig, S Rijhwani, A Palmer, J MacKenzie, H Cruz, X Li, M Lee, ...
arXiv preprint arXiv:2004.13203, 2020
Cited by 15, 2020
AlloVera: a multilingual allophone database
DR Mortensen, X Li, P Littell, A Michaud, S Rijhwani, A Anastasopoulos, ...
LREC 2020, 2020
Cited by 15, 2020
Learning to speak from text: Zero-shot multilingual text-to-speech with unsupervised text pretraining
T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari
arXiv preprint arXiv:2301.12596, 2023
Cited by 12, 2023
Towards Context-Aware End-to-End Code-Switching Speech Recognition
Z Qiu, Y Li, X Li, F Metze, WM Campbell, AA AI
Interspeech 2020, 2020
Cited by 12, 2020
The ARIEL-CMU Systems for LoReHLT18
A Chaudhary, S Dalmia, J Hu, X Li, A Matthews, AO Muis, N Otani, ...
arXiv preprint arXiv:1902.08899, 2019
Cited by 10, 2019
On Prosody Modeling for ASR+TTS based Voice Conversion
WC Huang, T Hayashi, X Li, S Watanabe, T Toda
ASRU 2021, 2021
Cited by 9, 2021