Seguir
Sanjeev Khudanpur
Título
Citado por
Citado por
Año
Recurrent neural network based language model.
T Mikolov, M Karafiát, L Burget, J Cernocký, S Khudanpur
Interspeech 2 (3), 1045-1048, 2010
77482010
Librispeech: an asr corpus based on public domain audio books
V Panayotov, G Chen, D Povey, S Khudanpur
2015 IEEE international conference on acoustics, speech and signal …, 2015
58742015
X-vectors: Robust dnn embeddings for speaker recognition
D Snyder, D Garcia-Romero, G Sell, D Povey, S Khudanpur
2018 IEEE international conference on acoustics, speech and signal …, 2018
28832018
Extensions of recurrent neural network language model
T Mikolov, S Kombrink, L Burget, J Černocký, S Khudanpur
2011 IEEE international conference on acoustics, speech and signal …, 2011
16362011
Audio augmentation for speech recognition.
T Ko, V Peddinti, D Povey, S Khudanpur
Interspeech 2015, 3586, 2015
13102015
A time delay neural network architecture for efficient modeling of long temporal contexts.
V Peddinti, D Povey, S Khudanpur
Interspeech, 3214-3218, 2015
12782015
Deep neural network embeddings for text-independent speaker verification.
D Snyder, D Garcia-Romero, D Povey, S Khudanpur
Interspeech 2017, 999-1003, 2017
10152017
Purely sequence-trained neural networks for ASR based on lattice-free MMI.
D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ...
Interspeech, 2751-2755, 2016
9772016
A study on data augmentation of reverberant speech for robust speech recognition
T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur
2017 IEEE international conference on acoustics, speech and signal …, 2017
9432017
Semi-orthogonal low-rank matrix factorization for deep neural networks.
D Povey, G Cheng, Y Wang, K Li, H Xu, M Yarmohammadi, S Khudanpur
Interspeech, 3743-3747, 2018
5762018
Deep neural network-based speaker embeddings for end-to-end speaker verification
D Snyder, P Ghahremani, D Povey, D Garcia-Romero, Y Carmiel, ...
2016 IEEE Spoken Language Technology Workshop (SLT), 165-170, 2016
4262016
Jhu-isi gesture and skill assessment working set (jigsaws): A surgical activity dataset for human motion modeling
Y Gao, SS Vedula, CE Reiley, N Ahmidi, B Varadarajan, HC Lin, L Tao, ...
MICCAI workshop: M2cai 3 (2014), 3, 2014
4022014
Improving deep neural network acoustic models using generalized maxout networks
X Zhang, J Trmal, D Povey, S Khudanpur
2014 IEEE international conference on acoustics, speech and signal …, 2014
3982014
A pitch extraction algorithm tuned for automatic speech recognition
P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ...
2014 IEEE international conference on acoustics, speech and signal …, 2014
3812014
A smorgasbord of features for statistical machine translation
FJ Och, D Gildea, S Khudanpur, A Sarkar, K Yamada, A Fraser, S Kumar, ...
Proceedings of the Human Language Technology Conference of the North …, 2004
3642004
Developments and directions in speech recognition and understanding, Part 1 [DSP Education]
JM Baker, L Deng, J Glass, S Khudanpur, CH Lee, N Morgan, ...
IEEE Signal processing magazine 26 (3), 75-80, 2009
3582009
Highway long short-term memory rnns for distant speech recognition
Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass
2016 IEEE international conference on acoustics, speech and signal …, 2016
3532016
Speaker recognition for multi-speaker conversations using x-vectors
D Snyder, D Garcia-Romero, G Sell, A McCree, D Povey, S Khudanpur
ICASSP 2019-2019 IEEE International conference on acoustics, speech and …, 2019
3432019
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
2852020
Spoken language recognition using x-vectors.
D Snyder, D Garcia-Romero, A McCree, G Sell, D Povey, S Khudanpur
Odyssey 2018, 105-111, 2018
2582018
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20