Seguir
Mike Seltzer
Título
Citado por
Citado por
Ano
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
13172024
A study on data augmentation of reverberant speech for robust speech recognition
T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur
2017 IEEE international conference on acoustics, speech and signal …, 2017
11482017
Recent advances in deep learning for speech research at Microsoft
L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ...
2013 IEEE international conference on acoustics, speech and signal …, 2013
10652013
The Microsoft 2017 conversational speech recognition system
W Xiong, L Wu, F Alleva, J Droppo, X Huang, A Stolcke
2018 IEEE international conference on acoustics, speech and signal …, 2018
9742018
An investigation of deep neural networks for noise robust speech recognition
ML Seltzer, D Yu, Y Wang
2013 IEEE international conference on acoustics, speech and signal …, 2013
8112013
Achieving human parity in conversational speech recognition
W Xiong, J Droppo, X Huang, F Seide, M Seltzer, A Stolcke, D Yu, ...
arXiv preprint arXiv:1610.05256, 2016
7282016
Binary coding of speech spectrograms using a deep auto-encoder
L Deng, ML Seltzer, D Yu, A Acero, A Mohamed, G Hinton
Eleventh annual conference of the international speech communication association, 2010
5022010
An introduction to computational networks and the computational network toolkit
D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ...
Microsoft Technical Report MSR-TR-2014–112, 2014
4762014
Improved bottleneck features using pretrained deep neural networks
D Yu, ML Seltzer
Twelfth annual conference of the international speech communication association, 2011
3922011
Feature learning in deep neural networks-studies on speech recognition tasks
D Yu, ML Seltzer, J Li, JT Huang, F Seide
arXiv preprint arXiv:1301.3605, 2013
3242013
Multi-task learning in deep neural networks for improved phoneme recognition
ML Seltzer, J Droppo
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
2972013
Crowdmos: An approach for crowdsourcing mean opinion score studies
F Ribeiro, D Florêncio, C Zhang, M Seltzer
2011 IEEE international conference on acoustics, speech and signal …, 2011
2952011
Augmenting speech recognition with depth imaging
J Kapur, I Tashev, M Seltzer, SE Hodges
US Patent App. 13/662,293, 2014
2832014
Reconstruction of missing features for robust speech recognition
B Raj, ML Seltzer, RM Stern
Speech communication 43 (4), 275-296, 2004
2822004
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2702020
Toward human parity in conversational speech recognition
W Xiong, J Droppo, X Huang, F Seide, ML Seltzer, A Stolcke, D Yu, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (12 …, 2017
2592017
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2242016
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
ML Seltzer, B Raj, RM Stern
Speech Communication 43 (4), 379-393, 2004
2152004
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network
J Xue, J Li, D Yu, M Seltzer, Y Gong
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
2012014
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques
R Haeb-Umbach, S Watanabe, T Nakatani, M Bacchiani, B Hoffmeister, ...
IEEE Signal processing magazine 36 (6), 111-124, 2019
1932019
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20