Follow
Daniel Garcia-Romero
Daniel Garcia-Romero
Principal Applied Scientist, AWS AI
Verified email at amazon.com - Homepage
Title
Cited by
Cited by
Year
X-vectors: Robust dnn embeddings for speaker recognition
D Snyder, D Garcia-Romero, G Sell, D Povey, S Khudanpur
2018 IEEE international conference on acoustics, speech and signal …, 2018
33292018
Analysis of i-vector length normalization in speaker recognition systems.
D Garcia-Romero, CY Espy-Wilson
Interspeech 2011, 249-252, 2011
12712011
Deep neural network embeddings for text-independent speaker verification.
D Snyder, D Garcia-Romero, D Povey, S Khudanpur
Interspeech 2017, 999-1003, 2017
10972017
Deep neural network-based speaker embeddings for end-to-end speaker verification
D Snyder, P Ghahremani, D Povey, D Garcia-Romero, Y Carmiel, ...
2016 IEEE spoken language technology workshop (SLT), 165-170, 2016
4562016
Speaker recognition for multi-speaker conversations using x-vectors
D Snyder, D Garcia-Romero, G Sell, A McCree, D Povey, S Khudanpur
ICASSP 2019-2019 IEEE International conference on acoustics, speech and …, 2019
3872019
Speaker diarization using deep neural network embeddings
D Garcia-Romero, D Snyder, G Sell, D Povey, A McCree
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
3182017
Recent developments on espnet toolkit boosted by conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2992021
Spoken language recognition using x-vectors.
D Snyder, D Garcia-Romero, A McCree, G Sell, D Povey, S Khudanpur
Odyssey 2018, 105-111, 2018
2852018
Speaker diarization with PLDA i-vector scoring and unsupervised calibration
G Sell, D Garcia-Romero
2014 IEEE Spoken Language Technology Workshop (SLT), 413-417, 2014
2712014
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
G Sell, D Snyder, A McCree, D Garcia-Romero, J Villalba, M Maciejewski, ...
Interspeech, 2808-2812, 2018
2572018
Time delay deep neural network-based universal background models for speaker recognition
D Snyder, D Garcia-Romero, D Povey
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
2062015
Linear versus mel frequency cepstral coefficients for speaker recognition
X Zhou, D Garcia-Romero, R Duraiswami, C Espy-Wilson, S Shamma
2011 IEEE workshop on automatic speech recognition & understanding, 559-564, 2011
1932011
Multicondition training of Gaussian PLDA models in i-vector space for noise and reverberation robust speaker recognition
D Garcia-Romero, X Zhou, CY Espy-Wilson
2012 IEEE international conference on acoustics, speech and signal …, 2012
1602012
Supervised domain adaptation for i-vector based speaker recognition
D Garcia-Romero, A McCree
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
1562014
A comparative evaluation of fusion strategies for multimodal biometric verification
J Fiérrez-Aguilar, J Ortega-Garcia, D Garcia-Romero, ...
Audio-and Video-Based Biometric Person Authentication: 4th International …, 2003
1552003
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and speakers in the wild evaluations
J Villalba, N Chen, D Snyder, D Garcia-Romero, A McCree, G Sell, ...
Computer Speech & Language 60, 101026, 2020
1502020
Unsupervised Domain Adaptation for i-vector Speaker Recognition
D Garcia-Romero, A McCree, S Shum, N Brummer, C Vaquero
Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
1462014
The NIST 2014 Speaker Recognition i-Vector Machine Learning Challenge
CS Greenberg, D Bansé, GR Doddington, D Garcia-Romero, JJ Godfrey, ...
1322014
Automatic acquisition device identification from speech recordings
D Garcia-Romero, CY Espy-Wilson
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
1302010
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.
J Villalba, N Chen, D Snyder, D Garcia-Romero, A McCree, G Sell, ...
Interspeech, 1488-1492, 2019
1262019
The system can't perform the operation now. Try again later.
Articles 1–20