Title | Authors | Venue | Cited by | Year |
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions | YW Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly ... | ICASSP, 2018 | 3234* | 2018 |
Tacotron: Towards end-to-end speech synthesis | Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... | arXiv preprint arXiv:1703.10135, 2017 | 2175 | 2017 |
Tacotron: A fully end-to-end text-to-speech synthesis model | Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... | arXiv preprint arXiv:1703.10135 164, 2017 | 285 | 2017 |
Method and system for non-parametric voice conversion | I Agiomyrgiannakis | US Patent 9,183,830, 2015 | 260 | 2015 |
Method and system for building text-to-speech voice from diverse recordings | I Agiomyrgiannakis, A Gutkin | US Patent 9,542,927, 2017 | 185 | 2017 |
Fast, compact, and high quality LSTM-RNN based statistical parametric speech synthesizers for mobile devices | H Zen, Y Agiomyrgiannakis, N Egberts, F Henderson, P Szczepaniak | arXiv preprint arXiv:1606.06061, 2016 | 155 | 2016 |
Vocaine the vocoder and applications in speech synthesis | Y Agiomyrgiannakis | 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 107 | 2015 |
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech | B Patton, Y Agiomyrgiannakis, M Terry, K Wilson, RA Saurous, D Sculley | arXiv preprint arXiv:1611.09207, 2016 | 97 | 2016 |
Determining pitch dynamics of an audio signal | I Agiomyrgiannakis | US Patent 8,645,128, 2014 | 94 | 2014 |
Wrapped Gaussian mixture models for modeling and high-rate quantization of phase data of speech | Y Agiomyrgiannakis, Y Stylianou | IEEE Transactions on Audio, Speech, and Language Processing 17 (4), 775-786, 2009 | 62 | 2009 |
Google's Next-Generation Real-Time Unit-Selection Synthesizer Using Sequence-to-Sequence LSTM-Based Autoencoders | V Wan, Y Agiomyrgiannakis, H Silen, J Vit | INTERSPEECH, 1143-1147, 2017 | 61 | 2017 |
Synthesizing speech from text using neural networks | Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ... | US Patent 10,971,170, 2021 | 59 | 2021 |
End-to-end text-to-speech conversion | S Bengio, Y Wang, Z Yang, Z Chen, Y Wu, I Agiomyrgiannakis, RJ Weiss, ... | US Patent 10,573,293, 2020 | 49 | 2020 |
Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis | H Kawahara, Y Agiomyrgiannakis, H Zen | arXiv preprint arXiv:1605.07809, 2016 | 40 | 2016 |
Systems and methods for three-dimensional audio CAPTCHA | Y Agiomyrgiannakis, E Tan, DJ Abraham | US Patent 9,263,055, 2016 | 38 | 2016 |
Text-to-speech synthesis using an autoencoder | BH Chun, J Gonzalvo, C Chan, I Agiomyrgiannakis, VPL Wan, RAJ Clark, ... | US Patent 10,249,289, 2019 | 34 | 2019 |
ARX-LF-based source-filter methods for voice modification and transformation | Y Agiomyrgiannakis, O Rosec | 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 34 | 2009 |
Conditional vector quantization for speech coding | Y Agiomyrgiannakis, Y Stylianou | IEEE Transactions on Audio, Speech, and Language Processing 15 (2), 377-386, 2007 | 34 | 2007 |
Combined estimation/coding of highband spectral envelopes for speech spectrum expansion | Y Agiomyrgiannakis, Y Stylianou | 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 30 | 2004 |
Method and system for cross-lingual voice conversion | I Agiomyrgiannakis | US Patent 9,177,549, 2015 | 29 | 2015 |