Follow
Shinji Watanabe
Title
Cited by
Cited by
Year
Deep clustering: Discriminative embeddings for segmentation and separation
JR Hershey, Z Chen, J Le Roux, S Watanabe
2016 IEEE international conference on acoustics, speech and signal …, 2016
14782016
Espnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
14762018
Joint CTC-attention based end-to-end speech recognition using multi-task learning
S Kim, T Hori, S Watanabe
2017 IEEE international conference on acoustics, speech and signal …, 2017
10252017
Hybrid CTC/attention architecture for end-to-end speech recognition
S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017
8362017
A comparative study on transformer vs rnn in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
7732019
The third ‘CHiME’speech separation and recognition challenge: Dataset, task and baselines
J Barker, R Marxer, E Vincent, S Watanabe
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
7422015
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks
H Erdogan, JR Hershey, S Watanabe, J Le Roux
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
7412015
Superb: Speech processing universal performance benchmark
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
arXiv preprint arXiv:2105.01051, 2021
6862021
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR
F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ...
Latent Variable Analysis and Signal Separation: 12th International …, 2015
6702015
Single-channel multi-speaker separation using deep clustering
Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey
arXiv preprint arXiv:1607.02173, 2016
4792016
An analysis of environment, microphone and data simulation mismatches in robust speech recognition
E Vincent, S Watanabe, AA Nugraha, J Barker, R Marxer
Computer Speech & Language 46, 535-557, 2017
4052017
The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines
J Barker, S Watanabe, E Vincent, J Trmal
arXiv preprint arXiv:1803.10609, 2018
4002018
Improved mvdr beamforming using single-channel mask prediction networks.
H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux
Interspeech, 1981-1985, 2016
3522016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
arXiv preprint arXiv:1706.02737, 2017
3452017
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
2942022
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
2892020
The second ‘CHiME’speech separation and recognition challenge: Datasets, tasks and baselines
E Vincent, J Barker, S Watanabe, J Le Roux, F Nesta, M Matassoni
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
2672013
Recent developments on espnet toolkit boosted by conformer
P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2662021
Self-supervised speech representation learning: A review
A Mohamed, H Lee, L Borgholt, JD Havtorn, J Edin, C Igel, K Kirchhoff, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1179-1210, 2022
2412022
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration
S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani
Proc. Interspeech 2019, 2019
2382019
The system can't perform the operation now. Try again later.
Articles 1–20