Sound source direction estimation apparatus, sound source direction estimation method and computer program product N Ding, Y Kida US Patent 9,473,849, 2016 | 121 | 2016 |
Voice activity detection: Merging source and filter-based information T Drugman, Y Stylianou, Y Kida, M Akamine IEEE Signal Processing Letters 23 (2), 252-256, 2015 | 103 | 2015 |
Voice activity detection based on optimally weighted combination of multiple features. Y Kida, T Kawahara INTERSPEECH, 2621-2624, 2005 | 51 | 2005 |
Television apparatus and a remote operation apparatus K Ouchi, A Kawamura, M Sakai, K Suzuki, Y Kida US Patent 9,154,848, 2015 | 31 | 2015 |
Neural diarization with non-autoregressive intermediate attractors Y Fujita, T Komatsu, R Scheibler, Y Kida, T Ogawa ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
Apparatus, method and computer program product for feature extraction Y Kida, T Masuko US Patent 8,073,686, 2011 | 10 | 2011 |
Apparatus and method for discriminating speech, and computer readable medium K Suzuki, M Sakai, Y Kida US Patent 9,330,682, 2016 | 9 | 2016 |
Evaluation of voice activity detection by combining multiple features with weight adaptation. Y Kida, T Kawahara INTERSPEECH, 2006 | 9 | 2006 |
Apparatus and method for discriminating speech of acoustic signal with exclusion of disturbance sound, and non-transitory computer readable medium K Suzuki, M Sakai, Y Kida US Patent 9,330,683, 2016 | 8 | 2016 |
Speaker selective beamformer with keyword mask estimation Y Kida, D Tran, M Omachi, T Taniguchi, Y Fujita 2018 IEEE Spoken Language Technology Workshop (SLT), 528-534, 2018 | 7 | 2018 |
Minimum classification error interactive training for speaker identification [interactive robot applications] Y Kida, H Yamamoto, C Miyajima, K Tokuda, T Kitamura Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech …, 2005 | 7 | 2005 |
Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition Y Kida, M Sakai, T Masuko, A Kawamura Tenth Annual Conference of the International Speech Communication Association, 2009 | 6 | 2009 |
Simultaneous Detection and Localization of a Wake-Up Word Using Multi-Task Learning of the Duration and Endpoint. T Maekaku, Y Kida, A Sugiyama INTERSPEECH, 4240-4244, 2019 | 5 | 2019 |
Tourist guidance robot based on HyperCLOVA T Yamazaki, K Yoshikawa, T Kawamoto, M Ohagi, T Mizumoto, S Ichimura, ... arXiv preprint arXiv:2210.10400, 2022 | 4 | 2022 |
Multi-sequence intermediate conditioning for CTC-based ASR Y Fujita, T Komatsu, Y Kida arXiv preprint, 2022 | 4 | 2022 |
Using duration and pitch for Mandarin digit string recognition R Zhao, Y Kida, X Yan, P Ding, L He 2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010 | 4 | 2010 |
Better intermediates improve CTC inference T Komatsu, Y Fujita, J Lee, L Lee, S Watanabe, Y Kida arXiv preprint arXiv:2204.00176, 2022 | 2 | 2022 |
InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR Y Nakagome, T Komatsu, Y Fujita, S Ichimura, Y Kida arXiv preprint arXiv:2204.00174, 2022 | 2 | 2022 |
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers Y Kida, T Komatsu, M Togami arXiv preprint arXiv:2104.10328, 2021 | 2 | 2021 |
Creating device, creating method, and non-transitory computer readable storage medium Y Kida, D Tran US Patent App. 16/131,561, 2019 | 2 | 2019 |