Recursive speech separation for unknown number of speakers N Takahashi, S Parthasaarathy, N Goswami, Y Mitsufuji arXiv preprint arXiv:1904.03065, 2019 | 94 | 2019 |
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ... arXiv preprint arXiv:2206.01948, 2022 | 69 | 2022 |
Improving voice separation by incorporating end-to-end speech recognition N Takahashi, MK Singh, S Basak, P Sudarsanam, S Ganapathy, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 22 | 2020 |
Clotho-aqa: A crowdsourced dataset for audio question answering S Lipping, P Sudarsanam, K Drossos, T Virtanen 2022 30th European Signal Processing Conference (EUSIPCO), 1140-1144, 2022 | 19 | 2022 |
Assessment of self-attention on learned features for sound event localization and detection P Sudarsanam, A Politis, K Drossos arXiv preprint arXiv:2107.09388, 2021 | 16 | 2021 |
STARSS23: An audio-visual dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events K Shimada, A Politis, P Sudarsanam, DA Krause, K Uchida, S Adavanne, ... Advances in Neural Information Processing Systems 36, 2024 | 11 | 2024 |
Attention-Based Methods For Audio Question Answering P Sudarsanam, T Virtanen 2023 31st European Signal Processing Conference (EUSIPCO), 750-754, 2023 | | 2023 |
Toward an Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events Kazuki Shimada1, Archontis Politis2, Parthasaarathy … K Shimada, A Politis, P Sudarsanam, D Krause, N Takahashi, ... | | |