Wav2vec-switch: Contrastive learning from original-noisy speech pairs for robust speech recognition Y Wang, J Li, H Wang, Y Qian, C Wang, Y Wu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 53 | 2022 |
Towards robust speech super-resolution H Wang, DL Wang IEEE/ACM transactions on audio, speech, and language processing 29, 2058-2066, 2021 | 36 | 2021 |
Improving noise robustness of contrastive speech representation learning with speech reconstruction H Wang, Y Qian, X Wang, Y Wang, C Wang, S Liu, T Yoshioka, J Li, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 27 | 2022 |
Time-frequency loss for CNN based speech super-resolution H Wang, D Wang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 26 | 2020 |
Neural cascade architecture with triple-domain loss for speech enhancement H Wang, DL Wang IEEE/ACM transactions on audio, speech, and language processing 30, 734-743, 2021 | 19 | 2021 |
Attention-based fusion for bone-conducted and air-conducted speech enhancement in the complex domain H Wang, X Zhang, DL Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 9 | 2022 |
Fusing bone-conduction and air-conduction sensors for complex-domain speech enhancement H Wang, X Zhang, DL Wang IEEE/ACM transactions on audio, speech, and language processing 30, 3134-3143, 2022 | 8 | 2022 |
Cross-domain diffusion based speech enhancement for very noisy speech H Wang, DL Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Cross-domain speech enhancement with a neural cascade architecture H Wang, DL Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
A diffusion-based two-dimensional Empirical Mode Decomposition (EMD) algorithm for image analysis H Wang, R Mann, ER Vrscay Image Analysis and Recognition: 15th International Conference, ICIAR 2018 …, 2018 | 3 | 2018 |
Densely-connected Convolutional Recurrent Network for Fundamental Frequency Estimation in Noisy Speech Y Zhang, H Wang, DL Wang INTERSPEECH, 401-405, 2022 | 2 | 2022 |
uSee: Unified Speech Enhancement And Editing with Conditional Diffusion Models M Yang, C Zhang, Y Xu, Z Xu, H Wang, B Raj, D Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions H Wang, M Yu, H Zhang, C Zhang, Z Xu, M Yang, Y Zhang, D Yu arXiv preprint arXiv:2309.09028, 2023 | 1 | 2023 |
F0 Estimation and Voicing Detection With Cascade Architecture in Noisy Speech Y Zhang, H Wang, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 1 | 2023 |
DATA2VEC-SG: Improving Self-Supervised Learning Representations for Speech Generation Tasks H Wang, Y Qian, H Yang, N Kanda, P Wang, T Yoshioka, X Wang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
A Novel Forward-PDE Approach as an Alternative to Empirical Mode Decomposition H Wang, R Mann, ER Vrscay arXiv preprint arXiv:1802.00835, 2018 | 1 | 2018 |
SPATIALCODEC: Neural Spatial Speech Coding Z Xu, Y Xu, V Kothapally, H Wang, M Yang, D Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Combined Generative and Predictive Modeling for Speech Super-resolution H Wang, EW Healy, DL Wang arXiv preprint arXiv:2401.14269, 2024 | | 2024 |
Leveraging Laryngograph Data for Robust Voicing Detection in Speech Y Zhang, H Wang, DL Wang arXiv preprint arXiv:2312.03129, 2023 | | 2023 |
Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction L Zhang, Y Qian, L Yu, H Wang, X Wang, H Yang, L Zhou, S Liu, Y Qian, ... arXiv preprint arXiv:2309.13874, 2023 | | 2023 |