Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
Abstract: Environmental Sound Recognition (ESR) is an essential task in audio analysis, involving the identification and classification of sounds from various environmental contexts. This study ...