A Physical Model-Based Self-Supervised Learning Method for Signal Enhancement Under Reverberant Environment

Citation:

Gao S, Wu X, Qu T. A Physical Model-Based Self-Supervised Learning Method for Signal Enhancement Under Reverberant Environment. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2023;31:2100-2110.

摘要:

In a reverberant environment, interferences such as reflections and background noise can degrade the perception of the sound source signal. Although the DNN-based methods have made a tremendous breakthrough in addressing this issue, the performance of these models is highly dependent on the completeness of the training dataset, which will limit its generalization under unknown environments. In this article, we propose a physical model-based self-supervised learning (PMSSL) method to realize the DNN model optimization under unknown scenarios. This method incorporates a room reverberation physical model into the sound source enhancement model optimization process, realizing the self-learning of the DNN model under physical constraints. In this process, the time-frequency characteristics of the input signal and the spatial feature of the reverberation environment are utilized for parameter optimization, improving the adaptability of the DNN model under unknown scenarios. Experimental results based on simulated and measured data prove that the proposed method can obtain much more accurate source signal enhancement results compared with the pre-trained models, verifying its effectiveness and adaptability in new environments.