Gao S, Liu R, Wu X, Qu T.
Eigen Beam Based Sound Source Localization Algorithms Evaluation on a Non-Spherical Microphone Array, in
2019 IEEE 2nd International Conference on Information Communication and Signal Processing (ICICSP). Weihai, China; 2019:185-189.
AbstractThe traditional eigen beam based localization algorithms are usually not employed on the non-spherical microphone array, for which the eigen beam is hard to be obtained. In this paper, the transfer functions are introduced to calculated the eigen beam on the non-spherical microphone array. Based on it, three localization algorithms including the eigen beam based intensity vector, eigen beam based beamforming, eigen beam based MUSIC, are employed and their performance on localization are evaluated.
Huang Y, Wu X, Qu T.
A Time-domain End-to-End Method for Sound Source Localization Using Multi-Task Learning, in
2019 IEEE 2nd International Conference on Information Communication and Signal Processing (ICICSP). Weihai, China; 2019:52-56.
AbstractIn recent years, many researches focus on sound source localization based on neural networks, which is an appealing but difficult problem. In this paper, a novel time-domain end-to-end method for sound source localization is proposed, where the model is trained by two strategies with both cross entropy loss and mean square error loss. Based on the idea of multi-task learning, CNN is used as the shared hidden layers to extract features and DNN is used as the output layers for each task. Compared with SRP-PHAT, MUSIC and a DNN-based method, the proposed method has better performance.
Zhang M, Qiao Y, Wu X, Qu T.
Distance-dependent Modeling of Head-related Transfer Functions, in
international conference on acoustics speech and signal processing(ICASSP). Brighton, United Kingdom ; 2019:276-280.
AbstractIn this paper, a method for modeling distance dependent head-related transfer functions is presented. The HRTFs are first decomposed by spatial principal component analysis. Using deep neural networks, we model the spatial principal component weights of different distances. Then we realize the prediction of HRTFs in arbitrary spatial distances. The objective and subjective experiments are conducted to evaluate the proposed distance model and the distance variation function model, and the results have shown that the proposed model has less spectral distortions than distance variation function model, and the virtual sound generated by the proposed model has better performance in terms of distance localization.