科研成果 by Type: Conference Paper

2024
Wu D, Wu X, Qu T. A HYBRID DEEP-ONLINE LEARNING BASED METHOD FOR ACTIVE NOISE CONTROLIN WAVE DOMAIN, in International Conference on Acoustics, Speech and Signal Processing (ICASSP). COEX, Seoul, Korea; 2024.
2023
Yuan Z, Wu D, Wu X, Qu T. Sound event localization and detection based on iterative separation in embedding space, in 2023 6th International Conference on Information Communication and Signal Processing (ICICSP). Xian, China; 2023:455-459.
Wang Y, Lan Z, Wu X, Qu T. TT-Net: Dual-Path Transformer Based Sound Field Translation in the Spherical Harmonic Domain, in International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece; 2023:1-5.
Ge Z, Tian P, Li L, Qu T. Rendering Near-field Point Sound Sources Through an Iterative Weighted Crosstalk Cancellation Method, in Audio Engineering Society Convention 154. Helsinki, Finland; 2023:10649.
2022
Qu T, Xu J, Yuan Z, Wu X. Higher order ambisonics compression method based onautoencoder, in Audio Engineering Society Convention 153. online; 2022:Express paper 9. 访问链接Abstract
The compression of three-dimensional sound field signals has always been a very important issue. Recently, an Independent Component Analysis (ICA) based Higher Order Ambisonics (HOA) compression method introduces blind source separation to solve the shortcomings of discontinuity between frames in the existing Singular Value Decomposition (SVD) based methods. However, ICA is weak to model the reverberant environment, and its target is not to recover original signal. In this work, we replace ICA with autoencoder to further improve the above method’s ability to cope with reverberation conditions and ensure the unanimous optimization both in separation and recovery by reconstruction loss. We constructed a dataset with simulated and recorded signals, and verified the effectiveness of our method through objective and subjective experiments.
Chao P, Wang Y, Wu X, Qu T. A Multi-channel Speech Separation System for Unknown Number of Multiple Speakers, in 2022 5th International Conference on Information Communication and Signal Processing (ICICSP). Shenzhen, China; 2022.
Gao S, Wu X, Qu T. Localization of Direct Source and Early Reflections Using HOA Processing and DNN Model, in Audio Engineering Society Convention 152.; 2022:10560. 访问链接
Wang Y, Wu X, Qu T. UP-WGAN: Upscaling Ambisonic Sound Scenes Using Wasserstein Generative Adversarial Networks, in Audio Engineering Society Convention 152.; 2022:10577. 访问链接
2021
Chen J, Wu X, Qu T. Early Reflections Based Speech Enhancement, in 2021 4th International Conference on Information Communication and Signal Processing (ICICSP). ShangHai, China; 2021:183-187.
Xu J, Niu Y, Wu X, Qu T. Higher order ambisonics compression method based on independent component analysis, in Audio Engineering Society Convention 150.; 2021:10456.
Zhang M, Guan T, Chen L, Fu T, Su D, Qu T. Individualized HRTF-based Binaural Renderer for Higher-Order Ambisonics, in Audio Engineering Society Convention 150.; 2021:10454.
2020
Jin X, Li S, Qu T, Manocha D, Wang G. Deep-modal: real-time impact sound synthesis for arbitrary shapes, in The 28th ACM International Conference on Multimedia.; 2020:1171-1179.
Lin J, Wu X, Qu T. Anti Spatial Aliasing HOA Encoding Method based on Aliasing Projection Matrix, in 2020 IEEE 3rd International Conference on Information Communication and Signal Processing (ICICSP).; 2020:321-325.
Huang Y, Wu X, Qu T. A Time-domain Unsupervised Learning Based Sound Source Localization Method, in 2020 IEEE 3rd International Conference on Information Communication and Signal Processing (ICICSP).; 2020:26-32.
Peng C, Wu X, Qu T. Competing speaker count estimation on the fusion of the spectral and spatial embedding space, in INTERSPEECH 2020. Shanghai China; 2020:3077-3081.
Zhang M, Wu X, Qu T. Individual Distance-Dependent HRTFS Modeling Through A Few Anthropometric Measurements, in International Conference on Acoustics, Speech and Signal Processing (ICASSP) . Barcelona, Spain; 2020:401-405.
Ge Z, Li L, Qu T. The Ambisonic Partially Matching Projection Decoding Method for Near-field Sound Sources, in 148 AES Convension. Vienna, Austria; 2020:10372.
Wang Y, Wu X, Qu T. Direction of arrival estimation based on transfer function learning using autoencoder network, in 148 AES Convention. Vienna, Austria; 2020:10370.
2019
Huang Y, Wu X, Qu T. A Time-domain End-to-End Method for Sound Source Localization Using Multi-Task Learning, in 2019 IEEE 2nd International Conference on Information Communication and Signal Processing (ICICSP). Weihai, China; 2019:52-56.Abstract
In recent years, many researches focus on sound source localization based on neural networks, which is an appealing but difficult problem. In this paper, a novel time-domain end-to-end method for sound source localization is proposed, where the model is trained by two strategies with both cross entropy loss and mean square error loss. Based on the idea of multi-task learning, CNN is used as the shared hidden layers to extract features and DNN is used as the output layers for each task. Compared with SRP-PHAT, MUSIC and a DNN-based method, the proposed method has better performance.
Gao S, Liu R, Wu X, Qu T. Eigen Beam Based Sound Source Localization Algorithms Evaluation on a Non-Spherical Microphone Array, in 2019 IEEE 2nd International Conference on Information Communication and Signal Processing (ICICSP). Weihai, China; 2019:185-189.Abstract
The traditional eigen beam based localization algorithms are usually not employed on the non-spherical microphone array, for which the eigen beam is hard to be obtained. In this paper, the transfer functions are introduced to calculated the eigen beam on the non-spherical microphone array. Based on it, three localization algorithms including the eigen beam based intensity vector, eigen beam based beamforming, eigen beam based MUSIC, are employed and their performance on localization are evaluated.

Pages