科研成果 by Type: Conference Paper

2019
Zhang M, Qiao Y, Wu X, Qu T. Distance-dependent Modeling of Head-related Transfer Functions, in international conference on acoustics speech and signal processing(ICASSP). Brighton, United Kingdom ; 2019:276-280.Abstract
In this paper, a method for modeling distance dependent head-related transfer functions is presented. The HRTFs are first decomposed by spatial principal component analysis. Using deep neural networks, we model the spatial principal component weights of different distances. Then we realize the prediction of HRTFs in arbitrary spatial distances. The objective and subjective experiments are conducted to evaluate the proposed distance model and the distance variation function model, and the results have shown that the proposed model has less spectral distortions than distance variation function model, and the virtual sound generated by the proposed model has better performance in terms of distance localization.
Ge Z, Wu X, Qu T. Improvements to the matching projection decoding method for Ambisonic system with irregular loudspeaker layouts, in international conference on acoustics speech and signal processing(ICASSP). Brighton, United Kingdom; 2019:121-125.Abstract
The Ambisonic technique has been widely used for soundfield recording and reproduction recently. However, the basicAmbisonic decoding method will break down when the play-back loudspeakers distribute unevenly. Various methods havebeen proposed to solve this problem. This paper introducesseveral improvements to a recently proposed Ambisonic de-coding method, the matching projection method, for unevenloudspeaker layouts. The first improvement is energy preserv-ing; the second is introducing the “in-phase” weight, and thethird is introducing partial projection coefficients. To eval-uate the improved method, we compared it with the origi-nal one and the all-round Ambisonic decoding method witha 2-dimension unevenly arranged loudspeaker array. The re-sult shows our method greatly improves the original methodwhere the loudspeaker arranges very sparsely or densely.
Zhang S, Wu X, Qu T. Sparse Autoencoder Based Multiple Audio Objects Coding Method, in 146 AES Convention. Dublin, Ireland; 2019:10172. 访问链接Abstract
The traditional multiple audio objects codec extracts the parameters of each object in the frequency domain and produces serious confusion because of high coincidence degree in subband among objects. This paper uses sparse domain instead of frequency domain and reconstruct audio object using the binary mask from the down-mixed signal based on the sparsity of each audio object. In order to overcome high coincidence degree of subband among different audio objects, the sparse autoencoder neural network is established. On this basis, a multiple audio objects codec system is built up. To evaluate this proposed system, the objective and subjective evaluation are carried on and the results show that the proposed system has the better performance than SAOC.
2018
Huang Q, Wu X, Qu T. A Parametric Spatial Audio Coding Method Based on Convolutional Neural Networks, in 145 AES Convention. New York, USA; 2018:10126. 访问链接
Ge Z, Qiao Y, Wang S, Wu X, Qu T. Subjective Evaluation of Virtual Room Auralization System Based on the Ambisonics Matching Projection Decoding Method, in 145 AES Convention. New York, USA; 2018:10124. 访问链接
Qu T, Huang Z, Qiao Y, Wu X. Matching Projection Decoding Method for Ambisonics System, in International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018). Calgary, Alberta, Canada; 2018:561-565. 访问链接
Huang Q, Wu X, Qu T. Bandwidth Extension Method Based on Generative Adversarial Nets for Audio Compression., in 144 AES Convention. Milan, Italian; 2018:9954. 访问链接
Gao S, Wu X, Qu T. High order Ambisonics encoding method using differential microphone array, in 144 AES Convention. Milan Italian; 2018:10020. 访问链接
2016
Huang Z, Gao S, Qu T, Li L, Wu X. An environment adaptive loudspeaker calibration method for Ambisonics decoding system, in the 5th International Conference on Audio, Language and Image Processing. Shanghai, China; 2016:277.
Huang Q, Qu T, Li L, Wu X. Inter-channeltransferfunctionbased parametric stereo coding system, in Proceedings of the 22nd International Congress on Acoustics. Buenos Airs, Argentina; 2016:125.
Song T, Chen J, Zhang DB, Qu T, Wu X. A sound source localization algorithm using microphone array with rigid body, in Proceedings of the 22nd International Congress on Acoustics. Buenos Airs, Argentina; 2016:352.
Zhang M, Zhu F, Qu T, Wu X. An asynchronous HRTF measurement method based on phase alignment, in Proceedings of the 22nd International Congress on Acoustics. Buenos Airs, Argentina; 2016:342.
Qu T, Huang Q, Huang Y, Li L, Wu X. An Accurate Decorrelation Method for Parametric Stereo Coding, in the 5th International Conference on Audio, Language and Image Processing. Shanghai, China; 2016:283.
2013
Qu TS. Head-Related Transfer Function Measurement, in the 16th International Conference on Digital Audio Effects(DAFx13). Maynooth, Ireland; 2013.
Qu TS, Sun HZ, Wang N, Wu XH, Hartmann W. Perceived Elevation Cued by Images Rotating in Horizontal Planes, in the 21st International Congress on Acoustics (ICA2013) . Montreal, Canada; 2013:1-6.
He WX, Qu TS. Audio Lossless Coding/decoding Method using Basis Pursuit Algorithm, in IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP2013). Vancouver, Canada; 2013:552-555.
2012
Qu TS, Hartmann W. Using low-frequency threshold interaural time differences to test models of binaural hearing, in J. Acoust. Soc. Am.Vol 131.; 2012:3270.
Qu TS, Cao Y, Chen X, Huang Y, Wu XH, Schneider B, Li L. Detection of Spectral Changes Induced by a Break in Sound Correlation in Younger Adults and Older Adults, in J. Acoust. Soc. Am.Vol 131.; 2012:3270.
2009
Qu TS, Song Y, Li XD, Wu XH. A Measurement of Structural Head-related Transfer Functions in Proximal Region, in The 10th Western Pacific Acoustics Conference. Beijing China; 2009.
2008
Qu T, Xiao Z, Gong M, Huang Y, Li X, Wu X. Distance dependent head-related transfer function database of KEMAR, in International Conference on Audio, Language and Image Processing.; 2008.

Pages