<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Zeyu Yuan</style></author><author><style face="normal" font="default" size="100%">Shan Gao</style></author><author><style face="normal" font="default" size="100%">Xihong Wu</style></author><author><style face="normal" font="default" size="100%">Tianshu Qu</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Spatial Covariant Matrix based Learning for DOA Estimationin Spherical Harmonics Domain</style></title><secondary-title><style face="normal" font="default" size="100%">the AES 156th Convention</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2024</style></year><pub-dates><date><style  face="normal" font="default" size="100%">15-17 June 2024</style></date></pub-dates></dates><pub-location><style face="normal" font="default" size="100%">Madrid, Spain</style></pub-location><pages><style face="normal" font="default" size="100%">10701</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Direction of arrival (DoA) estimation in complex environments is a challenging task. The traditional methods suffer from invalidity under low signal-to-noise ratio (SNR) and reverberation conditions, and the data-driven methods lack of generalization to unseen data types. In this paper we propose a robust DoA estimation approach by combining the two methods above. To focus on spatial information modeling, the proposed method directly uses the compressed covariance matrix of the first-order ambisonics (FOA) signal as input, while only white noise is used during training. To adapt to different characteristics of FOA signals in different frequency bands, our method estimates DoA in different frequency bands by particular models, and the subband results are finally integrated together. Experiments are carried out on both simulated and measured datasets, and the results show the superiority of the proposed method than existing baselines under complex conditions and the scalability for unseen data types.</style></abstract></record></records></xml>