Citation:
Guo R, Qu L, Niu D, Qi Y, Yue W, Shi J, Xing B, Ying X. Open-Vocabulary Audio-Visual Semantic Segmentation. In: Proceedings of the 32nd ACM International Conference on Multimedia (MM 2024), Melbourne, VIC, Australia, 28 October - 1 November 2024. ACM; 2024:7533–7541.