科研成果 by Type: Conference Paper

2024
Liu Z, Li Z, Jiang T. BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion, in The 38th AAAI Conference on Artificial Intelligence, AAAI 2024, Feb. 20-27. Vancouver, Canada: AAAI Press; 2024:3846–3854. 访问链接
Cochet M, Swaminathan K, Loscalzo EJ, Zuckerman J, dos Santos MC, Giri D, Buyuktosunoglu A, Jia T, Brooks D, Wei G-Y, et al. BlitzCoin: Fully Decentralized Hardware Power Management for Accelerator-Rich SoCs, in International Symposium on Computer Architecture (ISCA).; 2024.
Zhong Y, Wu X, Zhang L, Yang C, Jiang T. Causal-IQA: Towards the Generalization of Image Quality Assessment Based on Causal Inference, in The 41st International Conference on Machine Learning, ICML 2024, July 21-27. Vienna, Austria; 2024. 访问链接
Li W, Tang Z, Zhang P, Wang J. Collaborative Data Behaviors in Digital Humanities Research Teams, in ACM IEEE Joint Conference on Digital Libraries (JCDL 24). Hongkong, China; 2024.
Yang L, Xu C, Ma Xinyu, Gao Xin, Li Ruiqing, Yasha W. Combating Label Sparsity in Short Text Topic Modeling via Nearest Neighbor Augmentation, in In Findings of The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024).; 2024.
Xu X, Wang B, Yan Y, Zhu H, Zhang Z, Wu X, Chen J*. ConvConcatNet: a deep convolutional neural network to reconstruct mel spectrogram from the EEG, in arXiv; 2024. 访问链接
Hagag A, Gomaa A, Kornek D, Maier A, Fietkau R, Bert C, Huang Y, Putz F. Deep Learning for Cancer Prognosis Prediction Using Portrait Photos by StyleGAN Embedding, in International Conference on Medical Image Computing and Computer-Assisted Intervention.; 2024:early–acceptance.
Liu Y, Yang C, Li D, Ding J, Jiang T. Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization, in IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, June 16-22. Seattle, WA, USA: IEEE; 2024:25554–25563. 访问链接
Xu X, Wang B, Yan Y, Wu X, Chen J*. A DenseNet-Based Method for Decoding Auditory Spatial Attention with EEG, in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).; 2024:1946–1950. 访问链接
Salminen J, Liu C, Pian W, Chi J, Häyhänen E, Jansen BJ. Deus ex machina and personas from large language models: investigating the composition of AI-generated persona descriptions, in Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems.; 2024:1–20.
Du Y, Wang Z, Sun Z, Ma Y, Liu H, Zhang J. Disentangled Multi-interest Representation Learning for Sequential Recommendation, in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2024, Barcelona, Spain, August 25-29, 2024. ACM; 2024:677–688. 访问链接
Wang Q, Wang Y, Wang Y, Ying X. Dissecting the Failure of Invariant Learning on Graphs, in Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024.; 2024. 访问链接
Wang Q, Wang Y, Wang Y, Ying X. Dissecting the Failure of Invariant Learning on Graphs, in Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024.; 2024. 访问链接
Gao S, Wu X, Qu T. DOA-Informed Self-Supervised Learning Method for SoundSource Enhancement, in the AES 156th Convention. Madrid, Spain; 2024:10683.Abstract
The multiple-channel[1] sound source enhancement methods have made a great progress in recent years, especially when combined with the learning-based algorithms. However, the performance of these techniques is limited by the completeness of the training dataset, which may degrade in mismatched environments. In this paper, we propose a reconstruction Model based Self-supervised Learning (RMSL) method for sound source enhancement. A reconstruction module is used to integrate the estimated target signal and noise components to regenerate the multi-channel mixed signals, and it is connected with a separating model to form a closed loop.In this case, the optimization of the separation model can be achieved by continuously iterating the separation-reconstruction process. We use the separation error, the reconstruction error, and the signal-noise independence error as lossfunctions in the self-supervised learning process. This method is applied to the state-of-the-art sound source separation model (ADL-MVDR) and evaluated under different scenarios. Experimental results demonstrate that the proposed method can improve the performance of ADL-MVDR algorithm under different number of sound sources, bringing about 0.5 dB to 1 dB Si-SNR gain, while maintaining good clarity and intelligibility in practical application.
Qian W, Shen Q, Xu H, Huang X, Wu Z. DROPFL: Client Dropout Attacks Against Federated Learning Under Communication Constraints, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. IEEE; 2024:4870–4874. 访问链接
Du Y, Luo D, Yan R, Wang X, Liu H, Zhu H, Song Y, Zhang J. Enhancing Job Recommendation through LLM-Based Generative Adversarial Networks, in Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, February 20-27, 2024, Vancouver, Canada. AAAI Press; 2024:8363–8371. 访问链接
Gao X, Lin Y, Li R, Wang Y, Chu X, Ma X, Yu H. Enhancing Topic Interpretability for Neural Topic Modeling Through Topic-Wise Contrastive Learning, in 2024 IEEE 40th International Conference on Data Engineering (ICDE 2024).; 2024.
Yu B, Ren J, Han J, Wang F, Liang J, Shi B. EventPS: Real-Time Photometric Stereo Using an Event Camera, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).; 2024:9602–9611.Abstract
Photometric stereo is a well-established technique to estimate the surface normal of an object. However the requirement of capturing multiple high dynamic range images under different illumination conditions limits the speed and real-time applications. This paper introduces EventPS a novel approach to real-time photometric stereo using an event camera. Capitalizing on the exceptional temporal resolution dynamic range and low bandwidth characteristics of event cameras EventPS estimates surface normal only from the radiance changes significantly enhancing data efficiency. EventPS seamlessly integrates with both optimization-based and deep-learning-based photometric stereo techniques to offer a robust solution for non-Lambertian surfaces. Extensive experiments validate the effectiveness and efficiency of EventPS compared to frame-based counterparts. Our algorithm runs at over 30 fps in real-world scenarios unleashing the potential of EventPS in time-sensitive and high-speed downstream applications.
Shi R, Duan L, Huang T, Jiang T. Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images, in The 38th AAAI Conference on Artificial Intelligence, AAAI 2024, Feb. 20-27. Vancouver, Canada: AAAI Press; 2024:4847–4855. 访问链接
Wu D, Wu X, Qu T. Exploiting Motion Information in Sound Source Localizationand Tracking, in the AES 156th Convention. Madrid, Spain; 2024:10687.Abstract
Deep neural networks can be employed for estimating the direction of arrival (DOA) of individual sound sources from audio signals. Existing methods mostly focus on estimating the DOA of each source on individual frames, without utilizing the motion information of the sources. This paper proposes a method for estimating trajectories of sources, leveraging the differential of trajectories across different time scales. Additionally, a neural network is employed for enhancing the trajectories wrongly estimated especially for sound sources with low-energy. Experimental evaluations conducted on simulated dataset validate that the proposed method achieves more precise localization and tracking performance and encounters less interference when the sound source energy is low.

Pages