科研成果 by Type: Conference Paper

2024

Guo R, Niu D, Qu L, Qi Y, Shi J, Yue W, Xing B, Chen T, Ying X. Instance-Level Panoramic Audio-Visual Saliency Detection and Ranking, in Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November 2024. ACM; 2024:9426–9434. 访问链接

Zhong H, Hong Y, Weng S, Liang J, Shi B. Language-Guided Image Reflection Separation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).; 2024:24913–24922.Abstract

This paper studies the problem of language-guided reflection separation which aims at addressing the ill-posed reflection separation problem by introducing language descriptions to provide layer content. We propose a unified framework to solve this problem which leverages the cross-attention mechanism with contrastive learning strategies to construct the correspondence between language descriptions and image layers. A gated network design and a randomized training strategy are employed to tackle the recognizable layer ambiguity. The effectiveness of the proposed method is validated by the significant performance advantage over existing reflection separation methods on both quantitative and qualitative comparisons.

Yang Y, Liang J, Yu B, Chen Y, Ren JS, Shi B. Latency Correction for Event-guided Deblurring and Frame Interpolation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).; 2024:24977–24986.Abstract

Event cameras with their high temporal resolution dynamic range and low power consumption are particularly good at time-sensitive applications like deblurring and frame interpolation. However their performance is hindered by latency variability especially under low-light conditions and with fast-moving objects. This paper addresses the challenge of latency in event cameras – the temporal discrepancy between the actual occurrence of changes in the corresponding timestamp assigned by the sensor. Focusing on event-guided deblurring and frame interpolation tasks we propose a latency correction method based on a parameterized latency model. To enable data-driven learning we develop an event-based temporal fidelity to describe the sharpness of latent images reconstructed from events and the corresponding blurry images and reformulate the event-based double integral model differentiable to latency. The proposed method is validated using synthetic and real-world datasets demonstrating the benefits of latency correction for deblurring and interpolation across different lighting conditions.

Hong Y, Zhong H, Weng S, Liang J, Shi B. L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model, in Proceedings of the European Conference on Computer Vision (ECCV).; 2024.Abstract

In this paper, we introduce L-DiffER, a language-based diffusion model designed for the ill-posed single image reflection removal task. Although having shown impressive performance for image generation, existing language-based diffusion models struggle with precise control and faithfulness in image restoration. To overcome these limitations, we propose an iterative condition refinement strategy to resolve the problem of inaccurate control conditions. A multi-condition constraint mechanism is employed to ensure the recovery faithfulness of image color and structure while retaining the generation capability to handle low-transmitted reflections. We demonstrate the superiority of the proposed method through extensive experiments, showcasing both quantitative and qualitative improvements over existing methods.

Xing B, Ying X, Wang R. Masked local-global representation learning for 3d point cloud domain adaptation, in IEEE International Conference on Robotics and Automation. IEEE; 2024:418–424.

Xing B, Ying X, Wang R. Masked Local-Global Representation Learning for 3D Point Cloud Domain Adaptation, in IEEE International Conference on Robotics and Automation, ICRA 2024, Yokohama, Japan, May 13-17, 2024. IEEE; 2024:418–424. 访问链接

Zhai S, Chen H, Dong Y, Li J, Shen Q, Gao Y, Su H, Liu Y. Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy, in Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024.; 2024. 访问链接

Xie L, Lin M, Xu CM, Luan T, Zeng Z, Qian W, Li C, Fang Y, Shen Q, Wu Z. MH-pFLGB: Model Heterogeneous Personalized Federated Learning via Global Bypass for Medical Image Analysis, in Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 - 27th International Conference, Marrakesh, Morocco, October 6-10, 2024, Proceedings, Part X.Vol 15010. Springer; 2024:534–545. 访问链接

Xie L, Lin M, Luan T, Li C, Fang Y, Shen Q, Wu Z. MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data Analysis, in Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net; 2024. 访问链接

Jing Y, Sun Y, Wu M, Zhu Z, Zhou J, HUANG R, Ye L, Jia T. NeRF-Learner: A 2.79mJ/Frame NeRF-SLAM Processor with Unified Inference/Training Compute-in-Memory for Large-Scale Neural Rendering, in 50th European Solid-State Electronics Research Conference (ESSERC).; 2024.

Zhou Y, HUANG R, Tang K. A Novel Hybrid-FE-layer FeFET with Enhanced Linearity for On-chip Training of CIM Accelerator, in 2024 8th IEEE Electron Devices Technology & Manufacturing Conference (EDTM).; 2024:1-3.

Wu C-Y. “Obey…for the Common Good”: Building a Sense of Community in the Bakers’ Strike Edict, in Community and Communication in Classical Antiquity：第13届中日韩三国欧洲古代史学术研讨会，2024 年 10 月 17-20 日. Fudan University, Shanghai; 2024.Abstract

This paper discusses the so-called Bakers’ Strike Edict from Ephesus (Ephesos 231 = IK 12.215 p. 27) in light of recent studies on the Roman imperial toolkit to build empire-wide communities. Clifford Ando and Myles Lavan argued that Roman emperors in the first two centuries CE were consciously blurring distinctions between Roman and non-Roman populations, so that there could be a shared sense of an empire-wide community among people in the provinces. This paper argues that, in addition to Lavan’s observations, gubernatorial edicts also show concerns and measures that sought to communicate a sense of the communal at the local level. While the focus of discussion is on the edict responding to a bakers’ strike at Ephesus, several other examples from a corpus of gubernatorial edicts are also used in connection with this example. This paper hopes to contribute to Ando’s and Lavan’s arguments by pointing to a lower register of community building visible in gubernatorial edicts. The governors’ concerns for and efforts to creating communal cohesion and their need to balance parallel and at times competing “common goods” not only adds another nuance to the grander community building project at the imperial level, but demonstrates further complications on how praesidial governors – and in particular proconsuls – can and would react to difficult issues at the local level.

Guo R, Qu L, Niu D, Qi Y, Yue W, Shi J, Xing B, Ying X. Open-Vocabulary Audio-Visual Semantic Segmentation, in Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November 2024. ACM; 2024:7533–7541. 访问链接

Xie L, Lin M, Liu S, Xu CM, Luan T, Li C, Fang Y, Shen Q, Wu Z. pFLFE: Cross-silo Personalized Federated Learning via Feature Enhancement on Medical Image Segmentation, in Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 - 27th International Conference, Marrakesh, Morocco, October 6-10, 2024, Proceedings, Part X.Vol 15010. Springer; 2024:599–610. 访问链接

Yu Z, Zhang C, Wang Y, Tang W, Wang J, Ma L. Predict and Interpret Health Risk Using Ehr Through Typical Patients, in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024).; 2024.

Zhihao Y, Xu C, Yujie J, Yasha W, Junfeng Z. Predict and Interpret Health Risk Using Ehr Through Typical Patients, in Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024).; 2024.

Feng X, Shen Q, Li C, Fang Y, Wu Z. Privacy Preserving Federated Learning from Multi-Input Functional Proxy Re-Encryption, in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. IEEE; 2024:6955–6959. 访问链接

Xu Yongxin, Jiang Xinke, Xu， C, Yuzhen， X, Zhang Chaohe, Ding Hongxin, Junfeng Z, Yasha W, Bing X. ProtoMix: Augmenting Health Status Representation Learning via Prototype-based Mixup, in the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024).; 2024.

Pages