科研成果 by Type: Conference Paper

2024

Hong Y, Zhong H, Weng S, Liang J, Shi B. L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model, in Proceedings of the European Conference on Computer Vision (ECCV).; 2024.Abstract

In this paper, we introduce L-DiffER, a language-based diffusion model designed for the ill-posed single image reflection removal task. Although having shown impressive performance for image generation, existing language-based diffusion models struggle with precise control and faithfulness in image restoration. To overcome these limitations, we propose an iterative condition refinement strategy to resolve the problem of inaccurate control conditions. A multi-condition constraint mechanism is employed to ensure the recovery faithfulness of image color and structure while retaining the generation capability to handle low-transmitted reflections. We demonstrate the superiority of the proposed method through extensive experiments, showcasing both quantitative and qualitative improvements over existing methods.

Xing B, Ying X, Wang R. Masked local-global representation learning for 3d point cloud domain adaptation, in IEEE International Conference on Robotics and Automation. IEEE; 2024:418–424.

Xing B, Ying X, Wang R. Masked Local-Global Representation Learning for 3D Point Cloud Domain Adaptation, in IEEE International Conference on Robotics and Automation, ICRA 2024, Yokohama, Japan, May 13-17, 2024. IEEE; 2024:418–424. 访问链接

Jing Y, Sun Y, Wu M, Zhu Z, Zhou J, HUANG R, Ye L, Jia T. NeRF-Learner: A 2.79mJ/Frame NeRF-SLAM Processor with Unified Inference/Training Compute-in-Memory for Large-Scale Neural Rendering, in 50th European Solid-State Electronics Research Conference (ESSERC).; 2024.

Zhou Y, HUANG R, Tang K. A Novel Hybrid-FE-layer FeFET with Enhanced Linearity for On-chip Training of CIM Accelerator, in 2024 8th IEEE Electron Devices Technology & Manufacturing Conference (EDTM).; 2024:1-3.

Wu C-Y. “Obey…for the Common Good”: Building a Sense of Community in the Bakers’ Strike Edict, in Community and Communication in Classical Antiquity：第13届中日韩三国欧洲古代史学术研讨会，2024 年 10 月 17-20 日. Fudan University, Shanghai; 2024.Abstract

This paper discusses the so-called Bakers’ Strike Edict from Ephesus (Ephesos 231 = IK 12.215 p. 27) in light of recent studies on the Roman imperial toolkit to build empire-wide communities. Clifford Ando and Myles Lavan argued that Roman emperors in the first two centuries CE were consciously blurring distinctions between Roman and non-Roman populations, so that there could be a shared sense of an empire-wide community among people in the provinces. This paper argues that, in addition to Lavan’s observations, gubernatorial edicts also show concerns and measures that sought to communicate a sense of the communal at the local level. While the focus of discussion is on the edict responding to a bakers’ strike at Ephesus, several other examples from a corpus of gubernatorial edicts are also used in connection with this example. This paper hopes to contribute to Ando’s and Lavan’s arguments by pointing to a lower register of community building visible in gubernatorial edicts. The governors’ concerns for and efforts to creating communal cohesion and their need to balance parallel and at times competing “common goods” not only adds another nuance to the grander community building project at the imperial level, but demonstrates further complications on how praesidial governors – and in particular proconsuls – can and would react to difficult issues at the local level.

Guo R, Qu L, Niu D, Qi Y, Yue W, Shi J, Xing B, Ying X. Open-Vocabulary Audio-Visual Semantic Segmentation, in Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November 2024. ACM; 2024:7533–7541. 访问链接

Yu Z, Zhang C, Wang Y, Tang W, Wang J, Ma L. Predict and Interpret Health Risk Using Ehr Through Typical Patients, in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024).; 2024.

Zhihao Y, Xu C, Yujie J, Yasha W, Junfeng Z. Predict and Interpret Health Risk Using Ehr Through Typical Patients, in Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024).; 2024.

Xu Yongxin, Jiang Xinke, Xu， C, Yuzhen， X, Zhang Chaohe, Ding Hongxin, Junfeng Z, Yasha W, Bing X. ProtoMix: Augmenting Health Status Representation Learning via Prototype-based Mixup, in the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024).; 2024.

Qiu Y, Ma Y, Wu M, Jia Y, Qu X, Zhou Z, Lou J, Jia T, Ye L, HUANG R. Quartet: A 22nm 0.09mJ/inference digital compute-in-memory versatile AI accelerator with heterogeneous tensor engines and off-chip-less dataflow, in IEEE Custom Integrated Circuit Conference (CICC).; 2024.

Zhou Y, Huang W, Zhu R, HUANG R, Tang K. A Reliable 2 bit MLC FeFET with High Uniformity and 109 Endurance by Gate Stack and Write Pulse Co-optimization, in 2024 IEEE European Solid-State Electronics Research Conference (ESSERC).; 2024:657-660.

Wu M, Ren W, Chen P, Zhao W, Jing Y, Ru J, Wang Z, Ma Y, HUANG R, Jia T, et al. S2D-CIM: A 22nm 128Kb systolic digital compute-in-memory macro with domino data path for flexible vector operation and 2-D weight update in edge AI applications, in IEEE Custom Integrated Circuit Conference (CICC).; 2024.

Wang B, Xu X, Zhang Z, Zhu H, Yan Y, Wu X, Chen J*. Self-supervised speech representation and contextual text embedding for match-mismatch classification with EEG recording, in arXiv; 2024. 访问链接

Wang B, Xu X, Zhang L, Xiao B, Wu X, Chen J*. Semantic Reconstruction of Continuous Language from MEG Signals, in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).; 2024:2190–2194. 访问链接

Shi R, Pang Q, Ma L, Duan L, Huang T-J, Jiang T. ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation, in Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 - 27th International Conference, Marrakesh, Morocco, October 6-10, 2024, Proceedings, Part XII.Vol 15012. Springer; 2024:731–741. 访问链接

Tang F, Wang Z, Cheng Y. Simultaneous Parameter and State Estimation with Extended Kalman Filter for Dynamic Parameters, in 2024 IEEE MTT-S International Wireless Symposium (IWS).; 2024:1-3.

Li M, Zhi Q, Dong Y, Ye L, Jia T. SPARK: An Efficient Hybrid Acceleration Architecture with Run-Time Sparsity-Aware Scheduling for TinyML Learning, in Design Automation Conference (DAC).; 2024.

Yuan Z, Gao S, Wu X, Qu T. Spatial Covariant Matrix based Learning for DOA Estimationin Spherical Harmonics Domain, in the AES 156th Convention. Madrid, Spain; 2024:10701.Abstract

Direction of arrival (DoA) estimation in complex environments is a challenging task. The traditional methods suffer from invalidity under low signal-to-noise ratio (SNR) and reverberation conditions, and the data-driven methods lack of generalization to unseen data types. In this paper we propose a robust DoA estimation approach by combining the two methods above. To focus on spatial information modeling, the proposed method directly uses the compressed covariance matrix of the first-order ambisonics (FOA) signal as input, while only white noise is used during training. To adapt to different characteristics of FOA signals in different frequency bands, our method estimates DoA in different frequency bands by particular models, and the subband results are finally integrated together. Experiments are carried out on both simulated and measured datasets, and the results show the superiority of the proposed method than existing baselines under complex conditions and the scalability for unseen data types.

Yue W, Ying X, Guo R, Chen DD, Shi J, Xing B, Zhu Y, Chen T. Sub-Adjacent Transformer: Improving Time Series Anomaly Detection with Reconstruction Error from Sub-Adjacent Neighborhoods, in Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI 2024, Jeju, South Korea, August 3-9, 2024. ijcai.org; 2024:2524–2532. 访问链接

Pages