In this paper, the circle formation control problem is addressed for a group of cooperative underactuated fish-like robots with unknown nonlinear dynamics and disturbances. Based on reinforcement learning and cognitive consistency theory, we propose a decentralized controller that requires no knowledge of the robots' dynamics. The proposed controller transfers from simulation to reality: it is trained only in our simulation environment, and the trained controller can be deployed to real robots without any manual tuning. Simulation results confirm that the proposed model-free robust formation control method scales with the group size and outperforms other representative RL algorithms. Experiments in the real world verify the effectiveness of our RL-based approach to circle formation control.
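To make the control objective concrete, here is a toy sketch of a circle-formation error in Python. It is not the paper's learned controller; the names (circle_targets, formation_error) and the radius r and centre c are illustrative assumptions:

    import numpy as np

    def circle_targets(n, r, c=np.zeros(2)):
        # n evenly spaced target positions on a circle of radius r centred at c
        angles = 2.0 * np.pi * np.arange(n) / n
        return c + r * np.stack([np.cos(angles), np.sin(angles)], axis=1)

    def formation_error(positions, r, c=np.zeros(2)):
        # Distance of robot i to slot i under a fixed assignment; a real
        # decentralized controller would also have to agree on the
        # assignment, e.g. by ordering robots by their current polar angle.
        targets = circle_targets(len(positions), r, c)
        return np.linalg.norm(positions - targets, axis=1)

In an RL setting, a formation reward could penalize these per-robot errors at every step.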
Deep Spiking Neural Networks (SNNs) are difficult to optimize with gradient-based approaches due to their discrete binary activations and complex spatiotemporal dynamics. Considering the huge success of ResNet in deep learning, it is natural to train deep SNNs with residual learning. Previous Spiking ResNets mimic the standard residual block in ANNs and simply replace ReLU activation layers with spiking neurons, which suffers from the degradation problem and can hardly implement residual learning. In this paper, we propose the spike-element-wise (SEW) ResNet to realize residual learning in deep SNNs. We prove that SEW ResNet can easily implement identity mapping and overcome the vanishing/exploding gradient problems of Spiking ResNet. We evaluate SEW ResNet on the ImageNet, DVS Gesture, and CIFAR10-DVS datasets and show that it outperforms the state-of-the-art directly trained SNNs in both accuracy and number of time steps. Moreover, SEW ResNet achieves higher performance by simply adding more layers, providing a simple way to train deep SNNs. To the best of our knowledge, this is the first time that directly training deep SNNs with more than 100 layers has become possible. Our code is available at https://github.com/fangwei123456/Spike-Element-Wise-ResNet.
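The central idea, replacing the ANN-style addition-then-activation with an element-wise function g applied to two spike tensors, can be sketched in PyTorch. This is a minimal illustration under simplifying assumptions (a stateless surrogate-gradient threshold stands in for a stateful spiking neuron), and the class and variable names are ours, not from the released code:

    import torch
    import torch.nn as nn

    class HeavisideSpike(torch.autograd.Function):
        # Heaviside step forward; rectangular surrogate gradient backward
        # (a common simplification for training SNNs).
        @staticmethod
        def forward(ctx, v):
            ctx.save_for_backward(v)
            return (v >= 0).float()
        @staticmethod
        def backward(ctx, grad_out):
            v, = ctx.saved_tensors
            return grad_out * (v.abs() < 0.5).float()

    class SEWBlock(nn.Module):
        # Spike-element-wise residual block: spikes from the residual path
        # are combined with the input spikes by an element-wise function g
        # (ADD / AND / IAND in the paper).
        def __init__(self, channels, g='ADD'):
            super().__init__()
            self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
            self.bn1 = nn.BatchNorm2d(channels)
            self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
            self.bn2 = nn.BatchNorm2d(channels)
            self.g = g
        def forward(self, s):  # s: binary spike tensor
            out = HeavisideSpike.apply(self.bn1(self.conv1(s)) - 1.0)  # threshold 1
            out = HeavisideSpike.apply(self.bn2(self.conv2(out)) - 1.0)
            if self.g == 'ADD':
                return out + s          # identity mapping when out == 0
            if self.g == 'AND':
                return out * s          # identity mapping when out == 1
            return (1.0 - out) * s      # IAND: identity mapping when out == 0

With g = ADD, a block whose residual path emits no spikes reduces to the identity, which is what allows very deep stacks; note that ADD can output values above 1, while AND and IAND keep the output binary.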
Chen J, Wu X, Qu T. Early Reflections Based Speech Enhancement. In: 2021 4th International Conference on Information Communication and Signal Processing (ICICSP). Shanghai, China; 2021:183-187.
With frame-based cameras, capturing fast-moving scenes without blur often comes at the cost of a low signal-to-noise ratio and low contrast. Worse still, the photometric constancy that enhancement techniques rely on heavily is fragile for frames with short exposure. Event cameras can record brightness changes at extremely high temporal resolution. For low-light videos, event data are not only well suited to capturing temporal correspondences but also provide alternative observations, in the form of intensity ratios between consecutive frames, as well as exposure-invariant information. Motivated by this, we propose a low-light video enhancement method with hybrid inputs of events and frames. Specifically, a neural network is trained to establish spatiotemporal coherence between visual signals of different modalities and resolutions by constructing a correlation volume across space and time. Experimental results on synthetic and real data demonstrate the superiority of the proposed method over state-of-the-art methods.
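The correlation volume mentioned here is a standard construction (popularized by optical-flow networks such as RAFT); a minimal spatial version is sketched below in Python/PyTorch. The paper's variant additionally spans time and mixed resolutions, which this toy omits:

    import torch

    def correlation_volume(f1, f2):
        # All-pairs dot-product correlation between two feature maps.
        # f1, f2: (B, C, H, W); returns a (B, H, W, H, W) volume where
        # entry [b, i, j, k, l] scores how well pixel (i, j) of f1
        # matches pixel (k, l) of f2.
        B, C, H, W = f1.shape
        a = f1.reshape(B, C, H * W)
        b = f2.reshape(B, C, H * W)
        corr = torch.einsum('bci,bcj->bij', a, b) / C ** 0.5
        return corr.reshape(B, H, W, H, W)

Looking up high-correlation entries between features extracted from events and from frames is one way such a network can align the two modalities across time.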
The fovea, located in the centre of the retina, is specialized for high-acuity vision. Mimicking the sampling mechanism of the fovea, a retina-inspired camera, named the spiking camera, has been developed to record external information at a sampling rate of 40,000 Hz, outputting asynchronous binary spike streams. Although the temporal resolution of the visual information is improved, reconstructing the scene from spike streams remains a challenging problem. In this paper, we present a novel high-speed image reconstruction model based on the short-term plasticity (STP) mechanism of the brain. We derive the relationship between the postsynaptic potential regulated by STP and the firing frequency at each pixel. By setting up an STP model at each pixel of the spiking camera, we can infer the scene radiance from the temporal regularity of the spike stream. Moreover, we show that STP can be used to distinguish static from moving areas and thereby further enhance the reconstruction. Experimental results show that our method achieves state-of-the-art performance in both image quality and computing time.
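The kind of relationship the abstract alludes to can be illustrated with the standard Tsodyks-Markram STP model (our choice of formulation; the paper's exact variant may differ). With facilitation variable u, depression variable R, utilization U, and time constants \tau_f, \tau_d, the per-spike updates and their steady state under regular firing at rate f (so \Delta t = 1/f) are:

    u_{n+1} = u_n e^{-\Delta t/\tau_f} + U\left(1 - u_n e^{-\Delta t/\tau_f}\right)
    R_{n+1} = R_n \left(1 - u_{n+1}\right) e^{-\Delta t/\tau_d} + 1 - e^{-\Delta t/\tau_d}
    u^{*} = \frac{U}{1 - (1-U)\, e^{-1/(f\tau_f)}}, \qquad
    R^{*} = \frac{1 - e^{-1/(f\tau_d)}}{1 - (1 - u^{*})\, e^{-1/(f\tau_d)}}

Because the steady-state synaptic efficacy, proportional to u^{*} R^{*}, varies monotonically with f over the operating range, measuring it per pixel allows one to invert for the firing frequency, which in a spiking camera is roughly proportional to the scene radiance.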