Research Output

2025

You Y, Qian Y, Qu T, Wang B, Lv X. Spherical harmonic beamforming based Ambisonics encoding and upscaling method for smartphone microphone array, in the AES 158th Convention. Warsaw, Poland; 2025:10230. Abstract

With the rapid development of virtual reality (VR) and augmented reality (AR), spatial audio recording and reproduction have gained increasing research interest. Higher Order Ambisonics (HOA) stands out for its adaptability to various playback devices and its ability to integrate head orientation. However, current HOA recordings often rely on bulky spherical microphone arrays (SMA), and portable devices like smartphones are limited by array configuration and number of microphones. We propose SHB-AE, a spherical harmonic beamforming based method for Ambisonics encoding using a smartphone microphone array (SPMA). By designing beamformers for each order of spherical harmonic functions based on the array manifold, the method enables Ambisonics encoding and up-scaling. Validation on a real SPMA and its simulated free-field counterpart in noisy and reverberant conditions showed that the method successfully encodes and up-scales Ambisonics up to the fourth order with just four irregularly arranged microphones.

Tang E, Gao M, You W. Structural transformation and the urban growth shadows: County-level evidence from China, 1990–2020. Regional Science and Urban Economics [Internet]. 2025;115:104141. Full-text link DOI: 10.1016/j.regsciurbeco.2025.104141 Abstract

This paper investigates whether a location's growth benefits or suffers from proximity to a big city and explores the underlying mechanisms. Using county-level data from China for 1990–2020, we find that an area's being close to a big city (in the 150–250 km range) reduces its decadal population growth rate by 2.9–3.6 percentage points relative to areas beyond 250 km, which we call the urban growth shadow effect. Initial agricultural employment share has the strongest power to explain whether the negative effect exists. The mechanism is consistent with lower opportunity costs of migration for people employed in agriculture, yet contrasts with core–periphery models that give transport costs a central role. Notably, this effect exhibits a temporal trend. Over time, being proximate to a big city becomes increasingly beneficial.

Zhang; BX;H. A study of the effect of multimodal input on vocabulary acquisition: Evidence from online Chinese language learners. Language Teaching Research [Internet]. 2025. Access link Abstract

In response to the growing prevalence of online second language learning and the burgeoning field of international Chinese language education, this study examines the impact of multimodal inputs (MMI) on vocabulary acquisition within online environments among learners of Chinese as a second language (CSL). A teaching intervention was conducted with 90 Mongolian CSL learners, who were grouped into audiovisual, audio, and visual groups. The findings indicate that the audiovisual condition significantly improved vocabulary retention compared to the single-modality conditions in a delayed post-test. Nevertheless, the efficacy of the MMI treatment was observed to vary with learners’ proficiency levels, with beginner-level CSL learners deriving greater benefit from MMI than intermediate-level learners. Furthermore, participants expressed both favorable and critical perspectives regarding the application of MMI in vocabulary instruction. These results highlight the potential of MMI interventions to enhance vocabulary learning in online second-language education, while also underscoring the necessity of considering learners’ target language proficiency and their attitudes when developing MMI-based instructional approaches.

Gong L, Tang Z, Guo H, Tang R, Qu B, Yu W, Chen Z, Xiao L. Sub-Second Long Lifetime Triplet Exciton Reservoir as Assistant Host for Highly Efficient and Stable Organic Light-Emitting Diode. ADVANCED FUNCTIONAL MATERIALS. 2025.

Huang X, Peng W, Zhao A, Ou Y, Kennedy S, Iyer G, McJeon H, Cui R, Hultman N. Substantial air quality and health co-benefits from combined federal and subnational climate actions in the United States. One Earth [Internet]. 2025;8(3). [Link]

Wang Y, Jiang T, Yan W. Suddenly enlightened: awe promotes wise reasoning via self-transcendence. The Journal of Positive Psychology [Internet]. 2025. Access link Abstract

Awe, a self-transcendent emotion, has been theoretically posited as a precursor to wise reasoning. However, direct empirical evidence supporting this relationship and the underlying mechanism has been limited. In four studies (N = 3700), we examined the relationship between awe and wise reasoning, as well as the mediating effect of self-transcendence, employing cross-sectional, longitudinal, and experimental designs. We consistently found that awe had a lagged effect on (Study 1), enhanced (Studies 2 & 3), and was associated with (Study 4) wise reasoning. Furthermore, self-transcendence mediated this relationship (Studies 3 & 4). The impact of awe on wise reasoning and mediating effect of self-transcendence could not solely be attributed to awe’s predominantly positive nature, and the mediation model was established beyond the influence of self-smallness (Studies 3–4). These findings contribute to understanding the emotional trigger of wise reasoning, the cognitive implications of awe, and its role in promoting wise conflict resolution.

Chen, A. ZJLCLYJM. A systematic review and meta-analysis of AI-enabled assessment in language learning: Design, implementation, and effectiveness. Journal of Computer Assisted Learning [Internet]. 2025;41(1):e13064. Access link

Hu F, Truong TT, Xie J. Tate's question, Standard conjecture D, semisimplicity and Dynamical degree comparison conjecture. 2025.

You Y, Wu X, Qu T. TA-V2A: Textually Assisted Video-to-Audio Generation, in International Conference on Acoustics, Speech and Signal Processing (ICASSP). Hyderabad, India; 2025:1-5. Abstract

As artificial intelligence-generated content (AIGC) continues to evolve, video-to-audio (V2A) generation has emerged as a key area with promising applications in multimedia editing, augmented reality, and automated content creation. While Transformer and Diffusion models have advanced audio generation, a significant challenge persists in extracting precise semantic information from videos, as current models often lose sequential context by relying solely on frame-based features. To address this, we present TA-V2A, a method that integrates language, audio, and video features to improve semantic representation in latent space. By incorporating large language models for enhanced video comprehension, our approach leverages text guidance to enrich semantic expression. Our diffusion model-based system utilizes automated text modulation to enhance inference quality and efficiency, providing personalized control through text-guided interfaces. This integration enhances semantic expression while ensuring temporal alignment, leading to more accurate and coherent video-to-audio generation.

Huang Z, Yang Y, Sheng D, Li H, Wang Y, Sun Z, Li M, WANG R, HUANG R, Cheng Z. Thermal Conductivity of Cubic Silicon Carbide Single Crystals Heavily Doped by Nitrogen. arXiv preprint arXiv:2409.18843. 2025.

Liu Z, Qiao L, Chu X, Ma L, Jiang T. Towards Efficient Foundation Model for Zero-shot Amodal Segmentation, in IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, June 11-15. Nashville, TN, USA: Computer Vision Foundation / IEEE; 2025:20254–20264. Access link

and Author) YZ (FC. Transforming traditions into academic resources: A study of Chinese scholars in the humanities and social sciences Shen Y. Higher Education [Internet]. 2025;89:1619-1635. Access link Abstract

The asymmetrical global higher education and knowledge systems ordered by Euro–American hegemony have been increasingly interrogated, especially by scholars in the humanities and social sciences (HSS). With gathering awareness, growing HSS scholars from non-Western backgrounds have called for global intellectual pluriversality. Responding to such a trend, this article sheds new light on the status quo of East Asian and other non-Euro–American intellectual traditions by taking Chinese intellectual traditions as a case. Since the nineteenth century, generations of Chinese intellectuals have strived to transform their intellectual traditions into modern resources. This historical mission has been carried on by contemporary scholars and become even more complex in the current global era. By unpacking the real perceptions and recent experiences of Chinese HSS scholars, this study demonstrates that Chinese intellectual traditions deeply influence today’s knowledge production and have been transformed into three kinds of academic resources: approaches, methodologies/paradigms, and theories. However, the transformation process has never been smooth. Domestically, the great endeavours of Chinese HSS scholars are often impeded by the dominant intellectual extraversion and coercive audit culture; internationally, they feel constrained by epistemic injustice. This article proposes an empirical approach to examining and presenting intellectual traditions in the individual experiences of scholars. It reveals the high complexities of navigating through asymmetrical globalisation to achieve intellectual pluriversality.

Long G, Zeng H, Pan M, Duan W, Huang H. Two-terminal Electrical Detection of the Néel Vector via Longitudinal Antiferromagnetic Nonreciprocal Transport. Nano Lett. [Internet]. 2025. Access link Abstract

https://arxiv.org/abs/2505.15016

Yang W, Huang* H. Unified Multipole Bott Indices for Non-Hermitian Skin Effect in Different Orders. Phys. Rev. B [Internet]. 2025;111:155121. Access link

Gao M, Wei Z, Xiang H. A Unified Theory of China's Three-Pillar Pension System. Social Sciences in China (English edition of 《中国社会科学》) [Internet]. 2025;46(2):80-98. Full-text link DOI: 10.1080/02529203.2025.2555765 Abstract

This paper develops a unified theory integrating the three pillars of the pension system—public, occupational, and private pensions—within a heterogeneous-agent overlapping generations (OLG) model. By incorporating income heterogeneity and institutional features unique to each pillar, the model captures how individuals across the income distribution participate in the pension system and derive utility. We characterize the distinct yet interactive roles of each pillar in providing risk sharing and retirement security and identify fundamental trade-offs in pension design. Our model provides a laboratory for analyzing the coordination of the three pillars that aims at enhancing equity and fiscal sustainability.

Zhang T, ZHONG Y*, YANG Y, WANG Z, ZHANG Z, WANG Y*. UniPRE: An SNN-ANN Accelerator with Unified Max-pooling Prediction and Redundancy Elimination. IEEE Transactions on Circuits and Systems II: Brief Paper [Internet]. 2025;72(8):1088-1092. Access link

Chen, AX; Xiang ZJSLGFMJJ. Unpacking help-seeking process through multimodal learning analytics: A comparative study of ChatGPT vs Human expert. Computers & Education [Internet]. 2025;(226). Access link

Yan W, Zhang X, Wang Y, Peng K, Ma Y. Unraveling the relationship between teachers’ and students’ mental health: A one-to-one matched analysis. The Journal of Experimental Education [Internet]. 2025;93(1):136-148. Access link Abstract

This study aims to identify the associations between teacher mental health and student mental health. Cross-sectional data were collected from 127,877 students aged 9–20 years and 2,759 teachers across 31 provinces in China. The mental health of students and teachers was assessed by well-being (life satisfaction and positive mental health) and psychological distress (depression and anxiety). Controlling for demographic variables, multilevel regression analyses suggest that higher teacher positive mental health was linked to higher student positive mental health and lower student depression; higher teacher depression was correlated with higher student depression; and teacher life satisfaction and anxiety were not correlated with any indicators of student mental health. The study highlights the significant association between teacher mental health and student mental health.

Dang Q, Li G. Unveiling trust in AI: the interplay of antecedents, consequences, and cultural dynamics. AI & SOCIETY [Internet]. 2025:1-24. Access link Abstract

Trust in artificial intelligence (AI) has become a central issue due to the opacity and unpredictability of AI decision-making processes. However, existing studies often produce inconsistent results and fail to provide a unified understanding of the underlying factors, making a comprehensive review necessary. To address this gap, we conducted a systematic review of 562 empirical studies to explore the antecedents and consequences of human trust in AI. The review identified key antecedents of trust, including AI capability, anthropomorphism, individual factors, and explainability, and associated consequences, such as behavioral intention, attitude, and acceptance. A cross-cultural analysis revealed significant differences in how cultural contexts influence the perception and prioritization of factors, such as capability, transparency, and anthropomorphism. These findings emphasize the need for a multidimensional approach to developing trustworthy AI systems, highlighting the importance of cultural sensitivity in AI design. The review also suggests several promising avenues for future research, including dynamic trust formation, reciprocal trust relationships, and the evolution of trust over time. Addressing these areas will enhance our understanding of trust in AI and contribute to the development of culturally adapted and ethically sound AI technologies.

Li P, Zhu R, McJeon H, Byers E, Zhou P, Ou Y. Using deep learning to generate key variables in global mitigation scenarios. Nature Climate Change [Internet]. 2025;15:760–768. [Link]
