I’m a Ph.D. candidate in Department of Civil and Environmental Engineering and Center for Scientific Computation at The Hong Kong University of Science and Technology, under the supervision of Prof. Sen Li. My current research primarily focuses on Deep Reinforcement Learning, Intelligent Transportation, Smart City, and AI Ethics.
I received my B.Eng. degree from School of Computer Science and Engineering at Sun Yat-sen University in 2024, mentored by Prof. Kai Huang (Robot RL), Prof. Chengying Gao and Prof. Ning Liu (Music AI). I was also a visiting student in Shenzhen Research Institute of Big Data at The Chinese University of Hong Kong (Shenzhen) from 2023 to 2024, under the guidance of Prof. Guangxu Zhu (Deep Wireless Sensing).
Aside from academics, I’m also the vocalist, guitarist, and bassist for several metal and rock bands, including Tokamak Disruption, NEWS (personal band) and Rights of Lethe (dissolved). If you are interested in collaborating with me (whether in academia or music), please feel free to reach out!
Ph.D. in Civil Engineering (Scientific Computation), 2024~
The Hong Kong University of Science and Technology (Clearwater Bay Campus, Hong Kong)
B.Eng. in Computer Science and Technology (National Basic Subject Talent Training Plan), 2020~2024
Sun Yat-sen University (Guangzhou Campus)
Code: Python, C/C++ (CCF-CSP:320, Top 0.8%), Java, Matlab, SQL Music: Guitar, Keyboard, Bass, Ukulele
Teaching Assistant: HKUST CIVL 4640 Introduction to Smart City Economics (Undergraduate, Spring 2026)Interview: HKUST JUPAS 2025
Society Membership: IEEE Student Member, ACM Student Member, AAAI Student Membership,CCF Student MemberTPC Membership: IEEE WCNC Workshop 2024-2026, IEEE PIMRC Workshop 2024-2025, IEEE GLOBECOM Workshop 2025, IEEE/CIC ICCC Workshop 2025Technical Reviewer: IEEE TPAMI, IEEE TMC, IEEE IOTJ, IEEE WCL, MTAP, Bentham Science Book, IEEE ICASSP 2024-2026, IEEE ICME 2024-2026, IEEE WCNC 2024-2026, ICLR 2025-2026, ACL ARR 2025-2026, IEEE IJCNN 2025-2026, ICML 2026, AAAI 2026, IEEE PIMRC 2024-2025, HKSTS 2024-2025, IEEE GLOBECOM 2025, IEEE/CIC ICCC 2025, IEEE AVSS 2025, IEEE MLSP 2025, IET IRC 2025, BTR 2025, IEEE SMC 2023

Stage lighting plays an essential role in live music performances, influencing the engaging experience of both musicians and audiences. Given the high costs associated with hiring or training professional lighting engineers, Automatic Stage Lighting Control (ASLC) has gained increasing attention. However, most existing approaches only classify music into limited categories and map them to predefined light patterns, resulting in formulaic and monotonous outcomes that lack rationality. To address this issue, this paper presents an end-to-end solution that directly learns from experienced lighting engineers – Skip-BART. To the best of our knowledge, this is the first work to conceptualize ASLC as a generative task rather than merely a classification problem. Our method modifies the BART model to take audio music as input and produce light hue and value (intensity) as output, incorporating a novel skip connection mechanism to enhance the relationship between music and light within the frame grid. We validate our method through both quantitative analysis and an human evaluation, demonstrating that Skip-BART outperforms conventional rule-based methods across all evaluation metrics and shows only a limited gap compared to real lighting engineers. Specifically, our method yields a p-value of 0.72 in a statistical comparison based on human evaluations with human lighting engineers, suggesting that the proposed approach closely matches human lighting engineering performance. To support further research, we have made our self-collected dataset, code, and trained model parameters available at https://github.com/RS2002/Skip-BART.

On-demand ride-sharing platforms, such as Uber and Lyft, face the intricate real-time challenge of bundling and matching passengers—each with distinct origins and destinations—to available vehicles, all while navigating significant system uncertainties. Due to the extensive observation space arising from the large number of drivers and orders, order dispatching, though fundamentally a centralized task, is often addressed using Multi-Agent Reinforcement Learning (MARL). However, independent MARL methods fail to capture global information and exhibit poor cooperation among workers, while Centralized Training Decentralized Execution (CTDE) MARL methods suffer from the curse of dimensionality. To overcome these challenges, we propose Triple-BERT, a centralized Single Agent Reinforcement Learning (MARL) method designed specifically for large-scale order dispatching on ride-sharing platforms. Built on a variant TD3, our approach addresses the vast action space through an action decomposition strategy that breaks down the joint action probability into individual driver action probabilities. To handle the extensive observation space, we introduce a novel BERT-based network, where parameter reuse mitigates parameter growth as the number of drivers and orders increases, and the attention mechanism effectively captures the complex relationships among the large pool of driver and orders. We validate our method using a real-world ride-hailing dataset from Manhattan. Triple-BERT achieves approximately an 11.95% improvement over current state-of-the-art methods, with a 4.26% increase in served orders and a 22.25% reduction in pickup times. Our code, trained model parameters, and processed data are publicly available at the repository https://github.com/RS2002/Triple-BERT .

Channel State Information (CSI) is the cornerstone in both wireless communication and sensing systems. In wireless communication systems, CSI provides essential insights into channel conditions, enabling system optimizations like channel compensation and dynamic resource allocation. However, the high computational complexity of CSI estimation algorithms necessitates the development of fast deep learning methods for CSI prediction. In wireless sensing systems, CSI can be leveraged to infer environmental changes, facilitating various functions, including gesture recognition and people identification. Deep learning methods have demonstrated significant advantages over model-based approaches in these fine-grained CSI classification tasks, particularly when classes vary across different scenarios. However, a major challenge in training deep learning networks for wireless systems is the limited availability of data, further complicated by the diverse formats of many public datasets, which hinder integration. Additionally, collecting CSI data can be resource-intensive, requiring considerable time and manpower. To address these challenges, we propose CSI-BERT2 for CSI prediction and classification tasks, effectively utilizing limited data through a pre-training and fine-tuning approach. Building on CSI-BERT1, we enhance the model architecture by introducing an Adaptive Re-Weighting Layer (ARL) and a Multi-Layer Perceptron (MLP) to better capture sub-carrier and timestamp information, effectively addressing the permutation-invariance problem. Furthermore, we propose a Mask Prediction Model (MPM) fine-tuning method to improve the model’s adaptability for CSI prediction tasks. Experimental results demonstrate that CSI-BERT2 achieves state-of-the-art performance across all tasks with relatively fast computation speeds. To facilitate future research, we will make our code and dataset publicly available upon publication. The dataset and code are publicly available at https://github.com/RS2002/CSI-BERT2.