Mingxuan Song

About me

I am currently a PhD student at the School of Computer Science, Peking University. I received my bachelor's degree in computer science and technology from China University of Geosciences, Wuhan, in 2023. My research interests include reinforcement learning (RL), sharding blockchain, and large language models (LLMs). Feel free to contact me!

I enjoy playing billiards, badminton, table tennis, and also love running. My goal is to continuously grow both professionally and personally, while maintaining a healthy and fulfilling lifestyle.

📍 Location: Beijing, China

✉️ Email: songmingxuan@stu.pku.edu.cn

🎯 Hobbies: 🎱 Billiards | 🏸 Badminton | 🏓 Table Tennis | 🏃‍♂️ Running

View Resume

📚 Publications

WWW 2025 Oral CCF-A Mingxuan Song, Pengze Li, Bohan Zhou, Shenglin Yin, Zhen Xiao, Jieyi Long. "AERO: Enhancing Sharding Blockchain via Deep Reinforcement Learning for Account Migration." Proceedings of the Web Conference, May 2025. [PDF]

CVPR 2024 CCF-A Shenglin Yin, Zhen Xiao, Mingxuan Song, and Jieyi Long. "Adversarial Distillation Based on Slack Matching and Attribution Region Alignment." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 2024. [PDF]

WWW 2024 Oral CCF-A Pengze Li, Mingxuan Song, Mingzhe Xing, Zhen Xiao, Qiuyu Ding, Shengjie Guan, and Jieyi Long. "SPRING: Improving the Throughput of Sharding Blockchain via Deep Reinforcement Learning Based State Placement." In Proceedings of the Web Conference, May 2024. [PDF]

Sensors 2022 JCR Q1 Mingxuan Song, Chengyu Hu, Wenyin Gong, Xuesong Yan. "Domain Knowledge-Based Evolutionary Reinforcement Learning for Sensor Placement." Sensors 2022. [PDF]

📰 News

Joint Introductory Computing Tutorial Successfully Held by Ten Institutes, Peking University. [Report] December 23, 2024 [Preview] December 12, 2024

Outstanding Undergraduate Graduates of the Class of 2023, China University of Geosciences (Wuhan). [Link] June 19, 2023

Public List of Peking University 2023 Recommended Exempted Postgraduate Candidates, Peking University. [Link] November 1, 2022

The Second Professional Vertical Communication Forum of School of Computer Science, China University of Geosciences (Wuhan). [Link] October 3, 2022

2022 Summer Camp for Outstanding Students, School of Computer Science, Peking University. [Admission Result] September 30, 2022 [List of Outstanding Participants] July 8, 2022 [Admission Notice] June 28, 2022

The Second Professional Vertical Communication Forum of the School of Computer Science, School of Computer Science, China University of Geosciences (Wuhan). [Link] September 27, 2022

Class 111003 Senior Scholarship, School of Computer Science, China University of Geosciences (Wuhan). [Link] June 21, 2022

Award List of the 2021 National Undergraduate Mathematical Contest in Modeling (MCM Cup). [Link] November 15, 2021

National College Student Innovation Training Program Platform. [Link] August 24, 2021

Public List of National Scholarship Candidates for Undergraduates (2019-2020 Academic Year), China University of Geosciences (Wuhan). [Link] November 2, 2020

🌟 Projects

Few-Shot RL Fine-Tuning for LLMs

Affiliations:
School of Computer Science, Peking University;
Alimama, Alibaba Group.

In recent years, Large Language Models (LLMs) have demonstrated remarkable performance across a variety of natural language processing tasks. However, fine-tuning these models typically requires large-scale datasets and extensive computational resources, which limits their applicability in scenarios where data is scarce and budgets are constrained. This work explores a novel approach to few-shot reinforcement learning (RL) fine-tuning for LLMs, aiming to adapt pre-trained models to specific tasks using minimal supervision.

See Live Source Code

RL for Efficient Sharding Blockchain

Affiliations:
School of Computer Science, Peking University;
Theta Labs, Theta Inc.

Sharding blockchain systems face critical challenges in achieving efficient cross-shard data distribution and maintaining balanced workload across shards. Traditional address allocation methods often suffer from high latency and uneven shard utilization, especially when dealing with dynamically changing transaction patterns and reconfiguration events.

See Live Source Code

RL for Advanced NFT Auction System

Affiliations:
School of Computer Science, Peking University;
Theta Labs, Theta Inc.

In the Web 3.0 era, NFTs have become a popular asset type, and the auction system is a crucial component of the NFT ecosystem. However, traditional auction systems often suffer from low throughput and high latency, especially when dealing with high-demand auctions. To address these challenges, we propose a hierarchical reinforcement learning approach to optimize dynamic bidding strategies, enabling more efficient and responsive NFT auctions.

See Live Source Code

Hi, my name is Mingxuan Song
I'm currently pursuing my PhD at Peking University, under the supervision of Researcher Zhen Xiao.

About me

📚 Publications

📰 News

🌟 Projects

Few-Shot RL Fine-Tuning for LLMs

RL for Efficient Sharding Blockchain

RL for Advanced NFT Auction System

Contact

Hi, my name is Mingxuan Song I'm currently pursuing my PhD at Peking University, under the supervision of Researcher Zhen Xiao.

About me

📚 Publications

📰 News

🌟 Projects

Few-Shot RL Fine-Tuning for LLMs

RL for Efficient Sharding Blockchain

RL for Advanced NFT Auction System

Contact

Hi, my name is Mingxuan Song
I'm currently pursuing my PhD at Peking University, under the supervision of Researcher Zhen Xiao.