I am a stupid PhD from the Department of Computer Science and Technology, Beijing Institute of Technology (BIT, 北京理工大学计算机学院), under the supervision of Prof. Chi Harold Liu. I received both my Bachelor’s and Doctoral degrees from BIT (Software Engineering: 2015-2019 / Computer Science and Technology: 2019-2025).

My research interests include mobile crowdsensing and unmanned vehicles, with a focus on 5G communication modeling and reinforcement learning techniques. I have published more than 10 papers at top international conferences (e.g., KDD, ICDE, CoRL, INFOCOM) and in journals (e.g., TKDE, TMC). I am also one of the Student Liaisons of the RLChina committee.

Contact: I’m always glad to discuss or collaborate! If interested, feel free to email me at daizipeng1997@gmail.com.

🔥 News

2025.06: 🎉🎉 Congratulations to Zihao for open-sourcing UavNetSim-v1 — a comprehensive, Python-based simulation platform for UAV networks. It models key layers including the network, MAC, and physical layers, along with UAV mobility and energy dynamics. The platform is highly extensible for beginners in the networking and communications community, empowering users to design and test custom protocols for a wide range of applications.
2025.05: 🤠🤠 My Google Scholar citations have exceeded 500!

📝 Publications

🙋‍♂️ Mobile Crowdsensing

INFOCOM 2022

AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning
Zipeng Dai, Chi Harold Liu, Yuxiao Ye, Rui Han, Ye Yuan, Guoren Wang, Jian Tang INFOCOM 2022 (Oral)

Project

IEEE JSAC QoI-Aware Mobile Crowdsensing for Metaverse by Multi-Agent Deep Reinforcement Learning, Yuxiao Ye, Hao Wang, Chi Harold Liu, Zipeng Dai, Guozheng Li, Guoren Wang, Jian Tang
ICDE 2023 (Oral) Exploring both Individuality and Cooperation for Air-Ground Spatial Crowdsourcing by Multi-Agent Deep Reinforcement Learning, Yuxiao Ye, Chi Harold Liu, Zipeng Dai, Jianxin Zhao, Ye Yuan, Guoren Wang, Jian Tang
IEEE TMC Delay-Sensitive Energy-Efficient UAV Crowdsensing by Deep Reinforcement Learning, Zipeng Dai, Chi Harold Liu, Rui Han, Guoren Wang, Kin K. Leung, Jian Tang
KDD 2021 (Best Paper Award Runner-Up) Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning, Hao Wang, Chi Harold Liu, Zipeng Dai, Jian Tang, Guoren Wang (Applied Data Science Track)
INFOCOM 2021 (Oral) Mobile Crowdsensing for Data Freshness: A Deep Reinforcement Learning Approach, Zipeng Dai, Hao Wang, Chi Harold Liu, Rui Han, Jian Tang, Guoren Wang
ICDE 2020 (Oral) Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach, Chi Harold Liu, Yinuo Zhao, Zipeng Dai, Ye Yuan, Guoren Wang, Dapeng Wu, Kin K. Leung
INFOCOM 2020 (Oral) Multi-Task-Oriented Vehicular Crowdsensing: A Deep Learning Approach, Chi Harold Liu, Zipeng Dai, Haoming Yang, Jian Tang
IEEE TMC Distributed and Energy-Efficient Mobile Crowdsensing with Charging Stations by Deep Reinforcement Learning, Chi Harold Liu, Zipeng Dai, Yinuo Zhao, Jon Crowcroft, Dapeng Wu, Kin K. Leung

🚖 Unmanned Vehicles

ICCC 2025 UavNetSim-v1: A Python-based Simulation Platform for UAV Communication Networks, Zihao Zhou, Zipeng Dai, Linyi Huang, Cui Yang, Youjun Xiang, Jie Tang, Kai-kit Wong
CoRL 2022 Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System, Zipeng Dai, Tianze Zhou, Kun Shao, David Henry Mguni, Bin Wang, Jianye Hao

🎟 Others

Neurocomputing Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation, Chenxu Wang, Yonggang Jin, Cheng Hu, Youpeng Zhao, Zipeng Dai, Jian Zhao, Liuyu Xiang, Junge Zhang, Zhaofeng He
AAMAS 2025 Taming multi-agent reinforcement learning with estimator variance reduction, Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew E. Taylor, Kun Shao, Jun Wang, David Mguni
IEEE Transactions on Games CuDA2: An Approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems, Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao
IEEE Transactions on Computers HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning, Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang
IEEE Transactions on Games Cooperative Multi-Agent Transfer Learning with Coalition Pattern Decomposition, Tianze Zhou, Fubiao Zhang, Kun Shao, Zipeng Dai, Kai Li, Wenhan Huang, Weixun Wang, Bin Wang, Dong Li, Wulong Liu, Jianye Hao
ICLR 2023 Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints, David Mguni, Aivar Sootla, Juliusz Krzysztof Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang
AAAI 2023 Learning to Shape Rewards using a Game of Two Partners, David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Wenbin Song, Feifei Tong, Matthew Taylor, Tianpei Yang, Zipeng Dai, Hui Chen, Jiangcheng Zhu, Kun Shao, Jun Wang, Yaodong Yang
IEEE TKDE Time-Aware Location Prediction by Convolutional Area-of-Interest Modeling and Memory-Augmented Attentive LSTM, Chi Harold Liu, Yu Wang, Chengzhe Piao, Zipeng Dai, Ye Yuan, Guoren Wang, Dapeng Oliver Wu

🎖 Honors and Awards

2024.05 The 2nd Prize of the AgileX Sim2Real Challenge in ICRA 2024
2023.12 The 7th Place in the First Low-Altitude Economy Intelligent Flight Management Challenge, Meituan, Shenzhen
2022.10 National Scholarship (Top 1%)
2022.05 Student Conference Grant of INFOCOM 2022 (50 worldwide each year)
2021.08 Best Paper Award Runner-Up, Applied Data Science Track, KDD 2021 (1/238)
2020.10 National Scholarship (Top 1%)
2019.09 Outstanding Thesis Award (Undergraduate) in Beijing (only 3 people per college)
2017.08 The 3rd Place of the Supermarket Robot track (@Home) in RoboCup China 2017
2016.12 Future Star of Software Engineering, Beijing Institute of Technology (10/176)

📖 Education

2019.06 - 2025.09, PhD, Beijing Institute of Technology, Beijing.
2015.09 - 2019.06, Undergraduate, Beijing Institute of Technology, Beijing.
2012.09 - 2015.06, Yaohua High School, Tianjin.

💬 Invited Talks

2023.01.17, Interactive Decision-Making for Unmanned Vehicles Based on Social Value Orientation, RLChina Paper Seminar (No. 37), Online.
2022.12.17, Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System, DAI 2022, Tianjin.
2022.07.26, Zero-Shot Coordination: A Survey about Human-AI Collaboration, Huawei Noah’s Ark Lab internal discussion, Beijing.

💻 Internships

2025.07 - 2025.09, COWAROBOT, Shanghai. (Mentor: Pai Peng)
2024.04 - 2025.05, Polixir Technologies, Nanjing. (Mentor: Rongjun Qin)
2023.04 - 2024.04, Qiyuan Lab, Beijing. (Mentor: Chao Wang)
2021.10 - 2023.03, Huawei Noah’s Ark Lab, Beijing. (Mentor: Kun Shao)

🤝 Partner Links

Polixir Technologies (I am deeply grateful to the students in Professor Yang Yu’s team, who gave me immense emotional support during a period when I felt lost and helpless, and who inspired my direction in life. I am also happy to share my past research experience and project resources in the field of UAV networks with them.)
Chengzhe Piao (He was the most impressive senior colleague I met during my graduate studies. Before he went to UCL for a PhD in interdisciplinary medical studies, he had already demonstrated exceptional research intuition in AI for science and generative models.)
Hao Wang (He is the only genius in the history of the school to have received the highest honor scholarship during both his undergraduate and graduate studies.)
Yuxiao Ye (He was interviewed by a well-known blogger for a project that won a grand prize.)

Design and source code inspired by Yi Ren’s awesome template.

Zipeng Dai (戴子彭)