I am a final-year PhD student in the Department of Computer Science and Technology, Beijing Institute of Technology (BIT, 北京理工大学计算机学院), under the supervision of Prof. Chi Harold Liu. I received my Bachelor’s degree in Software Engineering from BIT (2015-2019).
My research interests include mobile crowdsensing and unmanned vehicles, with consideration of 5G communication modeling and reinforcement learning techniques. I have published more than 10 papers at top international conferences (e.g., KDD, ICDE, CoRL, INFOCOM) and in journals (e.g., TKDE, TMC). I am also one of the Student Liaisons of the RLChina committee.
Contact: I’m always glad to discuss or collaborate! If interested, feel free to email me at daizipeng1997@gmail.com.
🔥 News
- 2025.06: 🎉🎉 Congratulations to Zihao for open-sourcing UavNetSim-v1 — a comprehensive, Python-based simulation platform for UAV networks. It models key layers including the network, MAC, and physical layers, along with UAV mobility and energy dynamics. The platform is highly extensible for beginners in the networking and communications community, empowering users to design and test custom protocols for a wide range of applications.
- 2025.05: 🤠🤠 My Google Scholar citations have exceeded 500!
📝 Publications
🙋♂️ Mobile Crowdsensing

AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning
Zipeng Dai, Chi Harold Liu, Yuxiao Ye, Rui Han, Ye Yuan, Guoren Wang, Jian Tang
INFOCOM 2022 (Oral)
- IEEE JSAC: QoI-Aware Mobile Crowdsensing for Metaverse by Multi-Agent Deep Reinforcement Learning, Yuxiao Ye, Hao Wang, Chi Harold Liu, Zipeng Dai, Guozheng Li, Guoren Wang, Jian Tang
- ICDE 2023 (Oral): Exploring both Individuality and Cooperation for Air-Ground Spatial Crowdsourcing by Multi-Agent Deep Reinforcement Learning, Yuxiao Ye, Chi Harold Liu, Zipeng Dai, Jianxin Zhao, Ye Yuan, Guoren Wang, Jian Tang
- IEEE TMC: Delay-Sensitive Energy-Efficient UAV Crowdsensing by Deep Reinforcement Learning, Zipeng Dai, Chi Harold Liu, Rui Han, Guoren Wang, Kin K. Leung, Jian Tang
- KDD 2021 (Best Paper Award Runner-Up): Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning, Hao Wang, Chi Harold Liu, Zipeng Dai, Jian Tang, Guoren Wang (Applied Data Science Track)
- INFOCOM 2021 (Oral): Mobile Crowdsensing for Data Freshness: A Deep Reinforcement Learning Approach, Zipeng Dai, Hao Wang, Chi Harold Liu, Rui Han, Jian Tang, Guoren Wang
- ICDE 2020 (Oral): Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach, Chi Harold Liu, Yinuo Zhao, Zipeng Dai, Ye Yuan, Guoren Wang, Dapeng Wu, Kin K. Leung
- INFOCOM 2020 (Oral): Multi-Task-Oriented Vehicular Crowdsensing: A Deep Learning Approach, Chi Harold Liu, Zipeng Dai, Haoming Yang, Jian Tang
- IEEE TMC: Distributed and Energy-Efficient Mobile Crowdsensing with Charging Stations by Deep Reinforcement Learning, Chi Harold Liu, Zipeng Dai, Yinuo Zhao, Jon Crowcroft, Dapeng Wu, Kin K. Leung
🚖 Unmanned Vehicles
- ICCC 2025: UavNetSim-v1: A Python-based Simulation Platform for UAV Communication Networks, Zihao Zhou, Zipeng Dai, Linyi Huang, Cui Yang, Youjun Xiang, Jie Tang, Kai-kit Wong
- CoRL 2022: Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System, Zipeng Dai, Tianze Zhou, Kun Shao, David Henry Mguni, Bin Wang, Jianye Hao
🎟 Others
- Neurocomputing: Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation, Chenxu Wang, Yonggang Jin, Cheng Hu, Youpeng Zhao, Zipeng Dai, Jian Zhao, Liuyu Xiang, Junge Zhang, Zhaofeng He
- AAMAS 2025: Taming multi-agent reinforcement learning with estimator variance reduction, Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew E. Taylor, Kun Shao, Jun Wang, David Mguni
- IEEE Transactions on Games: CuDA2: An Approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems, Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao
- IEEE Transactions on Computers: HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning, Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang
- IEEE Transactions on Games: Cooperative Multi-Agent Transfer Learning with Coalition Pattern Decomposition, Tianze Zhou, Fubiao Zhang, Kun Shao, Zipeng Dai, Kai Li, Wenhan Huang, Weixun Wang, Bin Wang, Dong Li, Wulong Liu, Jianye Hao
- ICLR 2023: Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints, David Mguni, Aivar Sootla, Juliusz Krzysztof Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang
- AAAI 2023: Learning to Shape Rewards using a Game of Two Partners, David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Wenbin Song, Feifei Tong, Matthew Taylor, Tianpei Yang, Zipeng Dai, Hui Chen, Jiangcheng Zhu, Kun Shao, Jun Wang, Yaodong Yang
- IEEE TKDE: Time-Aware Location Prediction by Convolutional Area-of-Interest Modeling and Memory-Augmented Attentive LSTM, Chi Harold Liu, Yu Wang, Chengzhe Piao, Zipeng Dai, Ye Yuan, Guoren Wang, Dapeng Oliver Wu
🎖 Honors and Awards
- 2024.05 2nd Prize in the AgileX Sim2Real Challenge at ICRA 2024
- 2023.12 7th Place in the First Low-Altitude Economy Intelligent Flight Management Challenge at Meituan, Shenzhen
- 2022.10 National Scholarship (Top 1%)
- 2022.05 Student Conference Grant of INFOCOM 2022 (50 worldwide each year)
- 2021.08 Best Paper Award Runner-Up, Applied Data Science Track, KDD 2021 (1/238)
- 2020.10 National Scholarship (Top 1%)
- 2019.09 Outstanding Thesis Award (Undergraduate) in Beijing (only 3 per college)
- 2017.08 3rd Place in the Supermarket Robot track (@Home) at RoboCup China 2017
- 2016.12 Future Star of Software Engineering, Beijing Institute of Technology (10/176)
📖 Education
- 2019.06 - Now, PhD, Beijing Institute of Technology, Beijing.
- 2015.09 - 2019.06, Undergraduate, Beijing Institute of Technology, Beijing.
- 2012.09 - 2015.06, Yaohua High School, Tianjin.
💬 Invited Talks
- 2023.01.17, Interactive Decision-Making for Unmanned Vehicles Based on Social Value Orientation, RLChina Paper Seminar (Session 37), Online.
- 2022.12.17, Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System, DAI 2022, Tianjin.
- 2022.07.26, Zero-Shot Coordination: A Survey about Human-AI Collaboration, Huawei Noah’s Ark Lab internal discussion, Beijing.
💻 Internships
- 2024.04 - 2025.05, Polixir Technologies, Nanjing. (Mentor: Rongjun Qin)
- 2023.04 - 2024.04, Qiyuan Lab, Beijing. (Mentor: Chao Wang)
- 2021.10 - 2023.03, Huawei Noah’s Ark Lab, Beijing. (Mentor: Kun Shao)
🤝 Partner Links
- Polixir Technologies (I am deeply grateful to the students in Professor Yang Yu’s team, who gave me immense emotional support during a period when I felt lost and helpless, and who inspired my direction in life. I am also willing to share my past research experience and project resources in the field of UAV networks with them.)
- Chengzhe Piao (He was the most impressive senior colleague I met during my graduate studies. Before he went to UCL for a PhD in interdisciplinary medical studies, he had already demonstrated exceptional research intuition in AI for science and generative models.)
- Hao Wang (He is the only genius in the history of the school to have received the highest honor scholarship during both his undergraduate and graduate studies.)
- Yuxiao Ye (He was interviewed by a well-known blogger for a project that won a grand prize.)
Design and source code inspired by Yi Ren’s awesome template.