π About Me
Hi! I am a second year PhD student studying at Tsinghua University starting from Fall 2024, majoring in Computer Science and Technology. I am a member of THUNLP, advised by Prof. Zhiyuan Liu. I received my bachelorβs degree with honors from Tsinghua University in June 2024. My research interests primarily lie in the field of natural language processing, with a particular focus on alignment of LLMs.
π News
- 2025.09: Β π MiniCPM-V 4.5 released at arXiv [GitHub]
- 2025.09: Β π₯ Survey of RL for LRM released at arXiv [GitHub]
- 2025.07: Β π AIR accepted by COLM 2025
- 2025.06: Β π MiniCPM4 released at arXiv [GitHub]
- 2025.05: Β π EscapeBench accepted by ACL 2025 Main [GitHub]
- 2025.05: Β π One paper accepted by ACL 2025 Findings [GitHub]
- 2025.02: Β π₯ PRIME released at arXiv [GitHub]
- 2024.05: Β π One paper accepted by ACL 2024 Main [GitHub]
- 2024.05: Β π₯ UltraFeedback accepted by ICML 2024 Poster [GitHub]
- 2023.10: Β π One paper accepted by EMNLP 2023 Main [GitHub]
- 2022.09: Β π OpenBackdoor accepted by NeurIPS Datasets & Benchmarks 2022 (Spotlight) [GitHub]
π Publications
(* denotes equal/core contribution, β denotes project lead)
- MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
MiniCPM-V Team
Preprint [GitHub 22k+ Stars] - A Survey of Reinforcement Learning for Large Reasoning Models
Kaiyan Zhang*β , Yuxin Zuo*β , Bingxiang He*, Youbang Sun*, Runze Liu*, Che Jiang*, Yuchen Fan*, Kai Tian*, Guoli Jia*, Pengfei Li*, Yu Fu*, Xingtai Lv*, Yuchen Zhang*, Sihang Zeng*, Shang Qu*, Haozhan Li*, Shijie Wang*, Yuru Wang*, Xinwei Long, Fangfu Liu, Xiang Xu, Jiaze Ma, Xuekai Zhu, Ermo Hua, Yihao Liu, Zonglin Li, Huayu Chen, Xiaoye Qu, Yafu Li, Weize Chen, Zhenzhao Yuan, Junqi Gao, Dong Li, Zhiyuan Ma, Ganqu Cui, Zhiyuan Liu, Biqing Qi, Ning Ding, Bowen Zhou
Preprint [GitHub 1.4k+ Stars] - AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Bingxiang He, Wenbin Zhang, Jiaxi Song, Cheng Qian, Zixuan Fu, Bowen Sun, Ning Ding, Haiwen Hong, Longtao Huang, Hui Xue, Ganqu Cui, Wanxiang Che, Zhiyuan Liu, Maosong Sun
COLM 2025 - MiniCPM4: Ultra-Efficient LLMs on End Devices
Preprint MiniCPM Team [GitHub 8.4k+ Stars] - Process Reinforcement through Implicit Rewards
Ganqu Cui*, Lifan Yuan*, Zefan Wang*, Hanbin Wang*, Wendi Li*, Bingxiang He*, Yuchen Fan*, Tianyu Yu*, Qixin Xu*, Weize Chen, Jiarui Yuan, Huayu Chen, Kaiyan Zhang, Xingtai Lv, Shuo Wang, Yuan Yao, Xu Han, Hao Peng, Yu Cheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou, Ning Ding
ICLR 2026 Submission [GitHub 1.7k+ Stars] - EscapeBench: Pushing Language Models to Think Outside the Box
Cheng Qian, Peixuan Han, Qinyu Luo, Bingxiang He, Xiusi Chen, Yuji Zhang, Hongyi Du, Jiarui Yao, Xiaocheng Yang, Denghui Zhang, Yunzhu Li, Heng Ji
ACL 2025 Main [GitHub] - The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning
Bingxiang He*, Ning Ding*, Cheng Qian*, Jia Deng, Ganqu Cui, Lifan Yuan, Haiwen Hong, Huan-ang Gao, Longtao Huang, Hui Xue, Huimin Chen, Zhiyuan Liu, Maosong Sun
ACL 2025 Findings [GitHub] - Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents
Cheng Qian*, Bingxiang He*, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun
ACL 2024 Main [GitHub] - UltraFeedback: Boosting Language Models with High-quality Feedback
Ganqu Cui*, Lifan Yuan*, Ning Ding, Guanming Yao, Bingxiang He, Wei Zhu, Yuan Ni, Guotong Xie, Ruobing Xie, Yankai Lin, Zhiyuan Liu, Maosong Sun
ICML 2024 Poster [GitHub] - Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT
Biru Zhu, Lifan Yuan, Ganqu Cui, Yangyi Chen, Chong Fu, Bingxiang He, Yangdong Deng, Zhiyuan Liu, Maosong Sun, Ming Gu
EMNLP 2023 Main [GitHub] - A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks
Ganqu Cui*, Lifan Yuan*, Bingxiang He, Yangyi Chen, Zhiyuan Liu, Maosong Sun
NeurIPS Datasets & Benchmarks 2022 (Spotlight) [GitHub]
π Educations
- 2024.09 - 2029.06 (now), Tsinghua University Ph.D. in Computer Science and Technology (THUNLP)
- 2020.09 - 2024.06, Tsinghua University B.S. in Computer Science and Technology with honors
π Honors and Awards
- Outstanding Graduate Award, Beijing Municipal Education Commission. 2024.06
- Outstanding Paper Award for Diploma Project, Tsinghua University. 2024.06
- Five Star ZiJing Volunteer Award, Tsinghua University Communist Youth League Committee. 2024.05
- Comprehensive Merit Scholarship of Tsinghua for the 2022-2023 school year, Dept. of CST. 2023.10
- Comprehensive Merit Scholarship of Tsinghua for the 2021-2022 school year, Dept. of CST (Top 1). 2022.10
- Third Prize in THU Challenge Cup Academic Competition, Tsinghua University. 2022.04
- Comprehensive Merit Scholarship of Tsinghua for the 2020-2021 school year, Dept. of CST. 2021.10
- Second Prize in National Undergraduate Physics Competition, Beijing Physics Society. 2021.04
- Second Prize in Freshmen Scholarship, Tsinghua University. 2020.09
π¬ Invited Talks
- The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning. Alibaba Security. Link. 2025.05
- Tell me more! towards implicit user intention understanding of language model driven agents. Wiztalk. 2024.08
- Tell me more! towards implicit user intention understanding of language model driven agents. ModelBest. 2024.04
π οΈ Services
- Conference Reviewer: NeurIPS (2024 - 2025), ICLR (2025 - 2026), ICML (2025), ACL ARR (2024 - 2025), COLM (2025), COLM SCALR Workshop (2025), AAAI (2026), AISTATS (2025 - 2026), ICCV (2025)