I am an undergraduate student in the Yao Class at Tsinghua University. I am interested in language models, reinforcement learning, and AI for science.
Education
IIIS, Tsinghua University
Undergraduate Student
Beijing, China, 2023 - now
GPA: 3.98/4.00, with 15 A+ (Top ~5%) and 15 A (Top ~25%)
Publications
Papers sorted by recency. Representative papers are highlighted.
metaTextGrad: Automatically optimizing language model optimizers
Guowei Xu, Mert Yuksekgonul, Carlos Guestrin, James Zou
arXiv
paper / bibtex
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
Guowei Xu*, Peng Jin*, Ziang Wu*, Hao Li, Yibing Song, Lichao Sun, Li Yuan
International Conference on Computer Vision (ICCV), 2025
model & dataset / arXiv / code / bibtex
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
Suning Huang*, Zheyu Zhang*, Tianhai Liang, Yihan Xu, Zhehao Kou, Chenhao Lu, Guowei Xu, Zhengrong Xue, Huazhe Xu
International Conference on Machine Learning (ICML), 2025
project page / arXiv / code / bibtex
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji*, Yongyuan Liang*, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu
International Conference on Machine Learning (ICML), 2024 (Oral)
project page / arXiv / code / bibtex
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
Guowei Xu*, Ruijie Zheng*, Yongyuan Liang*, Xiyao Wang, Zhecheng Yuan, Tianying Ji, Yu Luo, Xiaoyu Liu, Jiaxin Yuan, Pu Hua, Shuzhen Li, Yanjie Ze, Hal Daumé III, Furong Huang, Huazhe Xu
International Conference on Learning Representations (ICLR), 2024 (Spotlight)
project page / arXiv / code / bibtex
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?
Jialu Gao*, Kaizhe Hu*, Guowei Xu, Huazhe Xu
Conference on Neural Information Processing Systems (NeurIPS), 2023
project page / arXiv / bibtex
Selected open-source projects
LLaVA-CoT Official Implementation ⭐ 2023
Guowei Xu, Peng Jin, Ziang Wu (Code Implementation)
Nov 2024
model & dataset / arXiv / code / bibtex
DrM Official Implementation ⭐ 75
Guowei Xu, Ruijie Zheng, Yongyuan Liang (Code Implementation)
Sept 2023
project page / arXiv / code / bibtex
Selected Talks and Reports
YouTuber Reports: LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
Speaker
Timestamp 7:58 (YouTube), 2024
video link
Invited Talk: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
Speaker
88th Seminar (RL China), 2024
video link
Invited Talk: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
Speaker
ICLR Seminar (AI TIME), 2024
video link
Selected Awards
Sparking Program
The most prestigious and selective academic organization for students at THU
Tsinghua University (THU), 2025 (Top 1% in Tsinghua)
National Scholarship
scholar
China, 2024 (Top Scholarship in China)
Freshman Scholarship (First Prize)
scholar
Tsinghua University (THU), 2024 (Top Scholarship for Freshmen)
The 52nd International Physics Olympiad (IPhO 2022)
Overall Winner (best total score)
International Physics Olympiad (IPhO), 2022
Teaching Assistant
Deep Reinforcement Learning (Graduate Course)
Director: Huazhe Xu
TAs: Guowei Xu, Kaizhe Hu, Pu Hua
Tsinghua University (THU), 2024
Natural Language Processing
Director: Tianxing He
TAs: Guowei Xu, Yeqi Feng, Minrui Luo
Tsinghua University (THU), 2025
Reviewer
International Conference on Learning Representations (ICLR)
Role: Reviewer for main conference
Official Site: https://iclr.cc/
Years: 2025
International Conference on Machine Learning (ICML)
Role: Reviewer for main conference
Official Site: https://icml.cc/
Years: 2025
Conference on Neural Information Processing Systems (NeurIPS)
Role: Reviewer for main conference
Official Site: https://nips.cc/
Years: 2025
Conference on Robot Learning (CoRL)
Role: Reviewer for workshop
Official Site: https://www.corl.org/
Years: 2025
Language and Writing
Writing and Communication / Roads to Academia
A+ score (Top ~5%)
Tsinghua University (THU), 2024-2025