Hey! I'm Zhaoye

👋 About
I am a Phd student in Fudan University and advised by Prof. Xipeng Qiu. Also the member of FudanNLP. Before that i completed my bachelor’s in Hangzhou Dianzi University and is the member of vidar-team. Also the dedicated photographer of @blueblueO908.
My current research focuses on building AGI agents based on large language models (LLMs), with two main directions: (1) Embodied Large Models (LMs) for environmental interaction, encompassing embodied task planning, grounding, and perception; and (2) Voice Intelligence systems for human-computer interaction.
📰 News
[Jun. 2025]
One paper accepted to ICCV 2025!
[Mar 2025]
Three papers accepted to ACL 2025!
[Sep. 2024]
One paper accepted to EMNLP 2024!
[May 2024]
One paper accepted to ACL 2024!
[Feb. 2024]
We released Wanjuan-cc, a high-quality open-sourced english webtext dataset.
📚 Publications

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon ...
ICCV 2025
Shiduo Zhang, ... , Zhaoye Fei, Zhangyue Yin, Zuxuan Wu, Yu-Gang Jiang and Xipeng Qiu

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
ACL 2025
Siyin Wang, Zhaoye Fei, Qinyuan Cheng, Shiduo Zhang, Panpan Cai, Jinlan Fu and Xipeng Qiu

How to Mitigate Overfitting in Weak-to-strong Generalization?
ACL 2025
Junhao Shi*, Qinyuan Cheng*, Zhaoye Fei, Yining Zheng and Qipeng Guo, Xipeng Qiu

Visuothink: Empowering lvlm reasoning with multimodal tree search
ACL 2025
Yikun Wang*, Siyin Wang*, Qinyuan Cheng, Zhaoye Fei, Liang Ding, Qipeng Guo, Dacheng Tao andXipeng Qiu

Balanced data sampling for language model training with clustering
ACL 2024
Yunfan Shao*, Linyang Li*, Zhaoye Fei, Hang Yan, Dahua Lin and Xipeng Qiu

Turn Waste into Worth: Rectifying Top-k Router of MoE
EMNLP 2024
Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding
COLING 2022
Zhaoye Fei*, Yu Tian*, Yongkang Wu*, Xinyu Zhang*, Yutao Zhu, ...
Data and Model Architecture in Base Model Training
CCL 2023
Hang Yan*, Zhaoye Fei*, Xiaopeng Yang, Yang Gao, Xipeng Qiu
📄 TECHNICAL REPORT & PREPRINT
Internlm-math: Open math large language models toward verifiable reasoning
Technical Report, ArXiv
Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, ...
Internlm: A multilingual language model with progressively enhanced capabilities
Technical Report
InternLM Team
Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
ArXiv
Zhaoye Fei*, Yunfan Shao*, Linyang Li*, Zhiyuan Zeng, Conghui He, Hang Yan, Dahua Lin, Xipeng Qiu
Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?
ArXiv
Jiawen Wu, Xinyu Zhang, Yutao Zhu, Zheng Liu, Zikai Guo, Zhaoye Fei, ...
Towards More Effective and Economic Sparsely-Activated Model
ArXiv
Zhaoye Fei*, Hao Jiang*, Ke Zhan*, Jianwei Qu*, ...
🔧 Services
ARR (2024, 2025) NeurIPS (2024, 2025) ICLR (2024, 2025) ICML (2025) AISTATS (2025)