Zhaoye's Homepage

Hey! I'm Zhaoye

Zhaoye's Homepage

👋 About

I am a Phd student in Fudan University and advised by Prof. Xipeng Qiu. Also the member of FudanNLP. Before that i completed my bachelor’s in Hangzhou Dianzi University and is the member of vidar-team. Also the dedicated photographer of @blueblueO908.

My current research focuses on building AGI agents based on large language models (LLMs), with two main directions: (1) Embodied Large Models (LMs) for environmental interaction, encompassing embodied task planning, grounding, and perception; and (2) Voice Intelligence systems for human-computer interaction.

📰 News

[Jun. 2025]
One paper accepted to ICCV 2025!
[Mar 2025]
Three papers accepted to ACL 2025!
[Sep. 2024]
One paper accepted to EMNLP 2024!
[May 2024]
One paper accepted to ACL 2024!
[Feb. 2024]
We released Wanjuan-cc, a high-quality open-sourced english webtext dataset.

📚 Publications

Card image

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon ...

ICCV 2025
Shiduo Zhang, ... , Zhaoye Fei, Zhangyue Yin, Zuxuan Wu, Yu-Gang Jiang and Xipeng Qiu
Card image

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

ACL 2025
Siyin Wang, Zhaoye Fei, Qinyuan Cheng, Shiduo Zhang, Panpan Cai, Jinlan Fu and Xipeng Qiu
Card image

How to Mitigate Overfitting in Weak-to-strong Generalization?

ACL 2025
Junhao Shi*, Qinyuan Cheng*, Zhaoye Fei, Yining Zheng and Qipeng Guo, Xipeng Qiu
Card image

Visuothink: Empowering lvlm reasoning with multimodal tree search

ACL 2025
Yikun Wang*, Siyin Wang*, Qinyuan Cheng, Zhaoye Fei, Liang Ding, Qipeng Guo, Dacheng Tao andXipeng Qiu
Card image

Balanced data sampling for language model training with clustering

ACL 2024
Yunfan Shao*, Linyang Li*, Zhaoye Fei, Hang Yan, Dahua Lin and Xipeng Qiu
Card image

Turn Waste into Worth: Rectifying Top-k Router of MoE

EMNLP 2024
Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu

Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding

COLING 2022
Zhaoye Fei*, Yu Tian*, Yongkang Wu*, Xinyu Zhang*, Yutao Zhu, ...

Data and Model Architecture in Base Model Training

CCL 2023
Hang Yan*, Zhaoye Fei*, Xiaopeng Yang, Yang Gao, Xipeng Qiu

📄 TECHNICAL REPORT & PREPRINT

Internlm2 technical report

Technical Report, ArXiv
InternLM Team

Internlm-math: Open math large language models toward verifiable reasoning

Technical Report, ArXiv
Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, ...

Unearthing Large Scale Domain-Specific Knowledge from Public Corpora

ArXiv
Zhaoye Fei*, Yunfan Shao*, Linyang Li*, Zhiyuan Zeng, Conghui He, Hang Yan, Dahua Lin, Xipeng Qiu

Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?

ArXiv
Jiawen Wu, Xinyu Zhang, Yutao Zhu, Zheng Liu, Zikai Guo, Zhaoye Fei, ...

Towards More Effective and Economic Sparsely-Activated Model

ArXiv
Zhaoye Fei*, Hao Jiang*, Ke Zhan*, Jianwei Qu*, ...

🔧 Services

ARR (2024, 2025) NeurIPS (2024, 2025) ICLR (2024, 2025) ICML (2025) AISTATS (2025)

Follow me

I work on Robots based on LLMs interactive with human and our world~

MOSI AI
HuaFa Road St.699 No.3
shanghai