About Me

I am a second-year Master’s student at Tsinghua University and a member of the CVML Lab, advised by Prof. Chun Yuan.

Before joining Tsinghua, I received my B.S. in Computer Science and Technology from the Central University of Finance and Economics in 2025. My recent research focuses on training better foundation multimodal large language models.

Research Interests

  • Multimodal large language models
  • Controllable video generation and world models
  • Multimodal Image Fusion
  • Remote sensing understanding and reasoning

Education

  • Tsinghua University, M.S. in Computer Technology, 2025 - present
  • Central University of Finance and Economics, B.S. in Computer Science and Technology, 2021 - 2025

Publications

Image Generation

Multimodal Understanding

Internship

XPENG Motors

Embodied AI Research Intern · Topic: VLM Pre-training · Mentor: TBD.

July 2026 - Present

Tencent LIGHTSPEED STUDIOS

Research Intern · Topic: Multimodal Large Language Models · Mentor: Shengju Qian.

July 2024 - June 2025

Awards

M Award, Mathematical Contest in Modeling 2024
Huawei Scholarship, Central University of Finance and Economics 2023
First-Class Scholarship for Comprehensive Development, Central University of Finance and Economics 2022 and 2023
Outstanding Academic Scholarship, Central University of Finance and Economics 2023