Yue Zhang
Hello! I am a postdoctoral research associate in the MURGe-Lab led by Prof. Mohit Bansal , at the University of North Carolina, Chapel Hill. I earned my Ph.D. at Michigan State University, where I was advised by Prof. Parisa Kordjamshidi . I was a visiting scholar at Virginia Tech, collaborating with Prof. Lifu Huang . Prior to Ph.D. study, I obtained my master’s degree from Peking University. My research interests include multimodal learning, embodied agents, and spatial reasoning.
Email /
CV /
Google Scholar /
Github /
LinkedIn
News
[2025.6] Please check NEW preprints of MEXA and EPiC .
[2025.4] Offically Joined MURGe-Lab!
[2025.2] One paper got accepted by CVPR 2025!
[2025.2] Passed the Ph.D. dissertation defense!
[2025.1] One paper got accepted by ICLR 2025!
[2024.11] Our VLN survey got accepted by TMLR!
[2024.7] One paper got accepted by ECCV 2024!
Selected Publications
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
Shoubin Yu* ,
Yue Zhang* ,
Ziyang Wang ,
Jaehong Yoon ,
Mohit Bansal
Preprint , 2025
ArXiv /
Code
Your browser does not support the video tag.
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
Zun Wang ,
Jaemin Cho ,
Jialu Li ,
Han Lin ,
Jaehong Yoon ,
Yue Zhang ,
Mohit Bansal
Preprint , 2025
ArXiv /
Code
Rethinking Vision Language Model in Face Forensic: Multi-modal Interpretable Forged Face Detector
Xiao Guo ,
Xiufeng Song ,
Yue Zhang ,
Xiaohong Liu ,
Xiaoming Liu
CVPR (Oral ) , 2025
ArXiv /
Code
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
Yue Zhang ,
Zhiyang Xu ,
Ying Shen ,
Parisa Kordjamshidi ,
Lifu Huang
ICLR , 2025
ArXiv /
Code
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang* ,
Ziqiao Ma* ,
Jialu Li* ,
Yanyuan Qiao* ,
Zun Wang* ,
Joyce Chai ,
Qi Wu ,
Mohit Bansal ,
Parisa Kordjamshidi
TMLR , 2024
ArXiv /
Code
Narrowing the Gap between Vision and Action in Navigation
Yue Zhang ,
Parisa Kordjamshidi
ACM MM , 2024
ArXiv /
Code
Common Sense Reasoning for Deepfake Detection
Yue Zhang , Ben Colman,
Xiao Guo , Ali Shahriyari,
Gaurav Bharaj
ECCV , 2024
ArXiv /
Code
NavHint: Vision and Language Navigation Agent with a Hint Generator
Yue Zhang ,
Quan Guo ,
Parisa Kordjamshidi
EACL Findings , 2024
ArXiv /
Code
VLN-Trans: Translator for the Vision and Language Navigation Agent
Yue Zhang ,
Parisa Kordjamshidi
ACL (Oral ) , 2023
ArXiv
Code
LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation
Yue Zhang ,
Parisa Kordjamshidi
COLING (Oral ) , 2022
ArXiv /
Code
Explicit Object Relation Alignment for Vision and Language Navigation
Yue Zhang ,
Parisa Kordjamshidi
ACL SRW , 2022
ArXiv /
Code
Towards Navigation by Reasoning over Spatial Configurations
Yue Zhang ,
Quan Guo ,
Parisa Kordjamshidi
ACL workshop on SpLU-RoboNLP , 2021
ArXiv /
Code
Professional Service
[2024.7] Co-organizer of SpLU-RoboNLP @ ACL 2024
[2022.11] Invited talk at Sichuan University
Reviewer for ACL, EMNLP, NAACL