Yue Zhang

Hello! I am a postdoctoral research associate in the MURGe-Lab led by Prof. Mohit Bansal, at the University of North Carolina, Chapel Hill. I earned my Ph.D. at Michigan State University, where I was advised by Prof. Parisa Kordjamshidi. I was a visiting scholar at Virginia Tech, collaborating with Prof. Lifu Huang. Prior to Ph.D. study, I obtained my master’s degree from Peking University. My research interests include multimodal learning, embodied agents, and spatial reasoning.

Email  /  CV  /  Google Scholar  /  Github  /  LinkedIn

profile photo

News

  • [2025.4] Offically Joined MURGe-Lab!
  • [2025.2] One paper got accepted by CVPR 2025!
  • [2025.2] Passed the Ph.D. dissertation defense!
  • [2025.1] One paper got accepted by ICLR 2025!
  • [2024.11] Our VLN survey got accepted by TMLR!
  • [2024.7] One paper got accepted by ECCV 2024!

Selected Publications

Rethinking Vision Language Model in Face Forensic: Multi-modal Interpretable Forged Face Detector
Xiao Guo, Xiufeng Song, Yue Zhang, Xiaohong Liu, Xiaoming Liu
CVPR, 2025
Project Page
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
Yue Zhang, Zhiyang Xu, Ying Shen, Parisa Kordjamshidi, Lifu Huang
ICLR, 2025
Code / ArXiv
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang*, Ziqiao Ma*, Jialu Li*, Yanyuan Qiao*, Zun Wang*, Joyce Chai, Qi Wu, Mohit Bansal, Parisa Kordjamshidi
TMLR, 2024
Code / ArXiv
Narrowing the Gap between Vision and Action in Navigation
Yue Zhang, Parisa Kordjamshidi
ACM MM, 2024
Code / ArXiv
Common Sense Reasoning for Deepfake Detection
Yue Zhang, Ben Colman, Xiao Guo, Ali Shahriyari, Gaurav Bharaj
ECCV, 2024
Code / ArXiv
NavHint: Vision and Language Navigation Agent with a Hint Generator
Yue Zhang, Quan Guo, Parisa Kordjamshidi
EACL Findings, 2024
Code / ArXiv
VLN-Trans: Translator for the Vision and Language Navigation Agent
Yue Zhang, Parisa Kordjamshidi
ACL (Oral), 2023
Code / ArXiv
LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation
Yue Zhang, Parisa Kordjamshidi
COLING (Oral), 2022
Code / ArXiv
Explicit Object Relation Alignment for Vision and Language Navigation
Yue Zhang, Parisa Kordjamshidi
ACL SRW, 2022
Code / ArXiv
Towards Navigation by Reasoning over Spatial Configurations
Yue Zhang, Quan Guo, Parisa Kordjamshidi
ACL workshop on SpLU-RoboNLP, 2021
Code / ArXiv

Professional Service

  • [2024.7] Co-organizer of SpLU-RoboNLP @ ACL 2024
  • [2022.11] Invited talk at Sichuan University
  • Reviewer for ACL, EMNLP, NAACL