Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Sicheng Feng1,2,4†  Song Wang3,4†  Shuyi Ouyang3,4  Lingdong Kong4  Zikai Song4,5  Jianke Zhu3  Huan Wang1,*  Xinchao Wang4 

2025

1Westlake University, Hangzhou, China 2Nankai University, Tianjin, China 3Zhejiang University, Hangzhou, China 4National University of Singapore, Singapore 5Huazhong University of Science and Technology, Wuhan, China
*Corresponding author: wanghuan@westlake.edu.cn

WLU
NKU
ZJU
NUS
HZU
ENCODE Lab

Abstract

BibTeX

@article{,
    title={},
    author={},
    journal={arXiv preprint arXiv:},
    year={2025},
}