Jin Wang
Evaluation
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
The capability to process multiple images is crucial for Large Vision-Language Models (LVLMs) to develop a more thorough and nuanced …
Fanqing Meng, Jin Wang, Chuanhao Li, Quanfeng Lu, Hao Tian, Jiaqi Liao, Xizhou Zhu, Jifeng Dai, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao