Jin Wang
Multimodal Large Language Models
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
The rapid progress of large language models (LLMs) has catalyzed the emergence of multimodal large language models (MLLMs) that unify …
Jin Wang, Yao Lai, Aoxue Li, Shifeng Zhang, Jiacheng Sun, Ning Kang, Chengyue Wu, Zhenguo Li, Ping Luo
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling
Hallucinations in large vision-language models (LVLMs) pose significant challenges for real-world applications, as LVLMs may generate …
Xin Dong, Shichao Dong, Jin Wang, Jing Huang, Li Zhou, Zenghui Sun, Lihua Jing, Jingsong Lan, Xiaoyong Zhu, Bo Zheng