Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving

Publication
arXiv preprint arXiv:2605.23163
Jin Wang
Jin Wang
CS PhD Student at HKU

My research focuses on multimodal foundation models, especially unified systems that connect visual understanding, generation, and evaluation, with earlier work in deepfake detection and AI interpretability.