Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning

Publication
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2026)
Jin Wang
CS PhD Student at HKU

My research focuses on multimodal foundation models, especially unified systems that connect visual understanding, generation, and evaluation, with earlier work in deepfake detection and AI interpretability.