We release Harmon: a unified framework for multimodal understanding & generation with a shared visual encoder (vs. the decoupled encoders in Janus/Janus-Pro).
SOTA on GenEval, MJHQ, WISE
Strong understanding performance
Paper: https://t.co/RFhEl9NEN7
Code: https://t.co/O6og7NcYAI
Our paper proposes a clever and novel approach to Open-Vocabulary Object Detection. Come check it out this afternoon at #CVPR2023!
Location: West Exhibit Hall #276
Time: 04:30 PM - 06:00 PM
Paper: https://t.co/Pyki7tVThJ