π₯ We release Harmon: a unified framework for multimodal understanding & generation with a shared visual encoder (vs. decoupled Janus/-Pro).
π₯ SOTA on GenEval, MJHQ, WISE
π§ Strong understanding performance
π Paper: https://t.co/RFhEl9NEN7
π Code: https://t.co/O6og7NcYAI
π₯ We release Harmon: a unified framework for multimodal understanding & generation with a shared visual encoder (vs. decoupled Janus/-Pro).
π₯ SOTA on GenEval, MJHQ, WISE
π§ Strong understanding performance
π Paper: https://t.co/RFhEl9NEN7
π Code: https://t.co/O6og7NcYAI
πOur paper proposes a clever and novel approach to Open-Vocabulary Object Detection. Come and check it in this afternoon #CVPR2023 ! β
π Location: West Exhibit Hall #276
β° Time: 04: 30 PM - 06:00 PM
Paper: https://t.co/Pyki7tVThJ