Jongwoo Park @jongwoopark7978 - Twitter Profile

Pinned Tweet

9 days ago

Excited to share our poster schedule for 🤖IVRA at ICRA 2026 in Vienna 📍 Thu, June 4, 15:00-16:30, Hall C, ThI2I.318 A lightweight, training-free method that improves 🧠📈 spatial understanding in VLA with affinity hint 🎥 More demo: https://t.co/WPyPIuDvcK More details ⬇️

0

2

0

171

Jongwoo Park @jongwoopark7978

3 months ago

Excited to share our poster schedule for 🎞️LVNet at EACL 2026 in Morocco 📍 March 27, 9:00-10:30 AM, Poster Hall, Session 10 LVNet is a Training-free keyframe selector for long-video QA Paper: https://t.co/wvrtJ2jmSc Demo: LVNet (top) vs VideoTree (bottom)

1

6

0

1

162

Jongwoo Park @jongwoopark7978

3 months ago

🎉Congrats to the team: @kahnchana @jjh6297 Cristina Mata, Yoo Sung Jang, @ryoo_michael

0

41

Jongwoo Park @jongwoopark7978

3 months ago

IVRA is accepted to #ICRA2026 🎉 A lightweight, training-free method that improves 🧠📈 spatial understanding in VLA models using affinity hints already inside the vision encoder—no external encoder. 🎥 More demo: https://t.co/WPyPIuDvcK More details ⬇️

3

4

2

0

188

Who to follow

Kanchana Ranasinghe

@kahnchana

🤖 Vision & Robotics Researcher @SFResearch 👨🏽‍💻 Former Intern @Apple MLR, @AIatMeta, @GoogleResearch, @mbzuai 💃🏻 Dancer in free time

Staff Research Scientist @GoogleDeepMind, Gemini video & Omni 🎥. Prev: PhD @Inria & @ENS_ULM, MEng @Polytechnique.

jongwoopark7978 retweeted

Kanchana Ranasinghe

@kahnchana

3 months ago

[CVPR 2026] FOFPred has been accepted to #CVPR2026 (Findings)! We build a diffusion-based model that predicts Future Optical Flow from a single image guided by natural language instructions. Checkout code, model ckpt, & live demo at: https://t.co/tCFSYWlSNr

4

27

6

7

3K

Jongwoo Park @jongwoopark7978

4 months ago

Code + more demos: https://t.co/SMSd6ZuucH Congrats to the team!: @kahnchana , @kkahatapitiy , Wonjeong Ryu, Donghyun Kim, @ryoo_michael

0

3

0

57

Jongwoo Park @jongwoopark7978

4 months ago

LVNet accepted to #EACL26! Training-free keyframe selector for long-video QA: 🎯High accuracy low caption,⚡up to 3.4x speed, ⚙️filters 1,800 to 24 keyframes on 1 GPU,💸10x cheaper LLM cost. Paper: https://t.co/wvrtJ2jmSc More details in the thread. ⬇️ Demo: LVNet (top)

2

5

3

2

410

jongwoopark7978 retweeted

Xiang Li @XiangLi54505720

over 1 year ago

(1/5) Excited to present our #ICLR2025 paper, LLaRA, at NYC CV Day! LLaRA efficiently transforms a pretrained Vision-Language Model (VLM) into a robot Vision-Language-Action (VLA) policy, even with a limited amount of training data. More details are in the thread. ⬇️

XiangLi54505720's tweet photo. (1/5)
Excited to present our #ICLR2025 paper, LLaRA, at NYC CV Day!
LLaRA efficiently transforms a pretrained Vision-Language Model (VLM) into a robot Vision-Language-Action (VLA) policy, even with a limited amount of training data.
More details are in the thread. ⬇️ https://t.co/gEKBnULIbO

1

44

6

15

13K

jongwoopark7978 retweeted

TwelveLabs (twelvelabs.io)

@twelve_labs

over 1 year ago

The webinar recording of this session with @jongwoopark7978, @kkahatapitiy, and @kahnchana is up! Watch here: https://t.co/X4wmcgai6W 📺 All three talks focused on efficient multimodal LLMs for long videos: Visual keyframes, Effective captions, and Fast inference. Enjoy!

0

4

3

0

342

Jongwoo Park @jongwoopark7978

almost 2 years ago

@XiangLi54505720 Thanks for all your effort Xiang!

0

105

Jongwoo Park @jongwoopark7978

almost 2 years ago

🚀 Check out our new arXiv release! We've demonstrated the effectiveness of the Hierarchical Keyframe Selector for very long-form VQA. The model processes video in three stages. Explore our work and code here: https://t.co/SMSd6Zv22f

jongwoopark7978's tweet photo. 🚀 Check out our new arXiv release!

We've demonstrated the effectiveness of the Hierarchical Keyframe Selector for very long-form VQA. The model processes video in three stages.

Explore our work and code here: https://t.co/SMSd6Zv22f https://t.co/3vDkk7EvwT

0

10

2

1

395

jongwoopark7978 retweeted

Xiang Li @XiangLi54505720

almost 2 years ago

(6/N) We ran multiple types of real-world robot experiments and found that our method, trained on just 8k simulated data, performs strongly in unseen real-world settings. With minimal in-domain fine-tuning, the model achieves a 91.6% average success rate!