Michael Munje @michaelmunje - Twitter Profile

Pinned Tweet

9 months ago

[1/8] New social navigation paper + benchmark: SocialNav-SUB 🚶🤖 Recent work puts VLMs on robots for navigation, but can they really interpret scenes and extract key details for social navigation? 🔎 https://t.co/2rlcIQpf6h

michaelmunje retweeted

Zichao @ZichaoHu99

9 months ago

How can robots follow complex instructions in dynamic environments? 🤖 Meet ComposableNav — a diffusion-based planner that enables robots to generate novel navigation behaviors that satisfy diverse instruction specifications on the fly — no retraining needed. 📄 Just accepted to CoRL 2025 🔗 Project: https://t.co/FX3O0ZYYyD A Thread (1/8)

Michael Munje @michaelmunje

9 months ago

A huge thanks to my collaborators for making this work possible! @ChenTangMark @dafeijing @ZichaoHu99 @yifengzhu_ut @cuijiaxun @GarrettWarnell @Joydeepb_robots @PeterStone_TX

Michael Munje @michaelmunje

9 months ago

[8/8] 🤝 SocialNav-SUB is a human-grounded check on whether VLMs understand social navigation scenes ✨ Please read our paper for more info: https://t.co/FpL3kqMFVP #Robotics #VLM #SocialNavigation

Michael Munje @michaelmunje

9 months ago

[7/8] SocialNav-SUB is also fully open-source, actively maintained, and easily extendable to customized prompts and/or additional VLMs! Pull requests are always welcome! https://t.co/IJCjCUo5Io

Michael Munje @michaelmunje

9 months ago

[6/8] 🧪 Does chain-of-thought (using spatial/spatiotemporal VQAs first) improve social reasoning? ✅ Yes. Does BEV context help models? ⚖️ Model-dependent (sometimes a lot). Does better spatial(temporal) context improve social reasoning? ✅ Yes.

Michael Munje @michaelmunje

9 months ago

[5/8] 📊 Do today’s VLMs agree with human judgments? We find that they still trail behind humans and simple rule-based baselines.

Michael Munje @michaelmunje

9 months ago

[4/8] 👥 We collected human data from an IRB-approved human-subject study to construct our benchmark and evaluate whether models align with human judgments in social navigation scenes.

Michael Munje @michaelmunje

9 months ago

[3/8] SocialNav-SUB features real-world social navigation scenarios built from SCAND scenarios @ 4 Hz → PHALP tracking → front-view & BEV with labeled pedestrians, combining them with a set of carefully designed questions to create our VQA prompts (5k in total).

Michael Munje @michaelmunje

9 months ago

[2/8] We introduce SocialNav-SUB: a VQA benchmark to evaluate spatial, spatiotemporal, and social reasoning for real-world social navigation scenarios with object-centric grounding (front view + Bird’s-Eye-View (BEV) + numbered markers) to provide rich context to VLMs.
