Jaden Park @_jadenpark - Twitter Profile

Pinned Tweet

about 2 months ago

We all knew LLM agents struggle to explore, but we had to eyeball it 👀. We couldn't measure exploration errors. Until now. 🗺️🤖 We built a policy-agnostic metric to quantify exploration and exploitation errors in LLM agents. Spoiler: Exploration error is what kills📉 agent performance in our setting 👇🧵(1/8)

1

31

17

5

2K

Jaden Park

@_jadenpark

23 days ago

@cataluna84 @AdobeResearch @michi_fischer Thank you so much!! 😁

0

1

0

15

Jaden Park

@_jadenpark

about 1 month ago

Lots of good news this week! 🚀
1. My internship project at @AdobeResearch has been accepted to #SIGGRAPH2026! ("MAOAM: Unified Object & Material Selection with Vision-Language Models") Special thanks to my wonderful mentor @michi_fischer, who made this project possible!
2. Paper accepted to #ICML2026! ("DocHop: Benchmarking Out-of-domain Multi-hop Reasoning in Information-Dense Documents")
3. Paper accepted (with minor revisions) at #DMLR! ("Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models")
In both papers, we generate carefully designed benchmarks to tackle compositional/multi-hop reasoning in VLMs. Proud to have contributed to these projects. More detailed posts soon :) Stay tuned!

1

23

3

0

653

Jaden Park

@_jadenpark

about 2 months ago

I will be at #ICLR2026 to present my work on data contamination in VLMs! (Fri, Apr 24, 2026 • 8:30 AM – 11:00 AM, Pavilion 3 P3-917) I am currently interested in VLA/physical AI, agents and robustness/generalization. Would love to chat and connect with anyone with similar interests :)

Jaden Park

@_jadenpark

7 months ago

Me: memorize past exams 📚💯 Also me: fail on a slight tweak 🤦‍♂️🤦‍♂️ Turns out, we can use the same method to 𝗱𝗲𝘁𝗲𝗰𝘁 𝗰𝗼𝗻𝘁𝗮𝗺𝗶𝗻𝗮𝘁𝗲𝗱 𝗩𝗟𝗠𝘀! 🧵(1/10) - Project Page: https://t.co/ue1GybD4fm

1

28

11

2

7K

0

18

2

956

Jaden Park

@_jadenpark

about 2 months ago

@neural_avb Thank you for your interest in our work! Feel free to let me know if you'd like to discuss anything about it! https://t.co/5uVypzw1Nz :)

0

2

0

33

Jaden Park

@_jadenpark

about 2 months ago

This was joint co-first-author work with @jungtaek_kim and @jongwonjeong123, with guidance from @rdnowak, @Kangwook_Lee, and @yong_jae_lee. If this sounds interesting, please check out our paper 📄 https://t.co/UYor59qioM! If you have any questions, feedback, or new ideas, I'd be more than happy to discuss!🧵(8/8)

0

6

0

1

296

Jaden Park

@_jadenpark

about 2 months ago

Can we reduce exploration failures in LM agents? 🛠️
🗺️ Exploration Prompts: Explicitly injecting exploration strategies increases success rate by 17%.
📝 Explicit Harness: Providing the agent with structured summaries of its past observations boosts success rate by 29.4%! 🧵(7/8)

1

5

0

206

Jaden Park

@_jadenpark

about 2 months ago

Excited to be back at @AdobeResearch this summer where I will be working with @Shramanpramani2 :) Would love to connect with anyone who will be around!

1

6

0

120

_jadenpark retweeted

Bocheng Zou @bochengzou

2 months ago

🔥 Upgrade your frozen vision encoders with <10 lines of code! Single-scale inference throws away vital details. Enter MuRF 🚀: a simple, training-free plug-in for instant, massive gains in MLLMs, Seg & Depth. 🤯 1/6

7

147

26

144

28K

Jaden Park

@_jadenpark

3 months ago

@thaoshibe GPT told me a few times that it won't help me if I keep being mean to it 😂

0

1

0

20

_jadenpark retweeted

Aniket Rege @wregss

3 months ago

🚨 New work with @Meta @RealityLabs! We introduce EGAgent, an agentic reasoning framework for very long video understanding, powered by entity scene graphs. Why? With long multimodal data streams, agents must search and reason across multiple modalities! 🧵 (1/n)

2

19

8

2

2K

_jadenpark retweeted

Harris Zhang @HyperStorm9682

3 months ago

New paper out! 🚨 Introducing STTS: Unified Spatio-Temporal Token Scoring for Efficient Video VLMs. We tackle the massive token bottleneck in video models by jointly identifying the tokens that actually matter. The overall figure below breaks down the core problem! 🧵👇

1

18

4

11

5K

_jadenpark retweeted

Aniket Rege @wregss

3 months ago

Hi ML Twitter! My Summer 2026 internship unfortunately fell through last minute 😵‍💫 If your team is looking for interns, I’d love to connect - RTs appreciated 🙏 My website: https://t.co/rNih6t6Emb

17

265

28

69

35K
