pat ✈️ CVPR @patrickamadeus_ - Twitter Profile

Pinned Tweet

10 months ago

Personal update: I am starting my PhD @mbzuai where I look forward to work in multimodal realm (interpretability, modality imbalance, eval & application) to address foundational gaps with @AlhamFikri and co.

patrickamadeus_'s tweet photo. Personal update: I am starting my PhD @mbzuai where I look forward to work in multimodal realm (interpretability, modality imbalance, eval & application) to address foundational gaps with @AlhamFikri and co. https://t.co/QKEPNhW9DG

6

144

3

15

13K

pat ✈️ CVPR

@patrickamadeus_

about 10 hours ago

Come visit one of https://t.co/uT0S7enbHs paper @ exhibit hall board 171, 7-9AM today! #CVPR2026

0

6

0

46

patrickamadeus_ retweeted

Jia-Bin Huang

@jbhuang0604

about 24 hours ago

POV: attending CVPR

14

528

39

69

27K

patrickamadeus_ retweeted

Andrei Bursuc @CVPR @abursuc

2 days ago

The humbling lesson for humans from Alyosha: humans turned out much simpler than we thought, 90% of the time we’re just nearest neighbor machines, pastiches from high-school reading lists 🙃 #cvpr2026

abursuc's tweet photo. The humbling lesson for humans from Alyosha: humans turned out much simpler than we thought, 90% of the time we’re just nearest neighbor machines, pastiches from high-school reading lists 🙃 #cvpr2026 https://t.co/UoGBHA1R4J

4

139

10

59

25K

Who to follow

building https://t.co/IteJHsIK8l • https://t.co/BIfpnt5mQE • https://t.co/lCq2Dpqgln

pong

@gagaspb

turn back, nothing interesting here

patrickamadeus_ retweeted

NVIDIA AI

@NVIDIAAI

5 days ago

Introducing Cosmos 3: Our latest frontier model for Physical AI Cosmos 3 is the world’s first fully open omnimodel with native vision reasoning, world and action generation. Today we’re releasing Super (32B) and Nano (8B) variants.

96

3K

407

1K

395K

pat ✈️ CVPR

@patrickamadeus_

4 days ago

In the world of "you can just do things", one must try not to do something that solves nothing

1

3

0

100

patrickamadeus_ retweeted

alphaXiv

@askalphaxiv

6 days ago

"Learn from your own latents, not tokens: A Sample Complexity Theory" This paper explains why data2vec and JEPA can learn with much less data. They showed that when data has hidden hierarchy, token prediction becomes harder as the hierarchy gets deeper. But latent prediction keeps the learning problem simple at every level. Which suggests that models may learn faster when they stop predicting raw tokens and start predicting their own abstractions.

askalphaxiv's tweet photo. "Learn from your own latents, not tokens: A Sample Complexity Theory"

This paper explains why data2vec and JEPA can learn with much less data.

They showed that when data has hidden hierarchy, token prediction becomes harder as the hierarchy gets deeper. But latent prediction keeps the learning problem simple at every level.

Which suggests that models may learn faster when they stop predicting raw tokens and start predicting their own abstractions.

9

630

106

520

35K

patrickamadeus_ retweeted

OpenAI

@OpenAI

7 days ago

AI can give researchers the freedom to pursue “crazier” ideas. For Terence Tao, AI creates more room to experiment, test unexpected paths, and discover what might otherwise stay out of reach.

310

6K

630

1K

1M

patrickamadeus_ retweeted

Muratcan Koylan

@koylanai

9 days ago

'Agent Harness Engineering: A Survey' just cited my Agent Skills for Context Engineering project in its Context & Memory Management section. It’s a new paper on OpenReview (authors from CMU, Yale, Johns Hopkins, Amazon + others). They reviewed 170+ open-source projects and pulled real production lessons from OpenAI, Anthropic, and LangChain. Agent performance in the real world = Model capability + Harness quality For long-horizon, multi-step, production tasks, the harness has become the main bottleneck. Simple harness tweaks (better tool formats, sandbox changes, automated verification loops) deliver significant gains on benchmarks. This is the second time my open-source work has been cited in academic research (first was Peking University’s State Key Lab paper on meta context engineering). I’m genuinely proud of that, but more than anything it reminds me why I love open source. I’m not from academia. I learned this field by building, shipping, writing... Open source lets your experiments enter the research papers. That is still one of the best parts of this field. The paper is worth reading. We're moving from “build one agent” to “operate a fleet of long-running agents” and the paper repeatedly shows that the biggest improvements come from turning production traces into regression tests and automated harness fixes. Paper & Repo: https://t.co/PAjqvOXedL

koylanai's tweet photo. 'Agent Harness Engineering: A Survey' just cited my Agent Skills for Context Engineering project in its Context & Memory Management section.

It’s a new paper on OpenReview (authors from CMU, Yale, Johns Hopkins, Amazon + others). They reviewed 170+ open-source projects and pulled real production lessons from OpenAI, Anthropic, and LangChain.

Agent performance in the real world = Model capability + Harness quality

For long-horizon, multi-step, production tasks, the harness has become the main bottleneck. Simple harness tweaks (better tool formats, sandbox changes, automated verification loops) deliver significant gains on benchmarks.

This is the second time my open-source work has been cited in academic research (first was Peking University’s State Key Lab paper on meta context engineering).

I’m genuinely proud of that, but more than anything it reminds me why I love open source. I’m not from academia. I learned this field by building, shipping, writing...

Open source lets your experiments enter the research papers. That is still one of the best parts of this field.

The paper is worth reading. We're moving from “build one agent” to “operate a fleet of long-running agents” and the paper repeatedly shows that the biggest improvements come from turning production traces into regression tests and automated harness fixes.

Paper & Repo: https://t.co/PAjqvOXedL

14

715

146

833

38K

patrickamadeus_ retweeted

Eunsu Kim @euns0o_kim

8 days ago

After submitting our culture mixing paper to CVPR (https://t.co/YWFLGl1BSp), we came across the ConfusedTourist paper which shares same motivation but different and interesting analysis! We’ve put together a joint website to share our findings. Check it out below!

0

10

3

0

864

pat ✈️ CVPR

@patrickamadeus_

8 days ago

Thankful for all of the contributors & authors! - WoF @euns0o_kim @jjjunyeong @aliceoh and others - ConfusedTourist @IkhlasulHanif0 @emthehunt @gentaiscool @FajriKoto @AlhamFikri - CubeMix @/JunSeongKim and others p.s. If you are heading to #CVPR2026, come by and say hi as me and @jjjunyeong will be around! [6/n]

0

3

0

257

pat ✈️ CVPR

@patrickamadeus_

8 days ago

Introducing CultureMix: a joint findings showing that when cultures collide, VLMs collapse! https://t.co/gXZSH8ygS6 3 papers (2 CVPR, 1 NAACL) 30k+ samples 60+ countries & 300+ cultural concepts up to -58% accu. drop! #CVPR2026 #NAACL2025 🇺🇸 👇👇👇👇 [1/n]

patrickamadeus_'s tweet photo. Introducing CultureMix: a joint findings showing that when cultures collide, VLMs collapse!

https://t.co/gXZSH8ygS6

3 papers (2 CVPR, 1 NAACL)
30k+ samples
60+ countries & 300+ cultural concepts
up to -58% accu. drop!

#CVPR2026 #NAACL2025 🇺🇸
👇👇👇👇 [1/n]

1

15

8

0

2K

pat ✈️ CVPR

@patrickamadeus_

8 days ago

Too much? Come try the samples in our hub! You can copy our exact prompts and culture-mixed images to test where your VLM's understanding breaks down 🤖 [5/n]

patrickamadeus_'s tweet photo. Too much? Come try the samples in our hub!

You can copy our exact prompts and culture-mixed images to test where your VLM's understanding breaks down 🤖

[5/n] https://t.co/inTYxFMw2r

1

0

116

patrickamadeus_ retweeted

Anushka@CVPR26' @_anushkaagarwal

9 days ago

If you are attending #CVPR2026 and looking for Happy hour suggestions, check this out. 1)World Models & Drinks @reactorworld : https://t.co/1cZKxpJxYB 2)Researcher Reception @nvidia: https://t.co/OtXZxTz1td 3)Robotics & World Models : https://t.co/w9FufM5PYr [Cont]

4

70

4

73

10K

pat ✈️ CVPR

@patrickamadeus_

9 days ago

just realized that the days can get super busy during the conference and u can't just keep opting for all things😭 regardless, excited to bump into some of these!

0

81

pat ✈️ CVPR

@patrickamadeus_

9 days ago

Super helpful! Check it out #cvpr2026 peeps

Gabriele @gabrosi3

9 days ago

#CVPR starts in one week 🚀 One thing that always frustrated me at CVPR was workshop/tutorial days. Schedules are scattered across dozens of websites, and planning your day means opening 20 tabs. So I built CVPR Workshop Radar 👇

gabrosi3's tweet photo. #CVPR starts in one week 🚀

One thing that always frustrated me at CVPR was workshop/tutorial days.

Schedules are scattered across dozens of websites, and planning your day means opening 20 tabs.

So I built CVPR Workshop Radar 👇 https://t.co/wVz35OOjTh

3

31

3

24

6K

1

3

0

1

784

pat ✈️ CVPR

@patrickamadeus_

9 days ago

@Kyriakos_Pelek balancing the training data is always best, but at post-training/test time, exercising a stronger perception module through improved prompting and/or an agentic, multi-phased approach to extract the information can also be the way!

0

44

pat ✈️ CVPR

@patrickamadeus_

9 days ago

I am going to #CVPR2026 to present 3 of my papers! 1. ConfusedTourists ✈️😵‍💫 Geographical object or background perturbation is causing up to -40% VLMs accuracy drop 🚨 2. M4-RAG 🌇⛏️ 80k+ multimodal-multilingual-multicultural RAG hub, the bigger the agent... the accuracy does not always go up 🤔 3. Counting to 4 is still a Chore 👀🔢 VLMs are still struggling with object counting, can attention budgeting help them? [1/4]