Florent BARTOCCIONI @fbartoc - Twitter Profile

fbartoc retweeted

2 days ago

how does the brain build and track an internal state of the world from (possibly incomplete and noisy) visual observations? i believe visual state tracking will be the grand challenge for vision in the coming years, and i hope this benchmark can be a useful starting line. enjoy!

18

401

41

164

62K

fbartoc retweeted

Kiymet Akdemir @akdemir_kiymet

3 days ago

Game engines let you spawn anything into a scene. World models don't. Once the camera moves, you're stuck with whatever the model dreamed up. 🧵 We fix that with SPAWN, a training-free method that injects a custom concept into the rollout from either an image or a text prompt.

3

57

6

28

6K

fbartoc retweeted

Nikhil Keetha

@Nik__V__

5 days ago

Looks like I didn't do a good job of sharing this before but... Yes, you can infer, visualize, compare, train and finetune all the geometry foundation models in the MapAnything codebase‼️

Nik__V__'s tweet photo. Looks like I didn't do a good job of sharing this before but...

Yes, you can infer, visualize, compare, train and finetune all the geometry foundation models in the MapAnything codebase‼️

1

56

5

20

4K

fbartoc retweeted

Nicholas Boffi

@nmboffi

6 days ago

really excited to finally release this one. guidance is critical for getting flow and diffusion models to do what we want, but most methods in the literature are heuristic and work for unclear reasons. the field likes to frame it as reward-tilted sampling, yet what people run in practice is often nowhere close to that. here we take a different angle, deriving guidance from first principles as an optimal control problem. existing methods drop out as coarse approximations, and the flow map emerges as the fundamental ingredient for effective guidance. our approach is training-free, and reaches state-of-the-art performance across diverse benchmarks at up to 70x fewer NFEs. amazing work by @jrrhuang, justin, kartik, and sheel. stay tuned for more on the finetuning side!

2

151

18

119

14K

Who to follow

Oriane Siméoni @CVPR

@oriane_simeoni

Research Scientist @ Meta FAIR

Bernhard Jaeger

@bern_jaeger

Co-founder of KE:SAI, a non-profit open science AI research lab. https://t.co/JdxBCIZedy

Imagine-ENPC

@ImagineEnpc

Computer Vision team of LIGM/A3SI @EcoledesPonts ParisTech (ENPC)

fbartoc retweeted

Li Yiteng

@liyitengx

7 days ago

AM-ARM200 is here: 6+1 DoF, 1kg payload, 52cm reach — all for a $240 follower BOM. 🤯 It brings the SO-ARM100 open-source philosophy into a whole new capability class. Built for real-world manipulation, not just tabletop demos. And yes, you drive it exactly like a SO-ARM100. Fully compatible with @huggingface @LeRobotHF GitHub: https://t.co/srLxZZgZDr #EmbodiedAI #Robotics #OpenSource @RemiCadene

6

99

10

46

8K

fbartoc retweeted

Andrew Gordon Wilson

@andrewgwils

9 days ago

How much does a language model forget when finetuned on new tasks? We show both model size and optimization matter and forgetting can be nearly eliminated with self-generated replay! https://t.co/Qs9A4n095s w/@mrtnm @dongkyucho @ShikaiQiu @rumichunara @Pavel_Izmailov 1/8

andrewgwils's tweet photo. How much does a language model forget when finetuned on new tasks? We show both model size and optimization matter and forgetting can be nearly eliminated with self-generated replay!
https://t.co/Qs9A4n095s
w/@mrtnm @dongkyucho @ShikaiQiu @rumichunara @Pavel_Izmailov 1/8 https://t.co/Z4tTKGcnxA

18

663

88

541

51K

fbartoc retweeted

Lucas Beyer (bl16)

@giffmana

9 days ago

The people who think this sounds bad cannot possibly be real developers...

27

357

10

59

59K

fbartoc retweeted

Chetaslua

@chetaslua

16 days ago

Gemini Omni Flash - monalisa edition 🤩 This model is so coherent in arts and science at the same time zooming in monalisa showing paints to the molecules to atom with coherent text 🤯 omg we are so back and the speed , super hyped for omni pro

16

399

25

186

161K

fbartoc retweeted

Sam Sheffer

@samsheffer

9 days ago

omni continues to blow my mind (left original / right generated) waymo looks sick in matte black

28

461

28

80

95K

fbartoc retweeted

Alexander Chen

@alexanderchen

10 days ago

Gemini Omni 🕶️ prompt in 🧵

46

1K

78

795

156K

fbartoc retweeted

Michael Arbel @MichaelArbel

10 days ago

What do JEPA-style self-distillation dynamics actually learn — and why do they sometimes avoid collapse? In our new work with @BasileTerv987 and Jean Ponce, we tackle this question. What surprised us: These dynamics provably recover representations with nonlinear-CCA structure.

1

93

11

73

35K

fbartoc retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

10 days ago

Language Models Need Sleep "Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To handle this, we study a sleep-like consolidation mechanism in which a model periodically converts recent context into persistent fast weights before clearing its key-value cache." "increasing sleep duration N for our models improves performance, with the largest gains on examples that require deeper reasoning."

iScienceLuvr's tweet photo. Language Models Need Sleep

"Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To handle this, we study a sleep-like consolidation mechanism in which a model periodically converts recent context into persistent fast weights before clearing its key-value cache."

"increasing sleep duration N for our models improves performance, with the largest gains on examples that require deeper reasoning."

32

909

146

713

66K

fbartoc retweeted

Egor Cherepanov @hirasava_ui

13 days ago

🎉 We released MIKASA-Robo-VLA v1.0.0 — a benchmark suite for studying memory in Vision-Language-Action (VLA) policies for tabletop robotic manipulation. https://t.co/LLN7sCokx2 🧠 The goal is simple: make memory evaluation in robotic manipulation more systematic. 👇

7

222

24

161

46K

fbartoc retweeted

Gouki Minegishi

@GoukiMinegishi

10 days ago

Our paper was accepted as a #ICML2026 Spotlight! Reasoning in LLMs has improved largely by chaining local steps. But is that the whole story? Humans occasionally make inferential "leaps" across domains, a faculty known as analogy. We design a synthetic task to show how small Transformers acquire analogical reasoning, and find that the same signatures appear in pretrained LLMs. arxiv: https://t.co/1WCizIKWly code: https://t.co/82kOKCtJo7

29

1K

162

1K

86K

fbartoc retweeted

Basile Terver

@BasileTerv987

11 days ago

📢 Accepted to TMLR, with reproducibility certification 🏅 v2 of our JEPA-WM study (arXiv:2512.24497) is out, with new data-scaling experiments, a Lipschitz analysis of multistep rollout training, and extended discussions. Recap + what's new 👇 w/ @JimmyTYYang1, Jean Ponce, @AdrienBardes, @ylecun

15

235

30

122

64K

fbartoc retweeted

CHRIS FIRST

@chrisfirst

14 days ago

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

chrisfirst's tweet photo. I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it.

Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image.

Pretty close to the real thing. https://t.co/F5XCm5r36w

81

2K

205

1K

3M

fbartoc retweeted

Hansheng Chen @HanshengCh

22 days ago

New paper: AsymFlow🔥 JiT x0-prediction is not enough for pixel generation. Better keep velocity in a low-rank subspace: - 1.57 FID on ImageNet (best pixel flow model) - Finetunes FLUX.2 klein into pixel space, beats the original on HPSv3/DPG/GenEval (#1 overall on HPSv3) 1/7

HanshengCh's tweet photo. New paper: AsymFlow🔥

JiT x0-prediction is not enough for pixel generation. Better keep velocity in a low-rank subspace:

- 1.57 FID on ImageNet (best pixel flow model)
- Finetunes FLUX.2 klein into pixel space, beats the original on HPSv3/DPG/GenEval (#1 overall on HPSv3)

1/7 https://t.co/FSz46hrJHj

20

281

55

197

54K

fbartoc retweeted

Lily Goli @lily_goli

14 days ago

🚀 🚀 🚀 Excited to share our new paper: Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration What does it take for an agent to stay curious in a 3D world? The answer is memory. 🌐 Project: https://t.co/G4SjLoFJht 📄 Paper: https://t.co/iUFwp5NvRu 💻 Code: https://t.co/KZRaQLyzyh

2

222

40

129

69K

fbartoc retweeted

Kento Nishi｜🐔 @kento_nishi

15 days ago

🚨New paper! 📃Mechanisms of Misgeneralization in Physical Sequence Modeling Planners for the physical world produce motions that look safe, but quietly change quantities the demonstrations are meant to control. When does this happen? Why? Can we predict it before training?👇🧵

kento_nishi's tweet photo. 🚨New paper!
📃Mechanisms of Misgeneralization in Physical Sequence Modeling

Planners for the physical world produce motions that look safe, but quietly change quantities the demonstrations are meant to control.

When does this happen? Why? Can we predict it before training?👇🧵 https://t.co/U7h7zyz3l0

3

54

16

20

9K

fbartoc retweeted

Pedro

@pmpcurvo

15 days ago

Guide with examples, not rewards 🐘 Controlling what a pretrained generative model produces is still mostly a choice between three slow options: fine-tune it, attach a reward network, or search at inference. We found flow matching allows a fourth, and it costs almost nothing. In deterministic interpolants, the velocity of the flow is determined by where the trajectory is headed: the endpoint mean. Shift that mean, and the entire flow shifts with it. This turns control into a matter of reference. Change the examples that define the endpoint, and you change the direction the model follows. The examples need not be perfect. They only need to point the flow toward the attribute you want. Color, identity, style, and structure, all controllable through examples. 🧵👇

6

168

29

177

34K

Florent BARTOCCIONI

@fbartoc

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users