Oscar Davis @osclsd - Twitter Profile

Pinned Tweet

15 days ago

We were all wondering whether Categorical Flow Maps (CFMs) could scale... 🤔 I couldn't help trying it out... So we scaled CFMs to 1.7B parameters over 2.1T tokens 🚀🔥 Short summary 🧵⬇️

4

128

32

64

16K

osclsd retweeted

Niklas Rindtorff

@Niklas_TR

1 day ago

Introducing Strong Stochastic Flow Maps TLDR: Stochastic Flow Maps where we learn the stochastic solution path. Work led by Sam McCallum, @zwblasingame, with Timothy Herschelll, @AlexanderTong7, and @JamesFosterBath Arxiv: https://t.co/Hy8WWZOnjE Code: https://t.co/PMe6RoqyZA

5

325

66

265

60K

osclsd retweeted

Jiaming Song

@baaadas

3 days ago

Over the weekend, I was using codex to update my homepage and a paper I wrote a year ago on the topic of diffusion LLMs (should be updated on Monday). https://t.co/qvqldZ9H1w While I did not want to make it too explicit back then, I have argued that discrete diffusion LLMs were not the right thing to do and if diffusion ever works on LLMs continuous dLLMs are the way to go. A year later, we are seeing a lot cool papers in this space, and I hope the community can push for something practical and scalable.

baaadas's tweet photo. Over the weekend, I was using codex to update my homepage and a paper I wrote a year ago on the topic of diffusion LLMs (should be updated on Monday).

https://t.co/qvqldZ9H1w

While I did not want to make it too explicit back then, I have argued that discrete diffusion LLMs were not the right thing to do and if diffusion ever works on LLMs continuous dLLMs are the way to go.

A year later, we are seeing a lot cool papers in this space, and I hope the community can push for something practical and scalable.

8

170

14

70

14K

Oscar Davis @osclsd

5 days ago

@jrrhuang Amazing work! Curious to see how it can get any better than this 😉

0

3

0

410

osclsd retweeted

Jerry Huang

@jrrhuang

6 days ago

Can we guide flow models in just a few steps? 🚀 Flow-based sampling is rapidly moving toward few-step generation. But reward guidance often still requires many steps and costly test-time search. Excited to introduce Flow Map Reward Guidance (FMRG): a training-free framework for few-step guidance with flow maps. FMRG matches or surpasses strong baselines on inverse problems and reward-guided text-to-image generation with: ⚡ as few as 3 NFEs ⚡ up to 10× fewer NFEs on inverse problems ⚡ up to 70× fewer NFEs on reward-guided generation 🧵⬇️

3

86

14

75

21K

osclsd retweeted

Michael Bronstein @mmbronstein

10 days ago

We are recruiting multiple postdocs at Oxford: https://t.co/SnylAZn0l0

2

109

19

41

16K

osclsd retweeted

Luca Ambrogioni

@LucaAmb

13 days ago

The way forward for discrete DLM is to turn them into continuous DLM ;)

1

60

9

39

8K

Oscar Davis @osclsd

13 days ago

@richnanophd Thanks for your message! :)

0

28

Oscar Davis @osclsd

15 days ago

We were all wondering whether Categorical Flow Maps (CFMs) could scale... 🤔 I couldn't help trying it out... So we scaled CFMs to 1.7B parameters over 2.1T tokens 🚀🔥 Short summary 🧵⬇️

4

128

32

64

16K

osclsd retweeted

Tyler Farghly @tylerfarghly

14 days ago

[📄preprint] Diffusion models 🤝 MCMC ! Diffusion model samplers are biased due to discretisation 💡The fix: Metropolis-type adjustment on corrector steps ❗️Challenge: no access to the density ratio, only the score 🔑Insight: the score (and some maths) is all you need... [1/3]

tylerfarghly's tweet photo. [📄preprint] Diffusion models 🤝 MCMC !

Diffusion model samplers are biased due to discretisation

💡The fix: Metropolis-type adjustment on corrector steps
❗️Challenge: no access to the density ratio, only the score
🔑Insight: the score (and some maths) is all you need...
[1/3] https://t.co/jM1wMXhasL

5

345

50

265

19K

Oscar Davis @osclsd

14 days ago

@pmpcurvo Very cool :)

0

4

0

451

osclsd retweeted

Pedro

@pmpcurvo

14 days ago

Guide with examples, not rewards 🐘 Controlling what a pretrained generative model produces is still mostly a choice between three slow options: fine-tune it, attach a reward network, or search at inference. We found flow matching allows a fourth, and it costs almost nothing. In deterministic interpolants, the velocity of the flow is determined by where the trajectory is headed: the endpoint mean. Shift that mean, and the entire flow shifts with it. This turns control into a matter of reference. Change the examples that define the endpoint, and you change the direction the model follows. The examples need not be perfect. They only need to point the flow toward the attribute you want. Color, identity, style, and structure, all controllable through examples. 🧵👇

6

168

29

177

34K

osclsd retweeted

Floor Eijkelboom

@FEijkelboom

15 days ago

Very excited about our work on finding the right drifting direction 🐎 We tackle a core open question in drifting: when does “no drift left” mean the model really matched the data? Kernel-gradient drifting is the answer (with natural extensions to manifolds + discrete data)!

0

81

12

46

10K

Oscar Davis @osclsd

15 days ago

@eb1aexperts Thank you! :)

0

1

0

102

Oscar Davis @osclsd

15 days ago

@FEijkelboom Thanks a lot, my friend 😌

0

1

0

95

Oscar Davis @osclsd

15 days ago

@jdeschena Thanks a lot, Justin! :)

0

1

0

79

Oscar Davis @osclsd

15 days ago

@LiDavid2002 Absolutely right. Thanks, David! :)

0

136

Oscar Davis @osclsd

15 days ago

@Sam_Acqua Thanks a lot! :)

0

1

0

112

Oscar Davis @osclsd

15 days ago

@LucaAmb Thanks a lot! :)

0

1

0

407

Oscar Davis @osclsd

15 days ago

Thanks a lot for reading, and don't hesitate to reach out! 🙂 🔗arXiv: https://t.co/nnOks8aXzh

0

19

2

10

798

Oscar Davis @osclsd

15 days ago

We had little time for this work, as I had just arrived at @Apple MLR. Stay tuned for what's coming next! 😉 Thanks a lot to the incredible team that helped make it possible! ❤️ @NasFilippova @PierreAblin @victorturrisi @AmitisShidani1 M. Cuturi and @LouisBAlgue

1

6

0

734

Oscar Davis

@osclsd

Last Seen Users on Sotwe

Trends for you

Most Popular Users