Justin Jung @jsjung00 - Twitter Profile

Pinned Tweet

3 months ago

A really interesting phenomenon of diffusion: It's L2 denoising training objective has a closed form solution, the conditional mean, which reduces to a gaussian kernel weighted average of the training points. Sampling from this would only return training data, yet diffusion models generalize. In this blog post, we highlight how properties of euclidean geometry (the concentration of gaussians, the L2 spacing of high dimensional points in R^d) of the diffusion objective and the training dataset can explain diffusion generalization: https://t.co/aSmBOYmHNB

jsjung00's tweet photo. A really interesting phenomenon of diffusion:

It's L2 denoising training objective has a closed form solution, the conditional mean, which reduces to a gaussian kernel weighted average of the training points.

Sampling from this would only return training data, yet diffusion models generalize.

In this blog post, we highlight how properties of euclidean geometry (the concentration of gaussians, the L2 spacing of high dimensional points in R^d) of the diffusion objective and the training dataset can explain diffusion generalization: https://t.co/aSmBOYmHNB

0

1

0

1

526

jsjung00 retweeted

Marco Mascorro

@Mascobot

6 days ago

After coding is solved, the next frontier is computer use. Today, we are launching Use Computer, the infra for evaluating and training models to use all kinds of computers 👇

40

271

22

145

43K

Justin Jung

@jsjung00

about 1 month ago

@jdeschena @caglarml @Jaeyeon_Kim_0 @JonathanGeuter @LucaAmb @ssahoo_ @zhihanyang_ @dvruette @yjelid @LiDavid2002 @LucaEyring @AndrewC_ML @ValentinDeBort1 @ArnaudDoucet1 @chandavidlee @WeiGuo01 @modal very interesting work, and love the interactive blog format/style!

1

2

0

133

Justin Jung

@jsjung00

about 1 month ago

@jm_alexia @MehdiEsmaexl Very exciting! Looking forward to the paper

0

445

Justin Jung

@jsjung00

about 2 months ago

@ayaanzhaque this is awesome congrats!

0

1

0

45

Justin Jung

@jsjung00

2 months ago

@hla_michael really well written! thanks for sharing

0

19

Justin Jung

@jsjung00

3 months ago

@celestepoasts Congrats!

0

1

0

21

Justin Jung

@jsjung00

4 months ago

@elliotarledge Do you have any thoughts on why the drift model has worse generation quality here? The paper seems to report good metrics

1

2

0

421

Justin Jung

@jsjung00

5 months ago

@jm_alexia Agreed—had my LLMs are sycophantic and bad for brainstorming moment today

0

20

Justin Jung

@jsjung00

5 months ago

🧵(6/6) And thanks to the authors of ESM3 and the open source model release for making this investigation possible @THayes427 @proteinrosh @halilakin @sofroniewn @denizzokt @ebetica @robert_verkuil Vincent Tran @deaton_jon @MariusWiggert @rohilbadkundri @irhumshafkat Jun Gong @awfderry @rsm_ai @countablyfinite @TheYousufKhan Chetan Mishra Carolyn Kim @BartieLiam @mnemeth101 @pdhsu @TomSercu @salcandido @alexrives

0

1

0

412

Justin Jung

@jsjung00

5 months ago

🧵 Can a small (1.4B parameter) protein language model solve challenging protein scaffold design tasks by scaling inference compute? Yes—but not simply through scaling the number of samples generated