Mateo Díaz @mateodd25 - Twitter Profile

Pinned Tweet

almost 2 years ago

This paper made me smile a lot while working on it, so I want to share a bit about it https://t.co/egypZTIky9. We draw a parallel story to the Eckart-Young Theorem (from numerical analysis) in stochastic optimization/learning problems. (with Josh Cutler and Dima Drusvyatskiy)

3

99

22

78

14K

mateodd25 retweeted

OpenAI

@OpenAI

15 days ago

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

1K

27K

4K

9K

13M

mateodd25 retweeted

Damek

@damekdavis

6 months ago

New paper studies when spectral gradient methods (e.g., Muon) help in deep learning: 1. We identify a pervasive form of ill-conditioning in DL: post-activations matrices are low-stable rank. 2. We then explain why spectral methods can perform well despite this. Long thread

damekdavis's tweet photo. New paper studies when spectral gradient methods (e.g., Muon) help in deep learning:

1. We identify a pervasive form of ill-conditioning in DL: post-activations matrices are low-stable rank.
2. We then explain why spectral methods can perform well despite this.

Long thread https://t.co/xEcpPvr32n

11

335

68

318

99K

mateodd25 retweeted

Jeremias Sulam

@Jere_je_je

11 months ago

NEW PAPER ALERT 📢 Score-based diffusion models are powerful—but slow to sample. Could there be something better? Drop the scores, use proximals instead! We present Proximal Diffusion Models (ProxDM), a faster alternative both in theory* and practice. Here’s how it works 🧵(1/n)

Jere_je_je's tweet photo. NEW PAPER ALERT 📢 Score-based diffusion models are powerful—but slow to sample. Could there be something better? Drop the scores, use proximals instead! We present Proximal Diffusion Models (ProxDM), a faster alternative both in theory* and practice. Here’s how it works 🧵(1/n) https://t.co/ucw6Fn3RAk

1

12

4

3

1K

Who to follow

Chinmaya Kausik

@ChinmayaKausik

ML PhD @UMich. Prev: ML Research intern @JaneStreetGroup, @Netflix, DS intern @Microsoft. Undergrad @iiscbangalore '21. He/him.

Ben Grimmer

@prof_grimmer

Assistant Professor @JohnsHopkinsAMS, Optimization, PhD @Cornell_ORIE Mostly here to share pretty maths/3D prints, sometimes sharing my research

Michael Choi

@michaelchchoi

Assistant Professor @NUSingapore. Applied probabilist. Probability, MCMC, statistical physics, optimization, information theory, TCS. Opinions my own.

Mateo Díaz @mateodd25

12 months ago

We did a bunch of other things on the paper, but I defer the details to our succinct appendix. That's all, folks. (16/n)

0

1

0

91

Mateo Díaz @mateodd25

12 months ago

When does the performance of an ML model transfer across dimensions? https://t.co/cqFMdgl6ZV Kudos to my terrific collaborators Eitan Levin, Yuxin Ma, and @SoledadVillar5. 🧵(1/n)

1

20

3

5

2K

Mateo Díaz @mateodd25

12 months ago

Interestingly, the way in which we identify low-dimensional objects with high-dimensional objects and the choice of the norm for continuity play crucial roles. The task of interest has to be aligned with these choices; otherwise, transferability might fail. (15/n)

mateodd25's tweet photo. Interestingly, the way in which we identify low-dimensional objects with high-dimensional objects and the choice of the norm for continuity play crucial roles. The task of interest has to be aligned with these choices; otherwise, transferability might fail.
(15/n) https://t.co/C5fBVMvUaX

1

0

127

Mateo Díaz @mateodd25

over 1 year ago

@boazbaraktcs I recommend you tell him about NixOS, another beautiful rabbit hole.

0

154

Mateo Díaz

@mateodd25

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users