Alex Alemi @alemi - Twitter Profile

alemi retweeted

Anthropic

@AnthropicAI

3 months ago

A statement on the comments from Secretary of War Pete Hegseth. https://t.co/Gg7Zb09IMR

3K

42K

7K

5K

18M

Alex Alemi @alemi

5 months ago

@geoffreylitt "I produced this with Claude", which I produced with Claude. Like a film or music producer that doesn't operate the camera or play the instruments but vouches for the outcome.

0

5

0

209

alemi retweeted

Pavel Izmailov

@Pavel_Izmailov

over 1 year ago

I am recruiting Ph.D. students for my new lab at @nyuniversity! Please apply, if you want to work with me on reasoning, reinforcement learning, understanding generalization and AI for science. Details on my website: https://t.co/d8uId2LC47. Please spread the word!

16

749

102

328

115K

Alex Alemi @alemi

over 1 year ago

Recently I've been playing around with a quarter-order-of-magnitude system for simple calculations. It gives better precision than single sig-fig calculations using only four, very intuitive, symbols. https://t.co/BO9mLi8pLF

0

8

0

4

789

Alex Alemi @alemi

over 1 year ago

@BlackHC Here's mine. ELBO is the KL between the forward and backward joint: https://t.co/IfB73XinFs

0

4

0

1

163
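One way to read the ELBO remark above, sketched on a toy discrete model: take the "forward" joint to be q(x) q(z|x) and the "backward" joint to be p(z) p(x|z). Their KL divergence then equals the negative expected ELBO minus the entropy of q(x), a term that does not depend on the model. The function names and this reading of "forward" and "backward" are assumptions made for illustration; the linked post may set things up differently.

```python
import math

def kl_joint(q_x, q_z_given_x, p_z, p_x_given_z):
    """KL between the forward joint q(x)q(z|x) and the backward joint p(z)p(x|z)."""
    total = 0.0
    for x, qx in enumerate(q_x):
        for z, qzx in enumerate(q_z_given_x[x]):
            q = qx * qzx
            p = p_z[z] * p_x_given_z[z][x]
            if q > 0:
                total += q * math.log(q / p)
    return total

def elbo(x, q_z_given_x, p_z, p_x_given_z):
    """Per-example ELBO: E_{q(z|x)}[log p(x|z) + log p(z) - log q(z|x)]."""
    return sum(
        qzx * (math.log(p_x_given_z[z][x]) + math.log(p_z[z]) - math.log(qzx))
        for z, qzx in enumerate(q_z_given_x[x])
        if qzx > 0
    )

def entropy(q_x):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(q * math.log(q) for q in q_x if q > 0)

# Toy numbers, chosen arbitrarily for the illustration.
q_x = [0.6, 0.4]                          # data distribution
q_z_given_x = [[0.7, 0.3], [0.2, 0.8]]    # encoder, indexed [x][z]
p_z = [0.5, 0.5]                          # prior
p_x_given_z = [[0.9, 0.1], [0.3, 0.7]]    # decoder, indexed [z][x]

lhs = kl_joint(q_x, q_z_given_x, p_z, p_x_given_z)
expected_elbo = sum(
    qx * elbo(x, q_z_given_x, p_z, p_x_given_z) for x, qx in enumerate(q_x)
)
rhs = -entropy(q_x) - expected_elbo
print(lhs, rhs)  # the two values agree up to floating-point error
```

Because the entropy of q(x) is fixed by the data, minimizing this joint KL over the encoder and decoder is the same as maximizing the expected ELBO.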

Alex Alemi @alemi

over 1 year ago

If you miss the NYTimes needle, especially one that is statistically uniform (https://t.co/uqLw9f69Sw), you can use this page I whipped together (https://t.co/xQ5cFrtRSD) to reason about the correlations between the swing states tonight as results come in.

0

18

1

3

3K

Alex Alemi @alemi

over 1 year ago

@statymath Yeah, since the arcsine transformation makes the Fisher flat, the variance is also isotropic. Maybe that would be a good thing to add: you can easily estimate the standard deviation in a simple estimate as 30/sqrt(n) if you measure things in degrees.

0

1

0

22
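A rough check of the 30/sqrt(n) rule of thumb from the reply above, assuming the probability-to-angle map is θ = arcsin(√p) expressed in degrees (that specific map is an assumption here, not confirmed by the tweet). Under that map the delta method gives a standard deviation of about 1/(2√n) radians, which is roughly 28.6/√n degrees regardless of p. The helper names below are hypothetical.

```python
import math
import random

def prob_to_degrees(p):
    """Map a probability to an angle in degrees via theta = arcsin(sqrt(p))."""
    return math.degrees(math.asin(math.sqrt(p)))

def approx_std_degrees(n):
    """Delta-method standard deviation of the angle: 1/(2*sqrt(n)) radians, in degrees."""
    return math.degrees(0.5) / math.sqrt(n)

def simulated_std_degrees(p, n, trials=2000, seed=0):
    """Empirical standard deviation of the angle estimate from n Bernoulli(p) draws."""
    rng = random.Random(seed)
    angles = []
    for _ in range(trials):
        successes = sum(rng.random() < p for _ in range(n))
        angles.append(prob_to_degrees(successes / n))
    mean = sum(angles) / trials
    return math.sqrt(sum((a - mean) ** 2 for a in angles) / (trials - 1))

n = 200
print(approx_std_degrees(n))          # delta-method value, about 28.6 / sqrt(n)
print(30 / math.sqrt(n))              # the rule of thumb from the tweet
print(simulated_std_degrees(0.3, n))  # Monte Carlo estimate for p = 0.3
```

The point of the transformation is that the spread in degrees depends only on n, not on where p sits between 0 and 1.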

Alex Alemi @alemi

over 1 year ago

Why don't we measure probabilities in degrees? https://t.co/uqLw9f5C2Y

4

56

11

24

6K

Alex Alemi @alemi

almost 2 years ago

@AllThingsGenAI Here is the accompanying blog post: https://t.co/IfB73XinFs

0

3

0

35

Alex Alemi @alemi

about 2 years ago

@dvrshil Yes, thank you!

0

1

0

164

Alex Alemi @alemi

about 2 years ago

In which I try to make sense of most of machine learning: https://t.co/IfB73XinFs

5

293

41

416

62K

Alex Alemi @alemi

about 2 years ago

@iskander @SergeyFeldman It's a custom static site generator I threw together with some janky python scripts: https://t.co/MnXIUmcpx2

0

3

0

25

Alex Alemi @alemi

about 2 years ago

@dythui Basically, anytime we are dealing with continuous random variables I don't feel as though entropy is the most appropriate. It's rare for the appropriate prior to be uniform over the space; usually you want something with some concentration.

0

1

0

1

175

Alex Alemi @alemi

about 2 years ago

@gil2rok Yeah, I believe that's right. For more details on different divergences, check out @DaniloJRezende's fantastic notes: https://t.co/LgnWiGW44n

0

2

0

1

100

Alex Alemi @alemi

about 2 years ago

@gil2rok I really like that the KL divergence is linearly decomposable. While the other f-divergences are reparameterization invariant, they don't decompose naturally. I find Hobson's list of desiderata hard to argue with: https://t.co/2Qzq1vWudU

1

2

0

1

94

Alex Alemi @alemi

about 2 years ago

@dythui Relative entropy is KL, so yeah, I think that fixes things 😀. My take is that everywhere you see entropy, it should really be thought of as KL to a uniform distribution, which, while sometimes appropriate, isn't always, including in many of the places it is used.

0

2

0

134
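The "entropy is KL to a uniform distribution" remark above can be made concrete for a discrete variable with N outcomes: H(p) = log N − KL(p ‖ uniform). A minimal sketch with hypothetical function names; it covers only the finite discrete case, not the continuous one discussed elsewhere in the thread.

```python
import math

def entropy(p):
    """Shannon entropy in nats."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def kl(p, q):
    """KL(p || q) in nats for discrete distributions on the same support."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def entropy_via_kl_to_uniform(p):
    """Entropy rewritten as log N minus the KL divergence to the uniform distribution."""
    n = len(p)
    uniform = [1.0 / n] * n
    return math.log(n) - kl(p, uniform)

p = [0.5, 0.25, 0.125, 0.125]
print(entropy(p), entropy_via_kl_to_uniform(p))  # the two values agree
```

Seen this way, maximizing entropy is the same as minimizing KL to a uniform reference, and swapping in a non-uniform reference only changes the second argument of `kl`.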

Alex Alemi @alemi

about 2 years ago

@dythui I tend to think of KL minimization as more fundamental than max ent: https://t.co/hKP3EZMa0M

0

6

1

4

1K

Alex Alemi @alemi

about 2 years ago

@dpkingma Recommendations: https://t.co/bxpbwDTgGy

0

4

0

1

690

alemi retweeted

Brian Lester @blester125

about 2 years ago

Is Kevin onto something? We found that LLMs can struggle to understand compressed text, unless you do some specific tricks. Check out https://t.co/DRO2IbTFCg and help @hoonkp, @alemi, Jeffrey Pennington, @ada_rob, @jaschasd, @noahconst and I make Kevin’s dream a reality.

0

15

6

4

3K

alemi retweeted

Noah Constant @noahconst

about 2 years ago

Ever wonder why we don’t train LLMs over highly compressed text? Turns out it’s hard to make it work. Check out our paper for some progress that we’re hoping others can build on. https://t.co/mceqpUfZQo With @blester125, @hoonkp, @alemi, Jeffrey Pennington, @ada_rob, @jaschasd

2

81

12

33

32K
