David J Wu @lightvector1 - Twitter Profile

almost 2 years ago

@AlexGDimakis @yevgets Anyways, even if the result is "obvious" common wisdom many years old, there's still upsides to have a paper that formally writes it up and tries to analyze it where others haven't really done so.

1

4

0

74

David J Wu @lightvector1

almost 2 years ago

@AlexGDimakis @yevgets Without ruling out the confounders here, the paper's hypothesis that non-transcendence of their 1500 model is due to lack of "diversity" rather than one of these other effects seems oddly particular and a bit premature.

1

4

0

110

David J Wu @lightvector1

about 2 years ago

Wooo, tensor diagrams are cool. (Transformer self-attention layer, from https://t.co/djCSrI7f4h)

0

2

0

2

1K

lightvector1 retweeted

Thomas Ahle

@thomasahle

about 2 years ago

I always found the tensor notation in Fast Matrix Multiplication algorithms confusing. But using tensor diagrams it's pretty easy to see what's going on:

thomasahle's tweet photo. I always found the tensor notation in Fast Matrix Multiplication algorithms confusing. But using tensor diagrams it's pretty easy to see what's going on: https://t.co/cIJmXOjZ8O

7

757

91

746

91K

Who to follow

PhD student @berkeley_ai. AI persuasion, safety, sign language. Prev @carnegiemellon @polytechnique, intern @msftresearch @deepmind. 🇫🇷🇯🇵

Adam Lerer

@adamlerer

Tuning hypers @AnthropicAI

David J Wu @lightvector1

about 2 years ago

Even though we've known from word2vec and much work since that LLM representations correlate well with human concepts (both in linear additivity, distance/clustering, etc), I still find it cool that it holds up with larger models so far. Lots of space to explore further.

Anthropic

@AnthropicAI

about 2 years ago

New Anthropic research paper: Scaling Monosemanticity. The first ever detailed look inside a leading large language model. Read the blog post here: https://t.co/6RYwxt6nWI

AnthropicAI's tweet photo. New Anthropic research paper: Scaling Monosemanticity.

The first ever detailed look inside a leading large language model.

Read the blog post here: https://t.co/6RYwxt6nWI https://t.co/Oh3RIvgnXx

64

2K

531

1K

755K

0

10

0

2

2K

David J Wu @lightvector1

about 2 years ago

@littmath If you always... ...show as many heads as you can, then prob=0. ...show exactly 9 heads if there are at least 9 heads, then prob=1/11. ...show a random 9 coins and it just happens they're all heads, then prob=1/2. ...show 9 heads only in the case there are 10 heads, then prob=1.

0

1

0

122

lightvector1 retweeted

Samuel Sokota @ssokota

about 2 years ago

SOTA AI for games like poker & Hanabi rely on search methods that don’t scale to games w/ large amounts of hidden information. In our ICLR paper, we introduce simple search methods that scale to large games & get SOTA for Hanabi w/ 100x less compute. 1/N https://t.co/oxopUMkTK2

ssokota's tweet photo. SOTA AI for games like poker & Hanabi rely on search methods that don’t scale to games w/ large amounts of hidden information.

In our ICLR paper, we introduce simple search methods that scale to large games & get SOTA for Hanabi w/ 100x less compute. 1/N

https://t.co/oxopUMkTK2 https://t.co/oKVDBaz6Zg

5

326

51

175

61K

David J Wu @lightvector1

over 2 years ago

There are tons of articles on MCTS, which wastes compute whenever paths lead to the same state, but few on Monte-Carlo *Graph* Search, which doesn't. But implementing MCGS soundly can be tricky! Here's a doc on how to do it, and the theory behind it: https://t.co/J1TiH9Y2QC

lightvector1's tweet photo. There are tons of articles on MCTS, which wastes compute whenever paths lead to the same state, but few on Monte-Carlo *Graph* Search, which doesn't. But implementing MCGS soundly can be tricky! Here's a doc on how to do it, and the theory behind it: https://t.co/J1TiH9Y2QC https://t.co/pR1PSolq1i

3

122

19

109

10K

lightvector1 retweeted

Leela Chess Zero @LeelaChessZero

over 2 years ago

@GoogleDeepMind ..and the blog post with more details is live at https://t.co/zYhQpcBSoR

0

10

3

1

2K

lightvector1 retweeted

Leela Chess Zero @LeelaChessZero

over 2 years ago

In the recent paper https://t.co/tZD0NK49fh @GoogleDeepMind introduced a transformer chess network, but didn't include Lc0 in their comparison. We've used transformers for a while, and our network is stronger with fewer parameters. More details soon.

LeelaChessZero's tweet photo. In the recent paper https://t.co/tZD0NK49fh @GoogleDeepMind introduced a transformer chess network, but didn't include Lc0 in their comparison. We've used transformers for a while, and our network is stronger with fewer parameters. More details soon. https://t.co/CtThnd3uRt

3

90

17

14

7K

lightvector1 retweeted

Samuel Sokota @ssokota

almost 3 years ago

There are two shapes below: one is named “kiki” and one is named “bouba”. Which is which? This is the puzzle we consider in our ICML paper: Learning Intuitive Policies Using Action Features. 1/N https://t.co/07nVGynBb4 ⚫ ✴

4

41

10

14

19K

lightvector1 retweeted

Eugene Vinitsky 🦋 @EugeneVinitsky

over 3 years ago

What is off-belief learning and how does it help us build agents that coordinate only in grounded ways ? Part 1 of a new blog series on intuitive summaries of key ideas in multi-agent RL: https://t.co/H7g1PIHn4D

EugeneVinitsky's tweet photo. What is off-belief learning and how does it help us build agents that coordinate only in grounded ways ? Part 1 of a new blog series on intuitive summaries of key ideas in multi-agent RL: https://t.co/H7g1PIHn4D https://t.co/hE1zTXMjwh

2

66

17

28

24K

lightvector1 retweeted

Lex Fridman

@lexfridman

over 3 years ago

Here's my conversation with Noam Brown (@polynoamial), co-creator of AI systems that achieve superhuman level performance in games of poker and Diplomacy that involves strategic negotiations with humans. This was a fascinating, technical conversation. https://t.co/e6BArJjnag

lexfridman's tweet photo. Here's my conversation with Noam Brown (@polynoamial), co-creator of AI systems that achieve superhuman level performance in games of poker and Diplomacy that involves strategic negotiations with humans. This was a fascinating, technical conversation. https://t.co/e6BArJjnag https://t.co/m9O592F2Cu

64

1K

115

132

0

David J Wu

@lightvector1

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users