Seth M @_sethmorton - Twitter Profile

Pinned Tweet

about 2 months ago

Why what you attend to can't be static: https://t.co/4R5A5ljyah Transformers can't change what they attend to after training. Backprop is too global and too destructive for continual learning. The brain doesn't work this way. 1/2

_sethmorton's tweet photo. Why what you attend to can't be static:

https://t.co/4R5A5ljyah

Transformers can't change what they attend to after training. Backprop is too global and too destructive for continual learning. The brain doesn't work this way. 1/2 https://t.co/7MxiLrvDVK

16

461

29

473

48K

Seth M @_sethmorton

3 days ago

@GaoShanghua the two fixes that i think would be great: a falsification layer before promotion (anti-collapse metric + mandatory ablation) and a quality-diversity archive instead of one global champion

0

1

0

55

Seth M @_sethmorton

3 days ago

@GaoShanghua after having used it, this still feels like hill-climbing in a multi-team costume: there's one global champion, one-comment critique gate, serial gpu etc. also - no falsification layer, so it tends to game metrics

1

0

65

Seth M @_sethmorton

4 days ago

@bpjzy @alexn5264 , @suganton thoughts?

1

0

330

Seth M @_sethmorton

4 days ago

@beffjezos @CIMCAI @stephen_wolfram this is incredibly interesting

1

0

117

Seth M @_sethmorton

6 days ago

@internetvin based

0

3

0

47

Seth M @_sethmorton

10 days ago

@roland_graser this is a cool benchmark

0

2

0

100

Seth M @_sethmorton

15 days ago

@iocapon true

0

1

0

33

Seth M @_sethmorton

17 days ago

@beffjezos you should check out @Primer - i really respect the mission & founders of this company. I had a fantastic convo with Robert (their head of engineering) a bit ago

0

19

Seth M @_sethmorton

17 days ago

@arthur_hyper88 is this real?

0

18

Seth M @_sethmorton

21 days ago

@poetengineer__ how did you make this visual? it's beautiful

2

9

0

2K

Seth M @_sethmorton

22 days ago

@iocapon true

0

2

0

37

Seth M @_sethmorton

27 days ago

@iocapon @interfere_ so exciting!

0

1

0

81

Seth M @_sethmorton

27 days ago

@bercankilic your graphics are awesome

0

1

0

18

Seth M @_sethmorton

about 1 month ago

@beffjezos i've been super impressed with 5.5 - definitely feels like I can trust it more than Opus

0

1

0

192

Seth M @_sethmorton

about 1 month ago

@beffjezos i first heard of atomic labs and Sam all the way in Munich from @moritzthuening , the work they're doing is amazing and Sam is incredibly cracked - unfortunate but makes sense that they're moving to Texas

1

2

0

251

Seth M @_sethmorton

about 1 month ago

@beffjezos more investment is needed in AI for biology - we need better tooling such that bio companies can increase their internal rate of return.

_sethmorton's tweet photo. @beffjezos more investment is needed in AI for biology - we need better tooling such that bio companies can increase their internal rate of return. https://t.co/BFCUnzBpuH

1

0

152

Seth M @_sethmorton

about 1 month ago

@punit_arani i think this is really focused on directed execution toward pre-formed goals - but the real distinguishing variable is something earlier: your capacity to attend to things outside of distribution in the first place

0

1

0

38

Seth M @_sethmorton

about 1 month ago

@nilscmr check out this blog https://t.co/ibfvFM8SnT

0

2

0

7

310

Seth M @_sethmorton

about 1 month ago

@punit_arani true

0

25

Seth M @_sethmorton

about 1 month ago

@lukas_bongartz @misovalko reality -> we observe messy data π -> We try decompositions μ -> we score them by preferring the most simple explanatory decompositions and the best scoring is our understanding of the world

0

2

0

33

Seth M

@_sethmorton

Last Seen Users on Sotwe

Trends for you

Most Popular Users