biased estimator

Verified account

@selfattentive

deep learning and big computers

local minima (chicago)

Joined July 2023

1.8K Following

323 Followers

2.9K Posts

Pinned Tweet

biased estimator

about 1 year ago

I cannot overstate how crowded "ai research" has gotten and how low-quality the published material is becoming as a result. Nearly every week now I am seeing throwaway experiments my lab looked into 1 or 2 years ago being dressed up as "research" and posted to arXiv.

1

21

0

1

3K

biased estimator

about 3 hours ago

@DeepDishEnjoyer I think it works because training a big neural net is like doing greedy search in a space with no (or few) local minima

0

1

0

0

39

biased estimator

about 3 hours ago

@DeepDishEnjoyer The OP is interaction bait and incorrect, but I don’t think the universal approximation theorem actually explains much at all about why current deep learning works well.

1

3

0

0

47

biased estimator

about 6 hours ago

@rishabh16_ Depends on the model. If you have a DNA/RNA language model with a causal mask, yes, you can do this. With a bidirectional mask, interpreting attention weights is ~impossible.

0

0

0

0

92

biased estimator

about 6 hours ago

@macrocephalopod I have a very serious quant trading strategy™️ I promise we just have to wait for the college football season to start trading it.

0

1

0

0

65

biased estimator

about 7 hours ago

I’m sure they’re good at hardware or whatever, but every time I’ve had a conversation with one of their ML people I came away unimpressed.

@StockSavvyShay

20 days ago

Cathie Wood bought ~$35M of $CBRS today

650

1K

78

159

576K

0

0

0

0

79

selfattentive retweeted

about 17 hours ago

great moment in every optimizer’s life when he finally runs the EV calc on running EV calcs on everything, realizes the whole thing has been catastrophically negative EV, deletes the spreadsheet and goes outside

19

2K

82

201

41K

selfattentive retweeted

about 9 hours ago

"Attention is just a special case of <abstract math thing> so we generalized it by <neglecting the other 30 abstractions and conditions required for frontier architecture> and we found it performed <p hacking> compared to <naive baseline>"

7

500

20

85

17K

biased estimator

about 10 hours ago

@mandylu Interaction bait

0

0

0

0

33

biased estimator

about 10 hours ago

@Tyler_A_Harper This is like saying universities shouldn’t have computers or microscopes or any one of the countless other tools that make research and learning possible.

0

1

0

0

67

selfattentive retweeted

@Miles_Brundage

1 day ago

Bay Area people are like "become rich. go into debt if you have to"

17

467

11

26

35K

selfattentive retweeted

Andrew Gordon Wilson

about 24 hours ago

When someone says "we need theory of deep learning", note that probably nothing will "count" as a theory of deep learning unless it is *their* theory, or unless it is speaking a language, and using techniques, that they already have a bias towards.

12

200

12

34

12K

biased estimator

1 day ago

@Tyler_A_Harper “Why are they paying for access to the most revolutionary technology of our lifetime? That money should go to paying fat narcissists to stalk UCPD officers and paint graffiti all over Hyde Park”

0

5

0

2

610

biased estimator

1 day ago

@Tyler_A_Harper Well at least Claude enterprise subscriptions will produce something of value

1

1

0

0

489

biased estimator

1 day ago

@JacquesThibs I don’t see any reason why this would be the case

1

3

0

0

107

biased estimator

1 day ago

@Lib_Development What a way to spend time.

1

1

0

0

486

biased estimator

1 day ago

@akarlin French should be bumped up and German should be bumped down

0

0

0

0

147

biased estimator

1 day ago

Milton Friedman

2 days ago

Since everyone is sharing their favourite American authors, who is the best American philosopher?

520

314

18

142

291K

0

0

0

0

52

biased estimator

1 day ago

@PAHoyeck @razibkhan Milton Friedman

0

0

0

0

23

biased estimator

1 day ago

@mattapplepi @Grnfink2 Seems like it puts the trainee into a desperate and violent state of mind that probably comes in handy

1

0

0

0

58

biased estimator

1 day ago

@mattapplepi @Grnfink2 I don’t know anything about military stuff, but even if it isn’t a practical skill, I bet this kind of training is good for teaching a mentality that is useful in war

1

2

0

0

563
