Bayesian (LessOnline+Manifest)

Verified account

@Bayesian0_0

#1 AI forecaster on Manifold Markets (and #2 across all categories) want everything to make sense

Joined July 2024

2.2K Following

713 Followers

983 Posts

Bayesian (LessOnline+Manifest)

about 11 hours ago

I think this is just a chinese labs + nvidia skill issue mostly, and the more inference demand you expect for ur model the more it justifies pretraining overtraining, so frontier us models are probably much more ‘overtrained’. They also likely are much more mature in their synthetic pretraining data pipelines. But yeah assuming they keep models around for post training for half a year to a year, it just seems crazy to me to some extent that they would pretrain them for only a few days!

0

9

0

1

918

Bayesian (LessOnline+Manifest)

about 12 hours ago

This makes sense thanks for the answer ig I’m forgetting they have a crapton of compute. I think it’s plausible they have 150T unique tokens and 2-3 epochs gets u to 400Tish training tokens, which would get to month+ training run. But maybe I am making numbers up that match my intuition, not sure. Just seems surprising that they could not make use of significantly more compute time than that for pretraining given its apparent importance

1

6

0

0

1K

Bayesian0_0 retweeted

Tamay Besiroglu

1 day ago

I think it's underappreciated how economically valuable AI safety is. A model that frequently goes off the rails, takes dangerous actions, is misleading or deceptive, etc. is simply much less valuable than a model that does not do that.

24

505

33

144

99K

Bayesian (LessOnline+Manifest)

1 day ago

@cqkten And mythos is prolly ~3-4 months ahead in productivity uplift

0

2

0

0

95

Bayesian0_0 retweeted

5 days ago

This benchmark is great! A model that I like scores highly and a model that I dislike scores poorly. This benchmark is slop! A model that I dislike is at the top of the rankings. How can that be possible? I have taste!

5

42

3

1

2K

Bayesian (LessOnline+Manifest)

6 days ago

@fleetingbits @ar0cket1 Ok maybe that’s too many inferential steps to be saying probably

0

1

0

0

18

Bayesian (LessOnline+Manifest)

6 days ago

@fleetingbits @ar0cket1 Probably they realized it led to a higher price / lower usage than they actually needed to charge to maintain margins per unit of compute, so changed it (beyond possible inference efficiency gains between then and now)

1

2

0

0

24

Bayesian (LessOnline+Manifest)

6 days ago

@ValsTutor following ECI trends (my own implementation, slightly diff results to Epoch's) you get to mythos preview level open weights in 10ish months, bc mythos preview is pretty outlier in its capability level

Bayesian0_0's tweet photo. @ValsTutor following ECI trends (my own implementation, slightly diff results to Epoch's) you get to mythos preview level open weights in 10ish months, bc mythos preview is pretty outlier in its capability level https://t.co/fx7PXACNhB

0

6

0

0

141

Bayesian (LessOnline+Manifest)

6 days ago

@justjoshinyou13 this was my take https://t.co/TIhJCghNn4

Bayesian (LessOnline+Manifest)

7 days ago

@ar0cket1 they force fast mode to be API usage! So they decrease the API margin but maintain gross margin by decreasing fraction of tokens that are heavily subsidized

2

6

0

1

285

1

2

0

0

74

Bayesian (LessOnline+Manifest)

8 days ago

@himanshustwts @Gauri_the_great > all our customers this almost certainly includes the general public

1

2

0

0

67

Bayesian (LessOnline+Manifest)

9 days ago

@teortaxesTex Openai models are overspecialized on math so this doesn’t make any sense

1

4

0

0

336

Bayesian (LessOnline+Manifest)

9 days ago

remarkable honesty

10 days ago

what did @tszzl see

altryne's tweet photo. what did @tszzl see https://t.co/MbF8n2Jrsd

27

308

7

76

46K

3

46

0

5

7K

Bayesian (LessOnline+Manifest)

12 days ago

@fleetingbits good takes

0

1

0

0

152

Bayesian0_0 retweeted

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)

14 days ago

Interesting how different are frontier labs' notions of progress towards AGI. OpenAI: "we've disproven an old conjecture in math" Anthropic: "we've discovered ALL the vulnerabilities" DeepSeek: "we've made context free" Google DeepMind: "we've reduced the batch size for Flash"

20

559

27

77

21K

Bayesian (LessOnline+Manifest)

15 days ago

@snowboat84 I doubt it https://t.co/Vfd1g9IKOb

0

0

0

0

33

Bayesian (LessOnline+Manifest)

19 days ago

@SemiAnalysis_ dylan is right

0

8

0

1

361

Bayesian (LessOnline+Manifest)

20 days ago

@celestepoasts I disagree, pre-reasoning to post-reasoning seems like an especially large gap

0

11

0

0

234

Bayesian (LessOnline+Manifest)

22 days ago

@chrisgpt @RobotChocobo They will in a few days and announce it and everyone will react with ‘omg growth slowed!!!’

0

1

0

0

100

Last Seen Users on Sotwe

Trends for you

Most Popular Users