Brandon Hudgens

11 months ago

Indeed. Always a scam.

11 months ago

These plans are a pure gold mine for Apple. The vast majority of people will pay $240 a year for the next several years and never use it. That’s why insurance is a great business!

78

2K

68

127

244K

0

10

11 months ago

Very excited for this

11 months ago

Power On: Apple’s first foldable iPhone is arriving next year and it will be its most un-Apple like launch yet. Here’s why — https://t.co/iFYiSb162n

82

1K

71

179

204K

0

33

agentengineer retweeted

11 months ago

Yet still no Instagram for iPad 🤣

22

488

22

15

58K

agentengineer retweeted

11 months ago

Okay wtf is going on

100

2K

26

120

222K

agentengineer retweeted

Greg Kamradt

@GregKamradt

11 months ago

We got a call from @xai 24 hours ago “We want to test Grok 4 on ARC-AGI” We heard the rumors. We knew it would be good. We didn’t know it would become the #1 public model on ARC-AGI Here’s the testing story and what the results mean: Yesterday, we chatted with Jimmy from the xAI team, who wanted us to validate their Grok 4 score. They did their own testing on the ARC-AGI-1 & 2 public evaluation set To validate their score (and measure possible overfitting), we self-tested the new model on our semi-private evaluation set We walked them through our testing policy: * No data retention * Model checkpoint must be intended for public use * Temporary increase in rate limits for burst testing They were on board, so we got started Initially, we ran into timeout errors with normal requests, so we switched to streaming. That resolved the issue So, what do these results mean? First, the facts: Grok 4 is now the top-performing publicly available model on ARC-AGI. This even outperforms purpose-built solutions submitted on Kaggle. Second, ARC-AGI-2 is hard for current AI models. To score well, models have to learn a mini-skill from a series of training examples, then demonstrate that skill at test time. The previous top score was ~8% (by Opus 4). Below 10% is noisy Getting 15.9% breaks through that noise barrier, Grok 4 is showing non-zero levels of fluid intelligence But the mission isn’t over. We need new ideas to solve ARC-AGI-2. Scale alone won’t get us there Come work on ARC-AGI with us

290

7K

784

1K

15M

11 months ago

Where's the API?! You said it was live! @elonmusk

0

2

11 months ago

I knew it. The worst.

11 months ago

Surprise! Grok 4 is not dropping on the API today. I'm sure it will happen in a few months...

38

218

3

12

97K

0

2

agentengineer retweeted

11 months ago

Surprise! Grok 4 is not dropping on the API today. I'm sure it will happen in a few months...

38

218

3

12

97K

11 months ago

For anyone interested in AI, I can't recommend @natebjones YT channel enough. A refreshing voice in a forest of ill-informed channels

0

2

11 months ago

@markgurman How does @theo feel about this

0

4

agentengineer retweeted

11 months ago

It is incredible that Apple design decisions developed over multiple years can be influenced by a week of Twitter and YouTube commentary.

323

7K

306

301

480K

agentengineer retweeted

Russillo

@ryenarussillo

11 months ago

Had anyone ever done this before? Take two weeks off, then come back to work?

183

6K

160

163

772K

agentengineer retweeted