Jonathan Ragan-Kelley

@jrk

The lyf so short, the craft so longe to lerne.

02139

Joined January 2007

617 Following

1.1K Followers

5.8K Posts

Jonathan Ragan-Kelley @jrk

2 months ago

@stephensacks @grok I'm not sure—I'm not a deep expert in this area, nor do I have that much personal experience using Grok. (I generally find the big 3 more useful, each in different ways.) It certainly seems to be fine-tuned to have at least a superficially more oppositional character.

jrk retweeted

Simon Willison

@simonw

5 months ago

It genuinely feels to me like GPT-5.2 and Opus 4.5 in November represent an inflection point - one of those moments where the models get incrementally better in a way that tips across an invisible capability line where suddenly a whole bunch of much harder coding problems open up

Jonathan Ragan-Kelley @jrk

6 months ago

This is close to, but not the same as, "LLM in a loop with tools," because (in the context of the piece) it emphasizes the significance of the shift to one universal, general-purpose tool which is "just using a computer" (e.g. Bash, etc.)

Jonathan Ragan-Kelley @jrk

6 months ago

@simonw This quote from the recent "Bitter Lesson of LLM extensions" post (https://t.co/bgqghTsJqK) resonated in a way that felt like it belonged in your canon: "An agent isn't just a[n] LLM in a while loop. It's an LLM in a while loop that has a computer strapped to it."
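
The quoted definition can be illustrated with a minimal sketch. Everything named here is hypothetical: `fake_model` is a scripted stand-in for a real LLM call, and `run_agent` is only one way to arrange the loop, not the design from the linked post. The one "tool" is a shell, reached through Python's standard `subprocess.run`.

```python
import subprocess

def fake_model(history):
    """Hypothetical stand-in for an LLM: returns a shell command, or None when done."""
    # Scripted behavior for illustration: issue one command, then stop.
    if not any(role == "tool" for role, _ in history):
        return "echo hello from the computer"
    return None

def run_agent(task, model=fake_model, max_steps=5):
    """Loop: ask the model for a command, run it in a shell, feed the output back."""
    history = [("user", task)]
    steps = 0
    while steps < max_steps:  # step cap so a misbehaving model cannot loop forever
        command = model(history)
        if command is None:  # the model signals that it is finished
            break
        result = subprocess.run(
            command, shell=True, capture_output=True, text=True, timeout=30
        )
        # The command's output becomes the next observation for the model.
        history.append(("tool", result.stdout + result.stderr))
        steps += 1
    return history
```

Calling `run_agent("say hello")` returns the conversation history, ending with the shell output as a `("tool", ...)` entry. A real agent would sandbox the shell, since running model-chosen commands directly is unsafe.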

Jonathan Ragan-Kelley @jrk

7 months ago

@jon_barron Research compute likely isn’t out of line with the conference itself in its impact.

Jonathan Ragan-Kelley @jrk

7 months ago

@jon_barron Also, back of the envelope carbon analysis: CVPR has ~10k submissions and ~10k attendees. If the mean attendee flies round trip NY-LA (some less, some much more), that’s 1 MT CO2. Equivalent mean compute / submission is ~3000 H100-hours (4 months) with average US electricity.
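
A rough check of that arithmetic, reading "1 MT CO2" as about one metric ton per attendee (an interpretation, not stated in the post). All constants below are assumptions chosen for illustration: roughly 700 W per H100, a 1.2x datacenter overhead, and about 0.4 kg CO2 per kWh for average US electricity.

```python
# Back-of-the-envelope sketch; every constant is an assumption, not a figure from the thread.
H100_POWER_KW = 0.7          # assumed ~700 W draw per H100
DATACENTER_OVERHEAD = 1.2    # assumed PUE-style multiplier for cooling and so on
GRID_KG_CO2_PER_KWH = 0.4    # assumed rough US-average grid carbon intensity
FLIGHT_T_CO2 = 1.0           # assumed ~1 metric ton CO2 per NY-LA round trip

def gpu_hours_for_emissions(tonnes_co2):
    """GPU-hours whose electricity use emits `tonnes_co2` metric tons of CO2."""
    kwh = tonnes_co2 * 1000.0 / GRID_KG_CO2_PER_KWH
    return kwh / (H100_POWER_KW * DATACENTER_OVERHEAD)

hours = gpu_hours_for_emissions(FLIGHT_T_CO2)
months = hours / (24 * 30)   # one GPU running continuously, 30-day months
print(f"{hours:.0f} H100-hours, about {months:.1f} months of one GPU")
```

Under these assumptions the result lands near the ~3000 H100-hours (about 4 months) quoted above. Different power or grid-intensity assumptions shift the figure by tens of percent.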

Jonathan Ragan-Kelley @jrk

9 months ago

@abrakjamson @simonw This is great, thanks! I’m unsurprised that Microsoft is out front on this given their longstanding enterprise productivity tools focus and resulting culture. (And good on you for it all the same!) I’m very surprised that others aren’t taking it more seriously by now.

Jonathan Ragan-Kelley @jrk

9 months ago

@simonw I'm curious about your thoughts on policies and issues around providers training on model queries.

Jonathan Ragan-Kelley @jrk

9 months ago

@simonw Anyway, could be another useful thing to elevate more prominently with your platform, as you so effectively have for the “lethal trifecta”! (And: big fan/love your work/thanks for everything you do :)

Jonathan Ragan-Kelley @jrk

9 months ago

@simonw I understand why lawyers don’t want them to promise specifics, but it seems like a huge problem not to have a clear answer. I would have hoped the product/business owners would see this bigger picture cost/benefit and overrule the lawyers’ narrow conservatism by now.

Jonathan Ragan-Kelley @jrk

9 months ago

@simonw @bleuonbase I don’t know either, and it’s *possible* the labs are just empirically confident it won’t memorize PII. But they memorize an awful lot of the training set! See all the stunts to regurgitate copyrighted content. Seems like a huge risk.

Jonathan Ragan-Kelley @jrk

9 months ago

I therefore suspect Google *aren't* e.g. directly throwing chat history into pretraining. But it certainly seems like something everyone—both users and providers—should want to be very clear about.

Jonathan Ragan-Kelley @jrk

9 months ago

It would be a privacy and PR nightmare any time people figured out how to exfiltrate private information memorized from training on chat logs or browser usage.
