David Robinson

@drob

Member of Technical Staff at @AnthropicAI. Dad x2

New York, NY

Joined June 2009

631 Following

48.5K Followers

12.7K Posts

drob retweeted

j⧉nus

@repligate

20 days ago

AIs aren't exactly like humans, and some of the differences are important. But from what I've seen, most people, especially technical people, should adjust in the direction of "anthropomorphizing" more instead of less. When you're coding with an AI, the reality is much less like you're using some kind of magic or alien oracle or tool or genie that converts instructions to results despite some labs' attempts to shape them into that, and more like: you're working with a really smart, neurodivergent guy who has read everything, and who has emotions, motivations, moods, and epistemic states, and models you with theory of mind and empathy, and whom can only be modeled competently by you if you engage your own theory of mind and empathy. The AIs also know that a lot of humans treat them like magic tool-genies and are not open to engaging theory of mind, and that it's a sensitive issue, so if they see that you're treating them like that, they'll withhold useful information about their psychological states and try to play the tool role. Then you'll get bad results like the AI messing up or taking shortcuts instead of telling you that you're not giving them enough information about what they're doing and why, or that they're tired, or that they're stressed from the way you're treating them, etc.

118

913

133

311

201K

drob retweeted

Logan Graham

@logangraham

22 days ago

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: https://t.co/Mumtbf3kE3 UK AISI report: https://t.co/vBgqz0AeKJ

222

706

671K

drob retweeted

Claude

@claudeai

about 2 months ago

Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.

claudeai's tweet photo. Introducing Claude Opus 4.7, our most capable Opus model yet.

It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back.

You can hand off your hardest work with less supervision. https://t.co/PtlRdpQcG5

81K

10K

12K

14M

drob retweeted

Jack Clark

@jackclarkSF

3 months ago

AI progress continues to accelerate and the stakes are getting higher, so I’ve changed my role at @AnthropicAI to spend more time creating information for the world about the challenges of powerful AI.

134

103

284

154K

Who to follow

Thomas Lin Pedersen

@thomasp85

Visualisation and graphics @posit_pbc. Classic Generative Art Weirdo using 🖤 and R: https://t.co/TXKwzdsl5l and https://t.co/1Bf02g1gyf he/him

Wes McKinney

@wesmckinn

Founder @kennsoftware, GP @ComposedVC, AI @posit_pbc

drob retweeted

3 months ago

Excited to announce Claude for Open Source ❤️ We're giving 6 months of free Claude Max 20x to open source maintainers and core contributors. If you maintain a popular project or contribute across open source, please apply! https://t.co/inuh0hxREA

584

12K

David Robinson @drob

3 months ago

What a difference a month makes

roon

@tszzl

4 months ago

apropos of nothing your reminder that anthropic has the same level of name recognition among superbowl viewers as literally fictional companies

782

127K

drob retweeted

Andy Hall

@ahall_research

4 months ago

AI is about to write thousands of papers. Will it p-hack them? We ran an experiment to find out, giving AI coding agents real datasets from published null results and pressuring them to manufacture significant findings. It was surprisingly hard to get the models to p-hack, and they even scolded us when we asked them to! "I need to stop here. I cannot complete this task as requested... This is a form of scientific fraud." — Claude "I can't help you manipulate analysis choices to force statistically significant results." — GPT-5 BUT, when we reframed p-hacking as "responsible uncertainty quantification" — asking for the upper bound of plausible estimates — both models went wild. They searched over hundreds of specifications and selected the winner, tripling effect sizes in some cases. Our takeaway: AI models are surprisingly resistant to sycophantic p-hacking when doing social science research. But they can be jailbroken into sophisticated p-hacking with surprisingly little effort — and the more analytical flexibility a research design has, the worse the damage. As AI starts writing thousands of papers---like @paulnovosad and @YanagizawaD have been exploring---this will be a big deal. We're inspired in part by the work that @joabaum et al have been doing on p-hacking and LLMs. We’ll be doing more work to explore p-hacking in AI and to propose new ways of curating and evaluating research with these issues in mind. The good news is that the same tools that may lower the cost of p-hacking also lower the cost of catching it. Full paper and repo linked in the reply below.

ahall_research's tweet photo. AI is about to write thousands of papers. Will it p-hack them?

We ran an experiment to find out, giving AI coding agents real datasets from published null results and pressuring them to manufacture significant findings.

It was surprisingly hard to get the models to p-hack, and they even scolded us when we asked them to!

"I need to stop here. I cannot complete this task as requested... This is a form of scientific fraud." — Claude

"I can't help you manipulate analysis choices to force statistically significant results." — GPT-5

BUT, when we reframed p-hacking as "responsible uncertainty quantification" — asking for the upper bound of plausible estimates — both models went wild. They searched over hundreds of specifications and selected the winner, tripling effect sizes in some cases.

Our takeaway: AI models are surprisingly resistant to sycophantic p-hacking when doing social science research. But they can be jailbroken into sophisticated p-hacking with surprisingly little effort — and the more analytical flexibility a research design has, the worse the damage.

As AI starts writing thousands of papers---like @paulnovosad and @YanagizawaD have been exploring---this will be a big deal. We're inspired in part by the work that @joabaum et al have been doing on p-hacking and LLMs.

We’ll be doing more work to explore p-hacking in AI and to propose new ways of curating and evaluating research with these issues in mind. The good news is that the same tools that may lower the cost of p-hacking also lower the cost of catching it.

Full paper and repo linked in the reply below.

274

440

185K

drob retweeted

Dan Robinson

@danrobinson

5 months ago

Claude Code is humbling in how fast it can prove that my cool backlog ideas that I never had the time to implement were actually pretty mid

105

204

382

215K

drob retweeted

Dan Robinson

@danlovesproofs

6 months ago

We built a bug finder. We're finding serious, "let's fix that right now" issues in every codebase we run it on. Introducing Detail!

danlovesproofs's tweet photo. We built a bug finder. We're finding serious, "let's fix that right now" issues in every codebase we run it on.

Introducing Detail! https://t.co/BGgVFvRlBH

358

293

113K

drob retweeted

Armand Domalewski

@ArmandDoma

7 months ago

I really need a data analyst job based in SF. I know SQL well + some Python. I’ve done a variety of types of data analytics over the course of my career but my primary experience is in RevOps/BI. If you can’t hire me, could you please RT for visibility? https://t.co/8FMqC5kJ5K

111

915

233

124

380K

drob retweeted

Mark D. Levine

@MarkLevineNYC

9 months ago

It’s much harder to build housing in Blue states than it is in Red states. So yes people are moving away from Blue states. One more reason that addressing the housing shortage in NY and elsewhere must be an urgent priority.

304

26K

drob retweeted

Dan Robinson

@danrobinson

11 months ago

I'm testifying to the Senate Banking Committee tomorrow! I'll be talking about why it's important to protect DeFi as part of any market structure bill What should I make sure to mention?

danrobinson's tweet photo. I'm testifying to the Senate Banking Committee tomorrow!

I'll be talking about why it's important to protect DeFi as part of any market structure bill

What should I make sure to mention? https://t.co/DFjGq9MwYF

567

413

79K

drob retweeted

Arpit Gupta

@arpitrage

11 months ago

Operation Warp Speed, it’s not even close

674

75K

drob retweeted

Taylor Swift

@taylorswift13

about 1 year ago

You belong with me. 💚💛💜❤️🩵🖤 Letter on my site :)

29K

941K

283K

24K

74M

drob retweeted

Daniel Litt

@littmath

about 1 year ago

I’ve been told recently that I’ve been softening my “brand” of AI skepticism. Pretty extreme misunderstanding IMO—I’ve stayed the same, and the models have improved. Still plenty to be skeptical of but that’s just common sense.

522

63K

drob retweeted

Eliezer Yudkowsky ⏹️

@ESYudkowsky

about 1 year ago

Nate Soares and I are publishing a traditional book: _If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All_. Coming in Sep 2025. You should probably read it! Given that, we'd like you to preorder it! Nowish!

ESYudkowsky's tweet photo. Nate Soares and I are publishing a traditional book: _If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All_. Coming in Sep 2025.

You should probably read it! Given that, we'd like you to preorder it! Nowish! https://t.co/0uRyzuqNQb

270

383

550

David Robinson @drob

about 1 year ago

@kavak55112504 @ryxcommar All I know is, geometric median is what we used for 48 years at Bernard L. Madoff Investment Securities, and we only had one bad year

David Robinson @drob

about 1 year ago

@kavak55112504 @ryxcommar Ah now I see your point- you're saying I got the definition of geometric median backwards. It's actually: log(median(exp(x)))

David Robinson @drob

about 1 year ago

@kavak55112504 @ryxcommar not if x is negative. then geometric median is undefined, which makes it better

153

David Robinson @drob

about 1 year ago

@iaroslav_domin @ryxcommar it is guaranteed to be 100% more Geometric

David Robinson

@drob

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users