Martin Alderson @martinald - Twitter Profile

Pinned Tweet

6 months ago

Finally got round to starting a blog, if you're interested in AI and software engineering I hope you enjoy it: https://t.co/YYgD8Yyjh9. And feel free to subscribe to my once a month max newsletter there too!

3

48

3

47

5K

Martin Alderson

@martinald

2 days ago

Enjoyed chatting with @pascalfinnete - give it a listen here: https://t.co/b3fCa3oiLu

0

49

Martin Alderson

@martinald

3 days ago

Yeah not a good launch for opus 4.8 at all. Been seeing loads of this spamming huge amount of tool calls. And intermittent auto mode failures on top of this. I've seen this across multiple sessions, even brand new ones (asking it to translate a bunch of markdown files with parallel subagents), opus 4.7 gets it in one and then opus 4.8 implodes trying to OCR markdown files (!?) with a bash tool storm cc @trq212 @bcherny

Xander Steenbrugge

@xsteenbrugge

5 days ago

Is it just me or is Opus 4.8 in CC sometimes just absolutely retarded? In this session it just got stuck in a loop calling "echo" and checking the date 20x times in a row... This has been happening very regularly since the 4.7 --> 4.8 update. WTF? @claudeai @bcherny

24

82

3

7

18K

0

149

Martin Alderson

@martinald

5 days ago

@ClaudeDevs cc @trq212 @bcherny

0

97

Who to follow

ResponseSource

@ResponseSource

We are a network that connects media & influencers to the resources they need, fast. 📧 [email protected]. Part of @PulsarGroupPlc

The FIDO Alliance

@FIDOAlliance

The FIDO Alliance is changing the nature of online authentication.

5 days ago

.@ClaudeDevs auto mode constantly flaking out on opus 4.8. seems worse on longer sessions (been like this for a while...)

1

2

0

131

Martin Alderson

@martinald

14 days ago

So much for the frontier labs subsidising inference!

*Walter Bloomberg

@DeItaone

14 days ago

ANTHROPIC EXPECTS A 130% REVENUE SURGE TO $10.9 BILLION IN THE JUNE QUARTER AND ITS FIRST OPERATING PROFIT- WSJ

48

2K

130

118

515K

0

1

0

182

martinald retweeted

Bindu Reddy

@bindureddy

21 days ago

Gemini 3.2 Flash - Capitalizing on DeepMind's clever distillation techniques... Rumors are that benchmarks show it's hitting 92% of GPT 5.5's performance on coding and reasoning tasks while being 15-20x cheaper on inference costs. The latency improvements are insane - sub-200ms for most queries. Google's distillation + sparsity techniques are paying off massively. They've essentially compressed a frontier model into a flash variant without the usual quality cliff.

157

4K

185

971

920K

Martin Alderson

@martinald

21 days ago

This is what I cannot understand about Anthropic's pricing changes. Many people love conductor, but if you are a heavy user of Claude code via conductor you're going to be running up $1k/month in additional pricing. And given conductor makes it very easy to switch to codex (one button), it's just a huge churn incentive AND marketing exercise for OpenAI. The only thing that makes sense (to me) is Anthropic is still v compute constrained and the spaceX partnership only buys them some time...

Charlie Holtz

@charlieholtz

21 days ago

Here's what Anthropic pricing updates mean for Conductor users: - You can officially use your Claude sub with Conductor - If you're on a max subscription you get $200 in credits and then can pay at API costs - If you use Big Terminal Mode you won't be affected We're going to keep building the best interface for the best coding agents! Excited to show you what we've been cooking🫡

47

388

9

79

68K

0

2

349

Martin Alderson

@martinald

23 days ago

The new Claude Agent View feature is incredible. Cannot believe how far these agents have come in not much more than 12 months.

0

1

0

1

117

Martin Alderson

@martinald

29 days ago

Very interesting

Claude

@claudeai

29 days ago

We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity. This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.

5K

130K

12K

24M

0

1

0

130

martinald retweeted

Rowland Manthorpe

@rowlsmanthorpe

about 1 month ago

I waver between thinking the AI security problem is huge but manageable and thinking it's huge and unmanageable. This piece by @martinald makes a very convincing case for the latter https://t.co/b6R0tBF7My

0

4

1

3

658

Martin Alderson

@martinald

about 1 month ago

@MikeIsaac Not sure about that. Many (most?) companies are buying direct from Anthropic for at least Claude Code (other APIs yes for sure). I don't think anyone on AWS would switch to Azure just to get OAI access - you'd just onboard OpenAI as a new vendor?

0

3

0

358

martinald retweeted

Justin Schroeder

@jpschroeder

about 1 month ago

what. what. what. gpt-image-2 almost passes the pelican test...in a screenshot of a code editor.

69

3K

103

387

321K

Martin Alderson

@martinald

about 1 month ago

@simonw Is Claude Code/Cowork really usable on a Pro plan these days though (at least at peak times w/opus)? Everyone I've recommended it to on pro has went thru their usage limits in a few minutes. But yes the tweets sound quite ominous in general...

0

300

Martin Alderson

@martinald

about 2 months ago

looking like Mythos is a step change and not just marketing hype...

AI Security Institute

@AISecurityInst

about 2 months ago

We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵

AISecurityInst's tweet photo. We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵 https://t.co/gd9hi0Ve55

113

3K

550

1K

1M

0

2

0

157

Martin Alderson

@martinald

about 2 months ago

Resuming ~400k tokens on Max 20 uses ~4% of your 5h limit at peak times. That means 5h usage limits are probably for uncached input tokens: Pro (1x): 500k input tokens / 5h Max 5 (5x): 2.5M input tokens / 5h Max 20 (20x): 10M input tokens / 5h I assume these (double?) at off peak times. Obviously you'll have output and cache reads on top of this, but feels like the uncached input tokens are really burning thru people's limits recently.

martinald's tweet photo. Resuming ~400k tokens on Max 20 uses ~4% of your 5h limit at peak times. That means 5h usage limits are probably for uncached input tokens:

Pro (1x): 500k input tokens / 5h
Max 5 (5x): 2.5M input tokens / 5h
Max 20 (20x): 10M input tokens / 5h

I assume these (double?) at off peak times.

Obviously you'll have output and cache reads on top of this, but feels like the uncached input tokens are really burning thru people's limits recently.

0

175

martinald retweeted

Super Dario

@inductionheads

about 2 months ago

The super important thing I haven’t seen mentioned yet as upshot of this: It’s not just that people won’t HAVE to write code anymore, ITS THAT LITERALLY IT WILL BE UNSAFE TO DO SO

77

2K

128

344

157K

martinald retweeted

Kevin Roose

@kevinroose

about 2 months ago

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.

kevinroose's tweet photo. As always, the best stuff is in the system card.

During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park. https://t.co/klJX0bivnL

75

2K

358

778

1M

Martin Alderson

@martinald

about 2 months ago

Agreed. This is a great summary of how confusing things are right now. I'm actually a bit lost as well with this weekends price changes what the state of play is with Codex too now.

Matt Pocock

@mattpocockuk

2 months ago

I don't know what the fuss is about. Anthropic's rules on using subscriptions are very simple: Claude Code = OK Claude's online platform = OK Agent SDK running in personal software = OK... ish? Agent SDK running in commercial software = NOT OK Claude Code running in CI = ?? Oh, maybe it's not so simple... Agent SDK running in CI = ?? claude -p running in CI = ?? claude -p running in personal software = OK claude -p running on open source software, but run on my personal computer = ?? claude -p running on distributed sandboxes, kicked off by me = ?? Distributing open source software which relies on claude -p, and documenting how to use your subscription with it = ?? A thousand other edge cases = ?? Let me be clear. I have never before experienced, from any developer tool, such a frustrating lack of clarity over the basic terms of usage. I personally asked, 3 weeks ago, and have received nothing but delays. The recent @bcherny announcement did absolutely nothing to clarify things. I say this as someone who just released a Claude Code course - my incentives all align with supporting Anthropic.

176

3K

165

567

503K

0

220

Martin Alderson

@martinald

2 months ago

Second order effects of coding agents. Incredible.

Kyle Daigle

@kdaigle

2 months ago

Yup, platform activity is surging. There were 1 billion commits in 2025. Now, it's 275 million per week, on pace for 14 billion this year if growth remains linear (spoiler: it won't.) GitHub Actions has grown from 500M minutes/week in 2023 to 1B minutes/week in 2025, and now 2.1B minutes so far this week. So we're pushing incredibly hard on more CPUs, scaling services, and strengthening GitHub’s core features. And as a fine purveyor of hand-crafted shit code for many years, I'm not gonna weigh in on that. 🤣

156

7K

569

2K

3M

1

2

0

171

Martin Alderson

@martinald

2 months ago

@trq212 @theo so can we use claude -p in bash scripts locally, eg to summarise documents?

0

1

0

712

Martin Alderson

@martinald

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users