Todd Hanford @thanford7 - Twitter Profile

It's still incredible that the best coding models use terrible coding practices at times. Both GPT 5.5 and Opus 4.8 often guess at fallback values. Here's an example output from GPT 5.5 (~25 LOC). The LiveKit docs (which GPT 5.5 has access to via MCP) show that these values can be accessed through one key each. So this can be reduced to 3 lines. Always a good reminder to slow down and review the code.

thanford7's tweet photo. It's still incredible that the best coding models use terrible coding practices at times. Both GPT 5.5 and Opus 4.8 often guess at fallback values. Here's an example output from GPT 5.5 (~25 LOC). The LiveKit docs (which GPT 5.5 has access to via MCP) show that these values can be accessed through one key each. So this can be reduced to 3 lines.

Always a good reminder to slow down and review the code.

0

18

Todd Hanford

@thanford7

6 days ago

@Vtrivedy10 Wow that's way cheaper than I thought! Thanks for the detailed write up!

0

14

Todd Hanford

@thanford7

7 days ago

We've seen many examples of AI agents doing terrible things like deleting production databases. I call this the "Murphy's Law of AI" - what an agent can do, it will do. This raises the important question of "what should an AI agent be allowed to do?" However, this misses the important nuance that AI agents are acting on behalf of a specific individual (or company). The better question is "what should an AI agent be allowed to do for {x} individual?" The answer is that it depends on the situation. There are three types of agent access you should think about: - User delegated = the agent inherits the user's permissions. This is best when the agent is not shared with others and acts as a personal assistant. - Agent owned = the agent has it's own permissions, essentially acting like a unique employee. This is best when the agent performs tasks in the background (e.g. cron jobs) - User/Agent intersection = the agent receives a union of its permissions and the user's permissions. This is best when an agent is shared across users. E.g. a legal research agent where each lawyer has access to different cases. I break this down further in the article.

Todd Hanford

@thanford7

7 days ago

https://t.co/KKby6Nw3ZK

0

1

0

80

0

33

Todd Hanford

@thanford7

7 days ago

https://t.co/KKby6Nw3ZK

0

1

0

80

Todd Hanford

@thanford7

12 days ago

All of these new layers of agentic programming (loops on top of agents, nested sub-agents) feel kind of like exotic derivatives. Complex - no one knows exactly what they are doing Illiquid - few people have adopted them Pricing difficulty - you stand to burn a lot of money/tokens and the underlying value is difficult to predict

thanford7's tweet photo. All of these new layers of agentic programming (loops on top of agents, nested sub-agents) feel kind of like exotic derivatives.

Complex - no one knows exactly what they are doing
Illiquid - few people have adopted them
Pricing difficulty - you stand to burn a lot of money/tokens and the underlying value is difficult to predict

0

1

0

9

Todd Hanford

@thanford7

13 days ago

I think there are two different models: 1) Legal product (like contract review) with human (lawyer verification) 2) Law firm with which uses AI tools The first treats lawyers as fungible. The second treats AI as fungible. You can tell which one you're dealing with by checking whether the website shows the partners at the firm.

0

1

0

7

Todd Hanford

@thanford7

13 days ago

@martin_casado A B+ player with no feedback regresses to an F player. This bookkeeping test is a good example https://t.co/YkhTDVdfux

thanford7's tweet photo. @martin_casado A B+ player with no feedback regresses to an F player. This bookkeeping test is a good example
https://t.co/YkhTDVdfux https://t.co/ria9EB5Mo0

0

2

0

66

Todd Hanford

@thanford7

13 days ago

@GergelyOrosz The pessimistic take is that a drug dealer is never going to tell you to do less cocaine.

0

5

0

289

thanford7 retweeted

Flo Crivello

@Altimor

18 days ago

Pulled the trigger today and switched 100% of Lindy traffic to DeepSeek v4, churning from Anthropic models. Saves us millions of $ and we're actually seeing an *increase* in performance on many core use cases. Transformative for the business.

170

3K

162

1K

935K

Todd Hanford

@thanford7

19 days ago

Everybody trashes AI content slop, but I'm actually enjoying reading and co-creating blog articles with an LLM

0

12

Todd Hanford

@thanford7

23 days ago

I've been complaining about this too! It's gotten really bad recently. I've had to report dozens of email domains as phishing, but @gmail doesn't automatically block or send these emails to spam even after I report them. I've had to set up automatic rules to delete these emails because I was getting dozens a day. These emails are so easy to identify as spam!

thanford7's tweet photo. I've been complaining about this too! It's gotten really bad recently. I've had to report dozens of email domains as phishing, but @gmail doesn't automatically block or send these emails to spam even after I report them.

I've had to set up automatic rules to delete these emails because I was getting dozens a day. These emails are so easy to identify as spam!

1

6

0

2

615

Todd Hanford

@thanford7

26 days ago

https://t.co/HgdDqrG9Ba

0

2

135

Todd Hanford

@thanford7

27 days ago

The new Google "Modern Web Guidance" is a joke. The user experience "mega skill" is a single skill file with: - 8 guides all with broken links - 3 of the 8 guides are related to scrollbars... These are the most random UX guides ever. I don't even think you could prompt Claude or Codex to write a skill this bad. https://t.co/2xQ4U2RJw8

thanford7's tweet photo. The new Google "Modern Web Guidance" is a joke. The user experience "mega skill" is a single skill file with:
- 8 guides all with broken links
- 3 of the 8 guides are related to scrollbars...

These are the most random UX guides ever. I don't even think you could prompt Claude or Codex to write a skill this bad.

https://t.co/2xQ4U2RJw8

0

265

Todd Hanford

@thanford7

27 days ago

In my experience it's a "tragedy of the AI commons". Our competitors pitch our customers that they have fantastic new AI capabilities (real or not doesn't matter) and so we have to do the same. Internally, it's hard to tell whether using tools for coding and other activities provides an advantage. IMO, a big difference between a staff engineer and a junior engineer is that a staff engineer has had to live with the consequences of their coding decisions for over a year. It's too early to say whether companies are truly accelerating their engineering or they are creating a "slop bomb" that will bite them much later.

0

6

0

213

Todd Hanford

@thanford7

about 1 month ago

@jaezun_ Looks great! Do you have to create your own chief of staff agent to create other agents or is that custom built / bootstrapped?

1

3

0

549

Todd Hanford

@thanford7

about 1 month ago

For the haters in the comments, you can just one shot the agent swarm using Wispr Flow to pipe into Claude mobile connected to your Mac mini through Claude remote control. You could even run a local hosted open source model if you're concerned about cost. Just connect that to your pi agent and add tools for image parsing and generation.

2

4

0

306

Todd Hanford

@thanford7

about 1 month ago

@chamath User error All you have to do is setup an agent swarm with: consolidator agent splitter agent analysis subagents deck generator agent

10

38

1

8

4K

Todd Hanford

@thanford7

about 1 month ago

@chamath Or have @jason setup an OpenClaw agent to run this analysis overnight for $400 in API costs. Probably best to add your credit card credentials in case your Claw needs to use external API services

0

2

0

294

Todd Hanford

@thanford7

Last Seen Users on Sotwe

Trends for you

Most Popular Users