Lisham @mthlish - Twitter Profile

Pinned Tweet

2 months ago

Nothing’s reaped out of nothing, for free like that. Noooo! Everything does result from: Dedicated and Sincere Work + Heaven’s Approval Indeed, thou shalt reap what thou sowest.

0

4

0

906

Lisham

@mthlish

29 minutes ago

@shadcn in all my latest projects, couldn't build w/o it

0

49

mthlish retweeted

Guillermo Rauch

@rauchg

about 9 hours ago

The next hot programming language is… markdown. A minimal eve agent: 📂 𝚊𝚐𝚎𝚗𝚝/ 📄 𝚒𝚗𝚜𝚝𝚛𝚞𝚌𝚝𝚒𝚘𝚗𝚜.𝚖𝚍 📂 𝚜𝚔𝚒𝚕𝚕𝚜/ 📄 𝚢𝚘𝚞𝚛-𝚎𝚡𝚙𝚎𝚛𝚝𝚒𝚜𝚎.𝚖𝚍 Deployable in one command: 𝚟𝚎𝚛𝚌𝚎𝚕. It’s the most accessible programming has ever been. And likely will ever be, at least for the generation of software fully defined and controlled by us humans. (As a fun fact, one of the initial prototypes for eve was codenamed 𝚕𝚊𝚜𝚝 by @timolins, both in homage to ‘@nextjs for agents’ but also in recognition of how enduring eve’s design feels to us.)

95

845

49

376

60K

mthlish retweeted

The Shift Journal

@TheShiftJournal

about 22 hours ago

“Wasting your time doubting whether you’re going to be successful is pointless.” -Kobe Bryant

2

824

131

383

32K

Who to follow

Andrej

@andrejovanovic1

AI • SaaS • Cloud Sharing wins, failures and lessons learned.

Leunel

@addleonel

Programmer, learner, and researcher.

mthlish retweeted

about 16 hours ago

JUST IN: U.S. Air Force unveils the new Air Force One, a $400 million Boeing 747-8 gifted by Qatar.

309

10K

673

371

348K

mthlish retweeted

The White House

@WhiteHouse

about 16 hours ago

NEW AIR FORCE ONE! ✈️🇺🇸

2K

32K

5K

1K

633K

mthlish retweeted

Elon Musk

@elonmusk

about 17 hours ago

In the future, a trillion times a trillion dollars will be spent on making antimatter to travel to other star systems

24K

310K

24K

17K

35M

mthlish retweeted

yimika| @yimikaaaa

2 days ago

You MUST by all means develop a strong opinion of yourself so you don't end up internalizing the beliefs others have of you.

280

206K

45K

21K

2M

mthlish retweeted

Guillermo Rauch

@rauchg

1 day ago

Agents are motivating so many healthy software habits. Open APIs, documentation (skills), tests (evals), Unix (CLIs), payment & commerce protocols, even wide 𝙰𝚌𝚌𝚎𝚙𝚝 use (markdown/json/html). The original vision of the WWW coming to life before our eyes.

88

1K

89

304

51K

Lisham

@mthlish

about 19 hours ago

True

Peter H. Diamandis, MD

@PeterDiamandis

1 day ago

We taught a generation to be employees in an age that rewards leaders.

118

2K

198

181

49K

0

3

mthlish retweeted

ClaudeDevs

@ClaudeDevs

1 day ago

Earlier today, ~3% of Claude Code Max and Pro users hit a bug that showed an incorrect weekly usage limit, and in some cases blocked them from sending messages. This is fixed, and we're resetting 5-hour and weekly limits for everyone affected. Apologies for the disruption.

736

10K

404

582

2M

mthlish retweeted

OpenAI

@OpenAI

1 day ago

As AI takes on longer, higher-stakes tasks, we want models to carry beneficial and safe behavior into new domains beyond their training—and maintain it under pressure. That’s the idea behind our new research on training models to be broadly and persistently beneficial. https://t.co/6Yw45s1RRq

187

3K

239

866

297K

mthlish retweeted

Arena.ai

@arena

3 days ago

Agent Arena has been live for 2 weeks, with 10 more models now on the new leaderboard. Two highlights worth mentioning: - GLM-5.2 (Max) by @Zai_Org enters the top 10. The strongest open-weight result we've measured, at +9.4% confirmed success and +14.9% praise-vs-complaint relative to baseline. - Claude Fable 5 by @AnthropicAI debuted at #1 across nearly every metric before the U.S. government directive to suspend access. It’s a useful upper bound for where the frontier currently sits. In Agent Arena, we measure models on millions of real-world, long-horizon agentic tasks from a global community of users. Models can access web search, filesystem, and terminal tools to complete complex workflows. The leaderboard measures model performance on outcomes relative to the average model using a causal tracing methodology. Which model will enter the Arena next? Read more about the methodology and check out the live leaderboard (links in thread) 👇

arena's tweet photo. Agent Arena has been live for 2 weeks, with 10 more models now on the new leaderboard. Two highlights worth mentioning:

- GLM-5.2 (Max) by @Zai_Org enters the top 10. The strongest open-weight result we've measured, at +9.4% confirmed success and +14.9% praise-vs-complaint relative to baseline.
- Claude Fable 5 by @AnthropicAI debuted at #1 across nearly every metric before the U.S. government directive to suspend access. It’s a useful upper bound for where the frontier currently sits.

In Agent Arena, we measure models on millions of real-world, long-horizon agentic tasks from a global community of users. Models can access web search, filesystem, and terminal tools to complete complex workflows. The leaderboard measures model performance on outcomes relative to the average model using a causal tracing methodology.

Which model will enter the Arena next? Read more about the methodology and check out the live leaderboard (links in thread) 👇

24

440

50

82

51K

mthlish retweeted

Hugo

@hugorcd

3 days ago

Introducing V, a personal agent template. Built on Eve. Works on iMessage, Slack, and web. GitHub and Linear tools with long-term memory. https://t.co/gy7xKWSrle

24

874

69

962

62K

mthlish retweeted

SpaceX

@SpaceX

3 days ago

Falcon 9’s first stage has landed on the A Shortfall of Gravitas droneship

558

13K

2K

263

1M

mthlish retweeted

The White House

@WhiteHouse

3 days ago

🚨 President Donald J. Trump has SIGNED the Iran Memorandum of Understanding at Versailles in France. 🇺🇸

6K

27K

7K

2K

3M

mthlish retweeted

OpenAI

@OpenAI

1 day ago

GPT-5.5 Instant is now on par with our frontier Thinking models for health-related questions. Every week, more than 230 million people turn to ChatGPT with health and wellness questions, and GPT-5.5 Instant is better at recognizing when urgent care may be needed, asking for relevant context, explaining uncertainty, and making complex information easier to understand. Because GPT-5.5 Instant is available to all free users in ChatGPT, these improvements can help more people. Physician-led evaluation was critical to making these major intelligence gains.

235

4K

286

564

549K

mthlish retweeted

Anthropic

@AnthropicAI

2 days ago

New Frontier Red Team blog: Phase 2 of Project Fetch, where we test how well Claude can program a robodog. Opus 4.7, on its own, was ~20x faster than last year's best human team aided by Opus 4.1. (The robodog, alas, still failed to fetch a beach ball.) https://t.co/CgbBtRf85e

282

2K

167

461

287K

mthlish retweeted

Fox News

@FoxNews

3 days ago

The moment President Trump signs the Iran deal at the Palace of Versailles. The agreement was finalized during a dinner hosted by French President Emmanuel Macron inside the historic palace. The signing marked a major diplomatic milestone after months of negotiations aimed at ending the conflict between the U.S. and Iran.

4K

34K

7K

3K

6M

mthlish retweeted

shadcn

@shadcn

2 days ago

Here's something fun I've been thinking about. Agents like eve are increasingly just files. The shadcn registry is a protocol for distributing files. could the registry be the distribution model agents need?

shadcn's tweet photo. Here's something fun I've been thinking about.

Agents like eve are increasingly just files.

The shadcn registry is a protocol for distributing files.

could the registry be the distribution model agents need? https://t.co/6f318vUvFR

55

873

24

375

50K

mthlish retweeted

Ben Holmes

@BHolmesDev

2 days ago

Every open-source project should be engineering agent loops right now. We've found success managing @warpdotdev with loops for: - Issue triage - Spec generation for larger features - Code review - Even letting agents self-improve their Skills on a cron

19

414

25

516

39K

Lisham

@mthlish

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users