musho @musHo - Twitter Profile

Pinned Tweet

musho

@musho

over 2 years ago

What if there's a better way/design than the default AI chat interface? A quick exploration:

12

330

12

300

118K

musho

@musho

4 days ago

@darylginn noice

0

1

0

249

musho

@musho

4 days ago

A week and a half in AI

Mercor

@mercor_ai

4 days ago

Claude Fable 5 takes #1 on APEX-SWE: 65.5% Pass@1 overall. It scores ~18pp higher than Opus 4.8. We tested @claudeai Fable 5 on APEX-SWE which measures whether AI models can do real software engineering work. Fable 5 tops our two APEX-SWE categories: - Integration: 61.3% - Observability: 69.7% The standout is Observability at 69.7%, 26pp ahead of Claude Opus 4.8. It is the first model to clear 50% on the category, and the only one that scores higher on Observability than on Integration. Every other model shows the reverse. Observability has been the bottleneck for every model we have measured. Fable 5 is the first to break it. Congrats to the @AnthropicAI team.

mercor_ai's tweet photo. Claude Fable 5 takes #1 on APEX-SWE: 65.5% Pass@1 overall. It scores ~18pp higher than Opus 4.8.

We tested @claudeai Fable 5 on APEX-SWE which measures whether AI models can do real software engineering work.

Fable 5 tops our two APEX-SWE categories:
- Integration: 61.3%
- Observability: 69.7%

The standout is Observability at 69.7%, 26pp ahead of Claude Opus 4.8. It is the first model to clear 50% on the category, and the only one that scores higher on Observability than on Integration. Every other model shows the reverse.

Observability has been the bottleneck for every model we have measured. Fable 5 is the first to break it.

Congrats to the @AnthropicAI team.

16

445

29

79

112K

0

6

0

888

musho

@musho

7 days ago · El Cerrito

@hyumankind this is the way

0

2

0

144

Who to follow

Programátor, otec troch detí a zahraničný agent v https://t.co/BLEa7id01I

Peter K.

@Peter_Kucerka

Marketing for DTC Brands Cocktails & Wines & Coffee

musho

@musho

7 days ago · El Cerrito

@fluixoo @mercor_ai sure thing

0

1

0

30

musho

@musho

8 days ago

Looking for a Brand / Visual Designer in San Francisco. DMs open. Fun fact: Early 2026, Mercor had the highest revenue per full-time designer in the world (~$1B).

3

16

1

19K

musho

@musho

8 days ago

@v2perx @mercor_ai dm sent

0

62

musho

@musho

8 days ago

@ianzelbo @mercor_ai 😅

0

81

musho

@musho

9 days ago · El Cerrito

@martinrariga @heyequals content

0

1

0

224

musho

@musho

9 days ago

@meltedhyperion @0xCharlota @figma I also spent it all on tokens

1

0

22

musho

@musho

11 days ago · El Cerrito

@0xCharlota .md skill

0

4

0

418

musho

@musho

11 days ago · El Cerrito

@DuaneKing used these in HK, pretty practical

1

0

288

musho

@musho

12 days ago · Berkeley

@freebean_co design by @adhikari_manan

0

343

musho

@musho

12 days ago

drinkable swag

3

8

3

0

583

musho

@musho

15 days ago

@mannupaaji Nice! Wish somebody would do it for the sides as well (tired of -1/-2px left margin)

0

2

0

538

musho

@musho

15 days ago

Meals, snacks, and 8x H100s per team... 😅

Mercor

@mercor_ai

15 days ago

We're running a 24-hour hackathon June 19–20 in San Francisco with @cognition, @etched, and @AnthropicAI. $50k top prize. $100k in total awards. Every accepted team gets 8xH100s, Anthropic credits, and Cognition API access. Guest judges include: @BrendanFoody, @robertwachen, and @silasalberti. Apply by 6/12: https://t.co/wuqEpBOSIm

mercor_ai's tweet photo. We're running a 24-hour hackathon June 19–20 in San Francisco with @cognition, @etched, and @AnthropicAI.

$50k top prize. $100k in total awards. Every accepted team gets 8xH100s, Anthropic credits, and Cognition API access.

Guest judges include: @BrendanFoody, @robertwachen, and @silasalberti.

Apply by 6/12: https://t.co/wuqEpBOSIm

12

340

16

205

60K

0

1

0

261

musho retweeted

Mercor

@mercor_ai

16 days ago

We tested @claudeai Opus 4.8 (High) on APEX-SWE ahead of today's release. It's the new #1 at 45.3% Pass@1, nearly 4 points ahead of GPT-5.3 Codex (41.5%). Congrats @AnthropicAI on the release and having three models in the top 5!