Stephan Hoyer @shoyer - Twitter Profile

Pinned Tweet

about 1 month ago

Things I didn’t expect from becoming a dad:
- I’m now in a secret club with most of the world’s adult population
- I’ve been magically transformed into a morning person!
- Babies are genuinely lots of fun 😊

7

80

0

3

9K

Stephan Hoyer

@shoyer

4 days ago

@def_chris_suter @nikitabier Shouldn’t the most popular of my following just be a sorting order, not introduce random people I’m not following?

0

1

0

49

Stephan Hoyer

@shoyer

4 days ago

@nikitabier did you intentionally break the “Following” tab in X? It suddenly started injecting posts from people I don’t follow and became unusable 🫤

1

3

0

508

Stephan Hoyer

@shoyer

5 days ago

What's the best way for teams to share coding agent skills, especially across Claude/Codex? Checking everything into a shared repo sounds great, until you accidentally trigger your colleague's bespoke workflow.

1

3

0

605

shoyer retweeted

8 days ago

This is the first code bench that actually aligns with how it feels to use these models coding.

119

4K

158

986

300K

Stephan Hoyer

@shoyer

14 days ago

Agents are so charmingly naive about effort estimation -- this is one afternoon's worth of work! Not sure if this will ever get old

shoyer's tweet photo: https://t.co/EpGCnVYXWp

3

10

0

1

1K

Stephan Hoyer

@shoyer

18 days ago

@_sholtodouglas @_arohan_ Claude is notably bad at PDF forms. When I had it look at my tax returns, it confidently misstated which lines I had filled out. I can fix this by asking the model to read PDFs as images but that’s really awkward.

1

19

1

2

3K

Stephan Hoyer

@shoyer

21 days ago

Conductor will be OK -- GPT 5.5 is a better model than Opus 4.7, anyways, and OpenAI is much more accommodating of third-party usage of Codex. Still, what a huge self-own from Anthropic!

3

89

0

2

6K

Stephan Hoyer

@shoyer

21 days ago

Wow, this is incredibly disappointing to hear from Anthropic -- basically a death sentence for Claude in @conductor_build, which simply wraps Claude Code in a nicer GUI. Not sure how it got lumped in with autonomous agents like OpenClaw.

ClaudeDevs

@ClaudeDevs

21 days ago

This means that third-party tools built on the Agent SDK like Conductor and OpenClaw work with your Claude plan, but will draw from your credit the same way your own scripts do.

52

679

23

121

591K

41

529

19

99

113K

Stephan Hoyer

@shoyer

21 days ago

I'm happy to report that this issue seems to be resolved with recent versions of Codex and GPT 5.5 -- it kept on running experiments for me for 4 hours last night!

shoyer's tweet photo: https://t.co/JQR1TsmXaj

Stephan Hoyer

@shoyer

about 2 months ago

Dear Codex, when I tell you to run a massive parameter sweep & model exploration overnight, you should not report “done for the night” after 30 minutes when you self-report that it’s only 2/3 done!

27

374

4

24

39K

0

10

1

2K

shoyer retweeted

Martin Shkreli

@MartinShkreli

22 days ago

i bet on a humbling

24

486

12

100

178K

shoyer retweeted

Arram

@arram

26 days ago

Asked Claude: 'There's a meme called the "fix everything easily switch". What policies do you think are the best candidates for being a real fix everything switch in the US? Give me your top ten, your confidence, your reasoning, and why a given policy has not been implemented.'

arram's tweet photo: https://t.co/qc7cPXk8pB

146

2K

287

2K

926K

shoyer retweeted

Ryan Keisler

@RyanKeisler

30 days ago

I'm excited to finally open-source the model from my 2022 paper, “Forecasting Global Weather with Graph Neural Networks”. Highlights:
• 10-day forecast in <1 min
• Initialize forecasts from ERA5 or IFS analysis
• Scripts for eval, sensitivities, & Hurricane Sandy

18

1K

134

1K

126K

Stephan Hoyer

@shoyer

30 days ago

@RyanKeisler This paper was such a breakthrough! Reading it was the first time I believed that SOTA pure-AI weather prediction was possible. Thanks for sharing, Ryan.

2

18

0

3

2K

Stephan Hoyer

@shoyer

about 1 month ago

@_arohan_ I’m six months in and it’s already great!

0

3

0

186

Stephan Hoyer

@shoyer

about 1 month ago

My one minor ask for @thsottiaux -- could GPT be a little more proactive about explaining what it's doing? Claude gives nice little updates that let me track its progress. Codex will go through dozens of tool calls over tens of minutes and I have to trust it's still on track (which yes, it usually is!)

2

14

1

2K

Stephan Hoyer

@shoyer

about 1 month ago

GPT 5.5 in recent versions of Codex feels like a real breakthrough -- incredibly smart and gets things done. I have a new default coding model. Sorry, Claude!

22

629

21

26

71K

Stephan Hoyer

@shoyer

about 1 month ago

@bravo_abad @AnimaAnandkumar Really cool work, but as the authors note in the discussion, this GPU-based approach is still about 2x slower than sparse direct solvers on the CPU.

0

2

0

209

Stephan Hoyer

@shoyer

about 1 month ago

@xanderai Not sure you’re in the minority. I think those with the bad experience are just very loud!

0

4

0

528

Stephan Hoyer

@shoyer

about 1 month ago

@charlieholtz Steering is such a win, thank you! I wonder if you could do this next-level version or if it would require fixes at the Claude/Codex level: https://t.co/qa0NDoHaDB

Stephan Hoyer

@shoyer

about 2 months ago

Feature request for coding agents: Address my latest request *now* & then decide whether to interrupt. My common pattern:
1. Agent does something iffy
2. Agent starts something slow (e.g., running tests)
3. I have a question about (1) that may or may not require interrupting (2)

5

11

0

1

2K

0

2

0

2

637
