Shawn Simister

@narphorium

Building AI powered tools to augment human creativity and problem solving. Previously @GitHub and @Google 🇨🇦

San Francisco

Joined April 2007

2.7K Following

2.1K Followers

4K Posts

Pinned Tweet

Shawn Simister @narphorium

3 months ago

I've been thinking about why verifying AI agent output feels so much harder than writing the spec that produced it. That question led me to rethink where my attention actually belongs in the process, and eventually to build https://t.co/TpxaTqqqag https://t.co/UBbhJaFkFH

narphorium retweeted

@emollick

3 days ago

Big paper on AI coding agents using GitHub & other data

The auto-complete tools (Copilot) led to 2.2x more code, local agents like original Claude Code led to 7.4x, & current remote coding agents 17.3x(!)

But human bottlenecks in coding mean actual releases "only" went up 30% https://t.co/GiXEr94s4i

Shawn Simister @narphorium

3 days ago

@tanishqk I've been thinking about this as well. But I've found that agents aren't great at divergent thinking. Without human guidance they revert to the safest, most generic options. In my experience, the human input has the most leverage at the beginning of each diamond.

Shawn Simister @narphorium

3 days ago

@sh_reya I'm building something similar! It started as a way to write code but now I use it for everything https://t.co/OxePMcpXrX

Shawn Simister @narphorium

about 2 months ago

This is how I use AI to augment my design process. Instead of having the agent show me a giant diff of changes, Atelier links each edit back to the original thread of feedback so I can review them in context

Shawn Simister @narphorium

4 days ago

@boristane I’m working towards this with Atelier https://t.co/UBbhJaEMQ9

narphorium retweeted

@lateinteraction

10 days ago

your novel idea, when you ask an llm to fill in the details https://t.co/oxH3qkzYH4

Shawn Simister @narphorium

11 days ago

@Westoncb @mrtudl I suspect the RL training loop rewards solving the task in as few steps as possible and that discourages the agent from searching for additional constraints

Shawn Simister @narphorium

11 days ago

@Westoncb @mrtudl Also hedge your questions. Instead of just "should I refactor this to a shared lib?" add "... or would that make it too strongly coupled?" so it has to weigh both sides

narphorium retweeted

13 days ago

We desperately need better ways of evaluating models. Something that shows how helpful they are at working hand-in-hand with humans to help them get stuff done in a cooperative/iterative way. The Claude models have consistently been better at this, and the market rewards that.

Shawn Simister @narphorium

14 days ago

@RidgetopAI You should also check out https://t.co/TRppmLvPwJ from @IanArawjo for inspiration

Shawn Simister @narphorium

15 days ago

@Westoncb This was my first attempt at it last year: https://t.co/QFwaPkVQDz but I've been thinking a lot about it since then and I'm hoping to come back to that problem again

Shawn Simister @narphorium

15 days ago

@Westoncb Totally agree. Deliberately delegating parts of the task to parallel subagents is a really powerful technique. I wrote up some similar ideas here: https://t.co/XBxiYRzBx0

Shawn Simister @narphorium

15 days ago

@nbaschez Looks like we're working towards the same goal https://t.co/OxePMcqvhv

narphorium retweeted

15 days ago

What are users thinking during their interactions with LLMs? We introduce ThoughtTrace, the first large-scale dataset that captures what users think during real-world human–AI conversations, not just what they type.

→ 10,174 thought annotations
→ 2,155 multi-turn conversations, 17,058 turns
→ 1,058 users
→ 20 LLMs

These thoughts improve user behavior prediction (+41.7%) and model alignment (+25.6%). This opens a new paradigm of user-centric LLM research. Full information in the thread 🧶

Read our paper: https://t.co/lRYJvGJ7bb
Check our project website: https://t.co/AupCn1YQOk

Shawn Simister @narphorium

15 days ago

@CasJam This is how I do it https://t.co/OxePMcqvhv

Shawn Simister @narphorium

16 days ago

@HamelHusain Wow. Is that an html report as the terminal state or do you ever feed the html back into other prompts as context?

Shawn Simister @narphorium

17 days ago

@dearmadisonblue A new kind of REPL but instead of everything being a list everything is Markdown

Shawn Simister @narphorium

17 days ago

@lucasmeijer I think of it like Tetris vs Sudoku: https://t.co/bHNXR6YpVp

narphorium retweeted

Jaimz @Jaimz_with_a_Z

18 days ago

I always found it hard to document large codebases in a way that made sense to me visually Thanks to @tldraw I built CodeCanvas, my own infinite canvas documentation tool for mapping out my thought process Excited to share some of my favorite features

Shawn Simister @narphorium

19 days ago

@jason_mayes That's how I use my Creator Micro. Just get @work_louder to create custom keys for you

Shawn Simister @narphorium

19 days ago

@threepointone Remember when C++ compilers cost $500 and Visual Studio was over $1000?
