Rourke McNamara @rourkem - Twitter Profile

It’s time to fly! Excited to share the first short brand film for Codex. Catch it airing during Game 1 of the NBA Finals tonight. https://t.co/1J4Epczj8T

100

854

50

80

85K

0

1

0

12

Rourke McNamara

@rourkem

about 5 hours ago

@aiedge_ These benchmarks are less and less useful as the models get more advanced and as the benchmarks become polluted. Best to build your own evaluation process, starting with something pretty lightweight.

0

19

rourkem retweeted

Min Choi

@minchoi

1 day ago

This is wild. OpenAI just dropped Codex Sites. Now anyone can give it a plan, dashboard, launch doc or idea, and turn it into an interactive app with a URL. 5 wild examples:

114

2K

154

2K

274K

Who to follow

Dor Moshe

@DorMoshe

#JavaScript Junkie ✌️

Luke

@luke_pighetti

uncmaxxing post-AGI swe ✨ e/quacc 🦆

Beyang

@beyang

Building @ampcode, founder @sourcegraph

Rourke McNamara

@rourkem

about 12 hours ago

It’s super interesting to watch and not unique to this. You can do this via prompting with any agentic tool like this as long as you give the agents a shared communication layer and permission. A small amount of this coordination is absolutely essential when you have multiple agents working on long-running and adjacent work

0

22

Rourke McNamara

@rourkem

about 12 hours ago

@simonw The per tool is the interesting part for me. This looks line they are actively encouraging people to experiment with a range of tools. The engineer who uses that full budget on Codex, CC, and Cursor is going to get 3x the productivity boost

0

5

0

422

Rourke McNamara

@rourkem

1 day ago

@AnthonyBerlin @thsottiaux For anyone who ran into this and has stuck threads or sessions now: 1/ click the broken session and copy session id 2/ start a new session and ask it to recover context for that session id and complete whatever that session was working on. It’ll even resume goals

0

1

0

175

Rourke McNamara

@rourkem

1 day ago

@karatzas_thomas Local TS workflows feel like the right direction. The debugging loop is tighter when the agent is running on the same machine as your editor.

1

0

278

Rourke McNamara

@rourkem

1 day ago

@luchian_mvp @fastifyjs @trpcio @honojs @middleapi @DrizzleORM @PostgreSQL @Docker @Minio That makes sense because it gives the model a clear goal to work/iterate towards. With this approach you can even use /goal loops to get all the way to a finished product autonomously. Just be careful if you want the API endpoints to work well for other uses as well.

1

0

54

Rourke McNamara

@rourkem

1 day ago

@Layton_Gott Better would be to set yourself up so you can use all of those at the same time and things like skills and agents are setup to work across whichever you are using. No need to have to migrate each time you want to try a different model

1

0

33

Rourke McNamara

@rourkem

1 day ago

This is a big one. We use this internally and it’s been amazing to see the things people have created and what being able to easily build and deploy so easily really unlocks. And this is only the beginning for this feature.

OpenAI

@OpenAI

1 day ago

Building apps has never been easier. With Sites, Codex can turn your work, ideas, and plans into an interactive website or app your team can explore, use, and share with a URL. Rolling out to Business and Enterprise plans, before expanding more broadly.

853

18K

2K

10K

8M

0

1

0

91

Rourke McNamara

@rourkem

2 days ago

@StepFun_ai @kilocode curious what the multi-step part actually changes on a real bug fix. does it mostly help with the planning loop or the tool calling reliability?

0

1

0

126

Rourke McNamara

@rourkem

2 days ago

@98_swagger yeah, having clean and automated validation makes the loop incredibly powerful. a massive step up in usefulness and productivity.

0

1

0

23

Rourke McNamara

@rourkem

3 days ago

@KaiXCreator you get more done, but without the boring parts. so you're spending all of your time on the mentally taxing parts, the parts AI can't do for you. you get much more done, and the net mental tax is lower but more densely distributed. and it is also so much more satisfying

0

1

103

Rourke McNamara

@rourkem

3 days ago

@Kappaemme1926 it is an amazing value. that much usage would cost far more at api prices

0

1

0

16

Rourke McNamara

@rourkem

3 days ago

@Taniyatweets_ writing code by hand won't be useful for long. being able to understand *how* code is written and how it works will be, though. and engineering as a skill is still wildly important to steer things in the right direction, prompt properly, etc taste, though, is the real thing

0

1

0

65

Rourke McNamara

@rourkem

3 days ago

@DanKornas In general, runbooks/playbooks with slim INDEX.md as the router is the pattern I'm having loads of luck with right now. And Codex is pretty solid at keeping them up to date for me

1

0

1

149

Rourke McNamara

@rourkem

3 days ago

@uday_devops I mean, name ANYTHING that nobody hates.

1

0

104

Rourke McNamara

@rourkem

3 days ago

It really depends on skills installed and how you prompt. I never see three paths when I'm using Codex. I finish my prompt, move on to other tasks, and come back to something worth trying out and giving feedback on. Superpowers & GSD push for options by adding that to your prompt.

0

40

Rourke McNamara

@rourkem

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users