Curious if people would want this.
Someone at the Codex hackathon built an MCP plugin that connects ChatGPT with Codex.
The problem is very specific: GPT-5.5 Pro is currently much stronger at planning, research, and breaking down tasks, but you can’t use it directly inside Codex. Codex is great at execution, but its planning ability isn’t quite at the same level.
Their approach was to use MCP to expose the local dev environment to ChatGPT, so GPT-5.5 Pro handles the planning and task breakdown, while Codex handles the implementation.
In other words: let each AI do what it’s best at. One thinks, the other builds.
There’s also an unexpected bonus: ChatGPT and Codex have separate rate limits. With the same account, connecting them this way can feel like effectively doubling your available quota.
I think this is a pretty interesting direction. A lot of people are still asking, “Which AI tool should I use?” But the real answer might be: use all of them, and make them work together.
The walls between AI tools are starting to come down through protocols like MCP. In the future, it may not be about choosing the single strongest AI, but about building the right AI stack for your workflow.
Would you use something like this?
I’m trying to recreate this beautiful browser game by @probiex007, learning WebGL and Three.js along the way.
Still rough, but it’s already playable in the browser. Huge respect for how
@threejs@webgl_webgpu
I’m trying to recreate this beautiful browser game by @probiex007, learning WebGL and Three.js along the way.
Still rough, but it’s already playable in the browser. Huge respect for how
@threejs@webgl_webgpu
Spent the last few months building this — shipped in0 v1.0.0 today 🚀
A native macOS terminal multiplexer (workspaces / tabs / splits), but the part I actually care about: live AI agent status. See at a glance which split your Claude/Codex agent is in, and whether it's thinking, waiting on you, or done — right from the sidebar.
No more window-hopping to guess if your agent is stuck.
Built clean-room on libghostty + SwiftUI/AppKit.
I built voiceBox because I was tired of uploading my voice to someone else's servers just to clone it or narrate a long doc.
So I made it run entirely on your Mac. Apple MLX + Qwen3-TTS/ASR, native on Apple Silicon. Drop in a reference clip, generate speech in that voice, narrate long text — and nothing ever leaves your machine. After the first model download, it works fully offline.
No cloud. No API keys. No accounts. Your voice stays yours.
macOS 15+, Apple Silicon. Free download:
https://t.co/L1LVWVKygd
spent part of yesterday rewriting a PRD because /goal changes the unit of work.
before: break the feature into 12 tasks and hope the agent doesn’t drift.
after: define the finish line, the guardrails, and the 3 things that must never change. the agent can chew through the middle on its own.
that feels much closer to how i already manage a good engineer: not micromanaging steps, but being strict about acceptance. codex is starting to look more like that than a chatty autocomplete.
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
some random ai thoughts:
- for code, i went from 80/20 claude/gpt to 80/20 gpt/claude in <3 months. surprised by this tbh, and interested to see where the split is at in another 3mo.
- claude still mogs gpt for non-coding agent stuff. codex feels like an engineer (which is great for coding!), whereas claude still feels like a general purpose coworker. gpt still lacks that coworker magic
- i’m pretty meh on opus 4.7. my experience hasn’t been *bad*, but it certainly hasn’t been good. sideways if anything.
- anthropic has got to figure out the compute thing. you can feel it as a user. vibes are all out of whack bc of it. my opinions above are all likely downstream of this. it’s an issue.
- anthropic labs continues to be the goat of ai product. claude design is another hit. it’s fantastic. idk why it’s not talked about more? a+
- updated claude code app is great. i finally switched out of the terminal for it. very well done.
- how are people STILL sleeping on the claude agent sdk? i feel like i’m going insane.
- gpt 5.5 is incredible. the level to which i trust it for engineering is amazing. if i could only have one model rn, it would be this one just bc of strong need for the coding use case.
- codex team is killing it. app has been the gold standard since 5.3 release (buuut i credit conductor team for the ui innovation that everyone is using now). though i could do with a little less passive aggressive shots at ant from the codex team. TARS, dial up class by 30%. it’s a long race guys haha
- i uninstalled cursor this month and am now back to vs code for my ide. composer just can’t hang with claude/gpt, and the product feels a bit all over the place. pretty stoked about the xai thing though, because their team is absolutely stacked and i’m excited to see what they might be able to do with that compute. codex and claude code are t1, cursor is t2. i would love if this deal got xai/cursor to t1 for a real trio there.
- gemini…? seems like this is 2-3 models now where the model seems like a great release and then nobody ever uses it? i’m bullish google/deepmind but weird it hasn’t translated to product use in any form. kinda disappointed still
- no open source models have hit the opus 4.5 level. was hopeful the new deepseek would get there, but nope. good oss agents will have to wait a few more months it would seem…