nothing like switching to claude for a few days to try out a new model and going back to codex xhigh to remind you how much better 5.5 is right now
it's really not close
@Cesc_Vilanova European lawmakers make it very hard to ship things there early. It’s quite unfortunate. Feels like the incorrect tradeoff, and cookie notices all over again. Saying this as a native European. :(
Using Codex Computer Use to navigate the Settings app is incredible. It knows MacOS so much better than me. Similar experience to it using the command line. Superhuman.
Shopify CEO Tobi Lutke explains Goodhart’s law and why he doesn’t like KPIs or OKRs
“Goodhart’s law is real. The moment a metric becomes a goal, it’s no longer a useful metric… No metric by itself is a complete heuristic for a complex business. There’s a million different tensions in a company, and you can’t keep all of them in harmony by optimizing for one thing.”
For this reason, Shopify doesn’t use KPIs or OKRs. But as Tobi explains, this doesn’t mean they don’t value data and metrics.
“We are extremely data informed. We have invested enormous amounts of money and time into systems that give us basically everything at our fingertips… But what Shopify attempts to do is just not over-fit for what’s quantifiable.”
People love optimizing for highly-quantifiable things because there’s immediate gratification that comes from seeing a number go up. But Tobi thinks that the most important aspects of a product are rarely quantifiable:
“The overlap of the most valuable things you can do with a product and the things that happen to be fully quantifiable are like maybe 20%. Which leaves 80% of a value space unaddressable by the people who only look at quantifiable things.”
He continues:
“Shopify is comfortable with unquantifiable things like taste, quality, passion, love, hate… The sort of deep satisfaction that a craftsperson feels when they’ve done a job well is actually a better proxy if you allow it to be.”
They then have robust analytics systems that tell the company if something’s wrong or a new rollout breaks something.
“We think about it as a cockpit for a pilot. The decisions are still made by pilots, and we think this leads to better results… I think there needs to be more acceptance in business of unquantifiable things… And then metrics take a support function.”
Source: @lennysan (Feb 2025)
Thanks for the feedback on Codex in the ChatGPT mobile app. While it’s in preview, we’re working to improve it fast.
What you can expect next: push notifications, /fork, ability to restore after revoking, better reconnects, fixing the ability to control other devices, fewer mobile thread errors, better git diff & full-file, no plan mode issues, and lots more polish/bug fixes.
You've been asking for this one...
Now in preview: Codex in the ChatGPT mobile app.
Start new work, review outputs, steer execution, and approve next steps, all from the ChatGPT mobile app. Codex will keep running on your laptop, Mac mini, or devbox.
A few things I want:
- Faster / smarter loading of the thread. Stalls the whole app while loading. Seems like the latest messages could load first?
- Faster switching between threads across hosts. Not sure what the best UI is, but it takes many taps to go between threads. Even worse when I have multiple hosts.
- Some way to dismiss a "completed task" notification without opening it.