If you're excited about helping define what the UX and pattern language for a future of agentic and self-assembling enterprise software, please reach out!
We're looking for NYC-based design engineers to join our small, talent-dense team.
This is an opportunity to define the frontier of HCI. We're building the platform for self-assembling, self-optimizing software already in production with consequential enterprises.
Our eng team is made up of folks who have built and maintained large open-source projects like Neovim and Analytics.js and scaled massive data systems at companies like Google, Stripe, Wix, and Segment.
DM if you are interested or know someone who might be!
you can whine about ensloppification or you can take advantage of the generationally low bar that exists for everything now
when we post a job opening 9/10 applications are total slop, all it takes is one thoughtful email to get properly looked at - but no one puts in the effort
Another thing: what you get from writing things yourself isn't just the code. It's an improved understanding of what the code does. That mental model is what lets you come up with further improvements, or invent a different way of doing things. You can't come up with ways to improve a blackbox you don't understand.
For most projects this doesn't really matter, because the code is the only thing you need. But if you're doing something novel, if you're doing research, the code is not the most important part. Understanding what the code does is the most important part.
Firebolt just published a fascinating breakdown they completely rebuilt their data warehouse platform architecture with a verifiable agent harness for an agent-centric future.
They propose that for the agentic future - one where agents both develop and use the platform - database infrastructure should be monolithic and single tenant.
Interestingly, they also rebuilt their catalog and metadata service using the new Ducklake spec in order to benefit from an open standard and a verified oracle.
using AI for coding is a deeply technical engineering craft
most people don't approach it as so, and don't get the results we associate with high craft
but the ones who do have been sprinting ahead
more tokens wont save you, more thinking + skill + llm intuition will
have been saying this for almost 9 months now
When they announced that “feature” I was pretty sure they had misspoken.
Turns out it *actually* turns on the tokenmaxxing mode if you just say the word “workflow”??
So every time I say the word 'workflow' in Claude Code...
(let's say, when I'm creating a new GitHub workflow)
...it tries to enter 'workflow' mode, spinning up dozens of subagents to complete my task.
Stupid fucking thing
Besides being much smarter, gpt 5.5 is a much more “neutral” model than opus 4.6.
You can always tell when a code change was written by opus. It tends to be extremely reward hacky and comment-ridden, whereas with codex, you can't actually distinguish it.
It’s much more steerable and lets the developers preferences shine through.
Skill used to be a natural speed limit to what you could do in software engineering - but that's no longer true.
A junior engineer could only write so many lines of code that compiled and worked, and their skill was the rate limiter. As they got better the scope of what they could do increased along with the speed which they could do it.
With AI, that's no longer true. The raw output a junior vs senior engineer can produce is roughly the same, yet the quality and risk blast radius is vastly different between the two.
On the one hand, that's great - you're no longer bottle necked by technicalities that may or may not be relevant to what you're trying to achieve (ie. the particular syntax of a language).
But on the other, you now don't have any backpressure. Someone with lesser skill can produce as much in volume as an expert (which Chamath seems to think makes the latter no longer relevant !!) obsfucating the risk and tech debt that used to be naturally limited.
The biggest problem of all is that on the surface the two look the same - which explains a lot of the exec AI psychosis we're seeing right now.
Besides being much smarter, gpt 5.5 is a much more “neutral” model than opus 4.6.
You can always tell when a code change was written by opus. It tends to be extremely reward hacky and comment-ridden, whereas with codex, you can't actually distinguish it.
It’s much more steerable and lets the developers preferences shine through.
From @mitsuhiko 's latest post on AI and the Pi codebase.
It's becoming more and more obvious that coding agents tend to just work in very incremental, short term fashion to accomplish any given task - as a result of the way they've been RL'd.
We collectively need to name this so we can work to avoid it. It's a special type of slop that needs its own term.
Good articulation of a problem with AI not being self-limiting enough.
It feels like we haven't yet given a name to the issue of LLMs being too incremental and reward-hacky rather than pushing back on ideas and suggesting rework of existing code.
It feels like the core thing that leads AI codebases into the pit of despair.
Good articulation of a problem with AI not being self-limiting enough.
It feels like we haven't yet given a name to the issue of LLMs being too incremental and reward-hacky rather than pushing back on ideas and suggesting rework of existing code.
It feels like the core thing that leads AI codebases into the pit of despair.
Skill used to be a natural speed limit to what you could do in software engineering - but that's no longer true.
A junior engineer could only write so many lines of code that compiled and worked, and their skill was the rate limiter. As they got better the scope of what they could do increased along with the speed which they could do it.
With AI, that's no longer true. The raw output a junior vs senior engineer can produce is roughly the same, yet the quality and risk blast radius is vastly different between the two.
On the one hand, that's great - you're no longer bottle necked by technicalities that may or may not be relevant to what you're trying to achieve (ie. the particular syntax of a language).
But on the other, you now don't have any backpressure. Someone with lesser skill can produce as much in volume as an expert (which Chamath seems to think makes the latter no longer relevant !!) obsfucating the risk and tech debt that used to be naturally limited.
The biggest problem of all is that on the surface the two look the same - which explains a lot of the exec AI psychosis we're seeing right now.
tbh I wish software engineers had a similar aversion to AI code. it's great to use AI, but I shouldn't be able to tell you used it.
my fiancee saying she thinks non-tech people are better users of AI because they don't want people to know they're using it
why do people (including me) have an aversion to AI writing but not as much to AI code? if a piece of text smells AI i stop reading it but i use things coded entirely with AI every day