Two years ago we published our stance on AI in engineering interviews. Core question: does the candidate understand the code?
That question still holds—but it's no longer where we gather our most interesting signal.
Worst pattern: yolo --dangerously-skip-permissions, accept every diff without reviewing. The code kind-of-works but tells us nothing we couldn't learn from the model itself.
Surprise finding: plan-first thinking is way rarer than expected. Most candidates jump straight to "make the model write code."
Same pattern as alarm fatigue in hospitals—when monitors beep constantly, nurses learn to ignore them. The safety system becomes the danger.
Auto mode could be the right trade-off: let the model assess risk rather than prompting on everything. Cautiously hopeful!
Anthropic just announced auto mode for Claude Code permissions—I am unreasonably excited about this!
For the last few weeks, Claude Code's permission system has been quietly training us to be **less** safe.
What the team actually did:
• Attempt to preempt the permission model—custom allow/deny lists, hooks to block destructive commands, mining past transcripts for things we'd been approving on autopilot
• Switched to pi ($1K in API tokens, still better than the prompt loop)
• Moved work to remote agents (Devin, Niteshift) partly just to avoid the babysitting
• Probably ran with --dangerously-skip-permissions without telling me, tbh
The safety mechanism was producing the opposite of safety.
@ntkris Getting increasingly hard to find (although obvs not impossible) to find things which don't fit the latter two categories – at least in white-collar settinga
We're automating @elicitorg. When everyone goes on holiday in December, the company keeps running. Bugs triaged. Features shipped. Metrics dutifully updated by robots who don't know it's Christmas.
Step one: figure out what to automate. I set up a Notion database, asked people to decompose their work into triggers, steps, outputs.
Turns out this was the wrong question…
The Elicit API is now available in preview for Pro and Teams users. You can search 138M+ papers and generate Research Reports from your code, scripts, or AI tools.
Get your API key at https://t.co/Q3r5PggChI and check out https://t.co/yMxHqkK0Ob
@ntkris We get a lot of value from spending time with customers to see what's *actually* slowing them down – pretty hard to shadow yourself though!
… or is it easy? I don't know at this poitn
We're running a retreat next week to automate as many of these rote tasks as we can. Demos on Friday. More on what we learn soon.
Starting lesson: don't ask people to map their jobs. Ask what they'd love to never do again.
What's the most tedious part of your job?