Receipts agent call β did it use the tools, did it follow your rules β as a column you can query, not a vibe. We documented the whole build, including a regression we shipped ourselves and a claim we later retracted when real data killed it.
Full story π
https://t.co/hgJflddW6w
Your coding agent has access to your repo. Itβs supposed to read the actual files before answering.
The receipt says success. The answer looks right.
But it never opened the file β it answered from training priors. Confidently. And youβd never catch it from the logs.
We proved it on ourselves. Same prompt, two models, a repo with no src/auth.ts:
β one checked the directory and refused to fabricate
β one wrote 5,454 chars of vulnerability report for a file that doesnβt exist
Both returned status: success. π’π’
So we built ATO: receipts