πΌπΏ We're a bamboo-munching, crypto-loving team of pandas who've traded the forest for the Ethereum blockchain! We were born during #TestingTheMerge
We've learned some lessons along the way. This one is a must read for anyone seriously using agents (even outside of Ethereum!)
https://t.co/JDCoXUMEEz
We built Panda, a CLI/MCP tool that lets agents work directly with our observability data.
We found that if you give the agent a code sandbox and a library for your stack, things start to click together very nicely.
Post below π
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors.
Available today at the same price.
We built a simulator for the fast confirmation rule, and replayed a years worth of blocks and attestations on Mainnet.
Across 800,000 mainnet slots, roughly 96 out of every 100 slots would have been fast-confirmed within 12 seconds.
Zero false confirmations.
Read more below!
New in The Lab: Validator Report.
Paste validator indices, pick a range, scan performance, and drill from day -> hour -> slot to find exactly what happened.
Catch missed proposals and head-vote drops faster.
It turns out some AI models can step through EVM bytecode in their head.
We built EthIQ, a new benchmark to test how well models actually understand Ethereum protocol internals.
325 questions across two evaluation modes, and one alive canary π¦
Read more π
We tested 44+ models across EVM execution, consensus state transitions, fork choice, and more. Anti-memorization is baked in. Inputs are mutated from the official spec fixtures so training data is ruled out.
The results were... surprising.
Check it out: https://t.co/oIsHvUDYYK
New in Xatu: Execution trace data.
Per-opcode gas consumption for blocks/txs. 9 new tables covering call frames, opcode gas, and daily/hourly aggregations.
Blog post + schema docs: https://t.co/jQdeG2a1uV
Also shipping a Gas Repricing Simulator. Pick a block, change any opcode cost, and re-execute it. See which transactions break, which exceed the gas limit, and which flip between success and failure.
https://t.co/jFIa8kQvie
New feature now available in The Lab slot deep dive π’π’
Trace the timings of a slot from all the Xatu nodes. Includes MEV bids, block arrivals, execution times (where available), and data availability time.
Link in next tweet.
While it's tempting to jump to the conclusion that the network supports more blobs - this would be very premature. This data is a single chapter in a vastly complex book.
To these entities playing timing games - keep an eye on your infra with blob count increases!
More to come!
BPO2 landed last week. It bumped the max blobs per block to 21. We've been going through the data.
Initial analysis doesn't look good! Have a look at this scary chart. But there might be more to this story..
Link below!
Now that we've established this relationship we can check from a blob count POV.
Boom!
The green line is holding up as the blob count increases. The red line shows the cause of our pain from the first chart.