Nine years at @elastic , shipping the #observability products other teams got paged on. Watched the same pattern every time: AI tools arrive, talk big, then ask the on-call engineer to retrieve more context.
Today we announce what we built instead..
We talked to a lot of engineers about production incidents.
And today we are launching the first Context and Control Model for Production and announced our $2.5M seed from @marathon_vc , with participation from an exceptional group of angel investors.
#ContextAndControl#AISRE #ProductionAI
5/5Try it now:
Works with your production. Multi-account support for teams with multiple environments.
From scattered context to full production understanding.
4/5Every investigation is persistent.
Start a thread in the terminal during an incident at 3am. Pick it up in the web UI the next morning. Context doesn't die when you close the tab.
Remove the human, ship faster, no approval needed.
Then the CISO walks in and the room goes quiet.
@Gartner_inc predicts 40% of agentic AI projects will be canceled by end of 2027.
#AISRE#Kubernetes#ReliabilityEngineering#SRE
"For a decade, we’ve rewarded the heroes who fix things fast. AI will keep accelerating development; that’s inevitable.
The answer isn’t to slow down. It’s to think faster than we ship." by our CTO, Chris Overton
https://t.co/olshSXa576
1/ Reliability doesn’t live in process — it lives in understanding.
In knowing what changed, when, and how it affects the system.
Full blog & story in the thread 👇
Shift-left is only real if you understand cause-and-effect in your production environment.
When engineers know "this change cascades to payment failures in 90 seconds" BEFORE deploying, reliability stops being an afterthought.
Operational knowledge while writing code, not after incidents.
#GenAI #CausalAI #SRE #ReliabilityEngineering #CloudNative #VibeCoding
Yann LeCun at GTC 2025: "Scaling text prediction won't get us to real intelligence." This matters for production engineering. When your system fails, you don't need LLMs generating plausible explanations.
#SRE#CausalAI#OnCall#ProductionEngineering#ReliabilityEngineering