Gave a talk on Saturday at @apiconflagos’ API connect: Q2 2026 event themed Systems Design for the age of AI-assisted engineering.
My talk was titled -> Failure as a design input: Moving from resilient to antifragile.
This was hugely centered on recognizing traditional design patterns and addressing the mindset shift that has to happen in order to have systems and APIs that thrive in chaos.
Shoutout to @Greyisheep and the APIConf team for putting this together.
Let’s do this again.
DevOps Lead Interview at Netflix
Round 1 – Systems at Scale, K8s, Cloud & Linux
• How would you implement fine-grained service discovery across 1000+ microservices using Envoy or Istio?
• Explain how you’d leverage eBPF + Cilium to enforce network security policies at runtime, and what the advantages are over traditional CNIs?
• Netflix runs multi-cloud. Describe your approach to cross-cloud routing, IAM, and secret syncing.
• What happens when systemd units fail intermittently on EKS nodes? How do you detect and heal?
• Your app teams demand custom AMIs. What’s your pre-prod vetting strategy at kernel and runtime?
• Walk through advanced kube-probe configurations to detect business logic failures, not just HTTP 200.
• How do you handle DNS-level outages inside a service mesh without a full app redeploy?
• Terraform remote_state backend suddenly times out. What’s your recovery and damage containment strategy?
Workshop Spotlight🧑💻
Get ready to level up your DevOps game with @Atomicdeo, DevOps Engineer at Sterling Bank🧑🍳
If you’re into cloud, DevOps, or scaling systems the right way, this session is where you should be
📆December 5th
Get your free ticket https://t.co/d1m9s6ngdV