Are you at KubeCon Salt Lake City? Watch the new video I recorded with Natan Yellin @aantn from @RobustaDev: Using Azure AI and HolmesGPT as an AI Assistant for AKS Alerts.
The Robusta team is at KubeCon! They are close to the CNCF store.
#holmesGPT @azuretar
https://t.co/S7FuWd2sMU
🎉 Are you attending KubeCon NA 2024 too? It would be a pleasure to meet you there! I'll also be at the @RobustaDev booth (T45) every day. See you soon :)
DevOps failed. DevOps was supposed to be "devs doing ops" - or at least something close to that. There was never supposed to be a job called "DevOps Engineer".
But where did it go wrong and why did everyone need to hire DevOps engineers anyway? More on that tomorrow.
Are you looking to improve the speed and efficiency of your cloud development workflow?
@Aviramyh and @aantn discuss how mirrord can transform your #K8s development experience into a magical journey on the @RobustaDev channel.
Find out more in this video: https://t.co/SFODocnnRi
The future of AI is K8s troubleshooting.
Why is my application not running?
Ask HolmesGPT by @RobustaDev. It will look at the:
- Pod Status
- Deployment Config
- Network Policies
- Resource Usage
It repeatedly fetches missing data until the problem is resolved.
https://t.co/XtTwPT3Gwa
📢 Using Kubernetes and want to shrink your cloud bill? Start by checking if your teams allocated the right amount of CPU/Memory to their pods.
In this video by our CTO @arikalon1, see how you can save $$ with minor changes in just 5 minutes.
https://t.co/BZzgwX0Vy7
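As a rough illustration of that check, here is a minimal Python sketch that compares each pod's CPU request with its observed usage and flags likely over-allocation. Everything in it is an assumption for illustration: the `PodUsage` shape, the rule of "95th percentile plus 20% headroom", and the 100m savings threshold are hypothetical and not taken from the video. The usage samples would come from your own metrics.

```python
import math
from dataclasses import dataclass


@dataclass
class PodUsage:
    """Hypothetical record: one pod's CPU request and observed usage."""
    name: str
    cpu_request_m: int        # requested CPU, in millicores
    cpu_samples_m: list[int]  # observed CPU usage samples, in millicores


def percentile(samples: list[int], pct: int) -> int:
    """Nearest-rank percentile of a non-empty list of samples."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def suggest_cpu_request(pod: PodUsage, pct: int = 95, headroom_pct: int = 20) -> int:
    """Suggest a request: the chosen usage percentile plus some headroom."""
    base = percentile(pod.cpu_samples_m, pct)
    return math.ceil(base * (100 + headroom_pct) / 100)


def overprovisioned(pods: list[PodUsage], min_savings_m: int = 100):
    """Return (name, current, suggested, savings) for pods worth shrinking."""
    report = []
    for pod in pods:
        suggested = suggest_cpu_request(pod)
        savings = pod.cpu_request_m - suggested
        if savings >= min_savings_m:
            report.append((pod.name, pod.cpu_request_m, suggested, savings))
    return report
```

The same idea applies to memory, though memory usually deserves more headroom because running out of it gets the container killed rather than throttled.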
📺 NEXT WEEK: Natan Yellin (@aantn), CEO of @RobustaDev, joins #PagerDutyCommunity to discuss how PagerDuty and Robusta can work together to effectively manage incidents in your Kubernetes environments.
LinkedIn Live ➡️ https://t.co/LdZcPVDNqH
Twitch ➡️ https://t.co/ycLZyZT5u2
How do you really know if your latest deployment succeeded?
You can use an AI agent to check!
This is an interesting use case for HolmesGPT that a user told me about on Slack. He is running HolmesGPT in CI/CD to check on new deploys and verify they're healthy. If there is a problem, Holmes investigates and messages his team on Slack.
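Here is a minimal Python sketch of what such a CI/CD step might look like. It assumes the pipeline already has the Deployment as a dict (for example, parsed from `kubectl get deployment <name> -o json`) and reads standard Deployment fields such as `status.availableReplicas`. The `investigate` and `notify` callables are hypothetical placeholders for however you invoke HolmesGPT and post to Slack. They are not real HolmesGPT or Slack APIs, and this is not necessarily how that user set it up.

```python
from typing import Callable


def rollout_problems(deployment: dict) -> list[str]:
    """List reasons a Deployment does not look fully rolled out."""
    meta = deployment.get("metadata", {})
    spec = deployment.get("spec", {})
    status = deployment.get("status", {})
    desired = spec.get("replicas", 1)  # Kubernetes defaults replicas to 1

    problems = []
    if status.get("observedGeneration", 0) < meta.get("generation", 0):
        problems.append("controller has not observed the latest spec")
    updated = status.get("updatedReplicas", 0)
    if updated < desired:
        problems.append(f"only {updated}/{desired} replicas updated")
    available = status.get("availableReplicas", 0)
    if available < desired:
        problems.append(f"only {available}/{desired} replicas available")
    return problems


def verify_deploy(
    deployment: dict,
    investigate: Callable[[str], str],  # hypothetical: ask the AI agent
    notify: Callable[[str], None],      # hypothetical: message the team
) -> bool:
    """Return True if healthy; otherwise investigate, notify, return False."""
    problems = rollout_problems(deployment)
    if not problems:
        return True
    name = deployment.get("metadata", {}).get("name", "<unknown>")
    question = f"Deployment {name} looks unhealthy after rollout: " + "; ".join(problems)
    notify(investigate(question))
    return False
```

In a pipeline, the boolean result can decide whether the job passes, so a bad deploy fails the build and the team gets the analysis in the same step.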
Any volunteers to test HolmesGPT on PagerDuty or OpsGenie incidents? I have the PR ready and need testers.
You'll get another pair of AIs on your incidents ;)
The prerequisite is having your own OpenAI/Azure key. (All open source, no data is sent to us.)
https://t.co/W38zxld3q8
AI is pretty good at analyzing pod logs and spotting interesting lines you miss on your own.
It's a really obvious use case, but one I've actually forgotten to share before!
If you paste an alert into an LLM and ask why it fired - the LLM will do a poor job. But so would a great engineer if you forced him to answer on the spot without troubleshooting first.
What if you let the LLM investigate like a human? Let it look at the alert. Then query your observability data. Then think. Then fetch more data.
For several months, we've secretly been building the world's first troubleshooting agent for alerts and incidents. Today we're releasing it as open source to the world.
https://t.co/82o3MxF3UQ
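The "look, query, think, fetch more" loop described above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not HolmesGPT's actual implementation: `decide` is a hypothetical stand-in for the LLM call, `tools` is a hypothetical mapping from data-source names to functions that fetch observability data, and the step limit is an arbitrary safety cap.

```python
from typing import Callable

# An action is ("fetch", tool_name) or ("answer", conclusion).
Action = tuple[str, str]


def investigate(
    alert: str,
    decide: Callable[[list[str]], Action],  # hypothetical LLM wrapper
    tools: dict[str, Callable[[], str]],    # hypothetical data fetchers
    max_steps: int = 5,
) -> str:
    """Alternate between thinking and fetching data until there is an answer."""
    findings = [f"alert: {alert}"]
    for _ in range(max_steps):
        kind, value = decide(findings)      # "think" over everything seen so far
        if kind == "answer":
            return value
        if value not in tools:
            # Record the miss so the next decision can pick another source.
            findings.append(f"{value}: unknown tool")
            continue
        findings.append(f"{value}: {tools[value]()}")  # fetch more data
    return f"inconclusive after {max_steps} steps"
```

The point of the structure is that the model never has to answer from the alert text alone: each round it sees all findings so far and either asks for one more piece of data or commits to a conclusion.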
I need 10 bold volunteers to beta test something in the next 6 hours. All you need is an OpenAI API Key and a K8s cluster with at least 10 nodes.
I'll send each participant a free "chief yaml officer" t-shirt in appreciation.
If interested, leave a comment and I will DM you the details.