Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n
535 ⭐️ KubeAI is a Kubernetes-native platform for running AI/ML workloads. It simplifies the deployment 🔧 and management 📝 of AI models in Kubernetes clusters, making it easier to scale AI applications.
https://t.co/3aMHI8iZYH
#starhistory#GitHub#OpenSource
Tutorial: Private RAG with Verba, #Weaviate, Embedding Model and LLMs all running inside your own K8s cluster. Copy paste-able instructions for all.
https://t.co/MMwkT1VLp9
Need to provide a private OpenAI API compatible endpoint backed by OSS or fine-tuned LLMs? Lingo: an open source lightweight ML proxy and autoscaler for K8s makes this easy: https://t.co/HZgfadCuQq
@mehran__jalali https://t.co/pUUNyWWYRV purpose is to make deploying and finetuning OSS LLMs just as easy as OpenAI. It's free and open source. Would love your feedback @mehran__jalali
Unlock the power of Apache Kafka on #Kubernetes!
Check out this tutorial to learn how to deploy and manage Kafka clusters with ease on #GKE for scalable and high-availability data streaming ↓
https://t.co/8MfDTRDPaO
Annoyed about `gcloud container node-pools update --node-labels` overwriting all existing labels? Solve it with a simple bash script 😀
https://t.co/rFmHSwDpCE
⚡ Announcing the Spark Connector for Weaviate!
🗃️ The Spark Connector allows easy importing of data from @ApacheSpark into Weaviate
👀 Check out this tutorial to get started
👉 https://t.co/NzJ9Y8PjmR
👉 https://t.co/SmejUWcdch
More in the 🧵