Containerlab sFlow-RT Development Environment provides example Javascript and Python scripts. Leaf / spine switches with FRRouting and Host sFlow used in SONiC, NVIDIA, VyOS etc. for realistic telemetry. https://t.co/JsBaZg3jlN
Explore publicly accessible dashboards showing live data from operational networks, including: an AI/ML RoCEv2 fabric, a world-wide Kubernetes cluster, and an Internet Exchange Provider (IXP). Learn how to monitor your own networks. https://t.co/82etPmxpq8
Learn how standard measurements from data center switches provides visibility into RDMA traffic from AI / ML workloads. Troubleshoot latency and drops. Includes live dashboards showing production traffic. https://t.co/Uz8VkGVrk1
N96 Talks are Streaming! https://t.co/XpQXCxCcUI
The N96 talks are on YouTube! 🎥✨ Whether you couldn’t make it in person or just want to relive the best moments, the full lineup is available!
Subscribe to our YouTube + keep the conversation going long after the conference.
The SDSC Expanse cluster live AI/ML metrics dashboard is a joint InMon / San Diego Supercomputer Center (SDSC) demonstration at SC25 conference being held this week in St. Louis. Click on the dashboard link during the show to see live traffic. https://t.co/yWxLcsOuVv
Real-time visibility into production Ultra Ethernet Transport (UET) traffic using industry standard data center switch telemetry https://t.co/KpgO3Ljn5D
Vector Packet Processor (VPP) release 25.10 extends the sFlow implementation to include support for dropped packet notifications. https://t.co/cYNtAP0cTw
Industry standard packet sampling in data center switch hardware from all leading vendors (Arista, Cisco, Dell, Juniper, NVIDIA, etc.) provides a cost effective solution for even the largest AI / ML fabrics. https://t.co/RJuE3NcFz6
Trimming packets that would otherwise be dropped in AI/ML networks is part of Ultra Ethernet congestion control and currently supported in NVIDIA switches and adapters. Monitoring trimmed packets is a useful metric for network visibility https://t.co/S1yh8gWa59
pwru (packet, where are you?) is an open source tool from Cilium that used eBPF instrumentation in recent Linux kernels to trace network packets through the kernel. Try it out using Multipass on your laptop. https://t.co/fJA6BFJYVm
Grafana dashboard showing performance metrics for AI/ML RoCEv2 network traffic used for inter-GPU communications. Includes step by step instructions to give it a try! https://t.co/LxFtYWCdJR
The availability of the Cisco IOS XR Release 25.1.1 brings sFlow dropped packet notification support to Cisco 8000 series routers, making it easy to capture and analyze packets dropped at router ingress. https://t.co/j5uvzhn0WO
The application provides performance metrics for AI/ML RoCEv2 network traffic, for example, large scale CUDA compute tasks using NVIDIA Collective Communication Library (NCCL) operations for inter-GPU communications. Step by step instructions. https://t.co/xsoNvAXhwC