ML @ AMD
Former ML+3D Engineer @ Stability AI
Ex. AMD Research Engineer, RT & Neural Rendering
2021 Graduate, Computer Graphics Group @ University of Tokyo.
Super happy to see this :) all those late nights fixing and getting ROCm + PyTorch running on Strix Halo with @scottttw
on TheRock have finally paid off 🥲 https://t.co/UtANhHk8Hf
It's now ready and widely available. This is just the beginning, and we're just getting started!
Local [Superintelligence + Supercomputing] + Signed by @LisaSu 💻 🚀🚀🚀
Ryzen AI Max+ PRO 395 (Strix Halo) with ROCm
* runs GPT-OSS locally
* runs Battlefied 6 like a desktop
* 16x Zen5 cores for builds
@sama@gdb Sarah As promised please tag me if you run into any issues 🤙
Generative AI gives answers. Agentic AI executes work but it needs more than just fast answers.
That shift changes the infrastructure equation. As agents scale, customers need CPU-rich infrastructure that can support more orchestration, retrieval, APIs, databases, caches, containers, VMs and workflows at rack scale.
That is where AMD EPYC processors are built to deliver the highest performing CPUs for the next era of AI: https://t.co/kWZXaYSYSR
AMD Ryzen AI Max+ is available now.
16 cores. 32 threads. 40 compute units. 50 TOPS NPU. Up to 128GB unified memory.
Built on x86, it brings breakthrough local AI performance to the enterprise and developer workflows teams already use. No translation layer, no performance penalty, no workflow disruption.
Introducing AMD AI Playbooks, your shortcut to building AI on AMD.
Step-by-step guides to help you run real AI workloads, customize them for your environment, and get up and running faster.
New playbooks drop every month.
Start building: https://t.co/eA1h5ePtF0
@AnushElangovan@wild_zones@AIatAMD@wild_zones with the above PR also applied to flash-attention itself, we have it working fine on windows. I'll post a short list of commands that makes it easy to run with comfyui.https://t.co/oORjfNF5nv
7.13 has multi-arch packaging so you can pull only the arch you need at runtime -- get the benefits of native ISA targets but also don't need the bloat (or jit to a lower level ISA).
And now it's documented. For people serious about performance, this is a real AMD advantage. Thanks @AnushElangovan for pushing this through! Code is here: https://t.co/WEEj9EVZe8
Speed is the moat.
MI455X is both fast, and is the fastest I have seen in execution for bringup of a complex GPU platform. MI455X is right on target for shipments in 2H2026 - Irrespective of what @SemiAnalysis_ says - rev your engines because speed is coming.