Low-level developer. Peakbagger. Private Pilot.
Founder/CTO Ringcube (acquired by Citrix) and Deepfactor (acquired by Cisco).
Building hypervisors and OSes.
llama.cpp now has an official website: https://t.co/vztdUpdBWL
Our goal is to make local AI accessible to everyone, and improving the user experience is a big part of that. On the new landing page you’ll find a single-line cross-platform installer. The installation provides a single unified `llama` entrypoint which you can use to run/serve models and interface with 3rd-party agentic applications.
While oriented towards simplified user experience, the new `llama` application also provides all the advanced functionality of the existing llama.cpp tooling with which experienced users are already familiar. Also note that all GGUF models that you might have already downloaded with llama.cpp in the past will be automatically available to use without downloading again (they are stored in the common HF cache on your machine).
We have many improvements in the pipeline both at the UX and at the engine level and we plan to iteratively ship new things over the coming months. One of the main focuses will be seamless integration with local-friendly 3rd-party agents (such as Pi). In the meantime, we’ll continue to listen for feedback from the community and adjust accordingly, so keep letting us know what you think and need.
@AlexFengzh@MrPeterLMorris nice. my nas only has PCIe x4 which means I'm not getting 50Gb/s even if the storage fabric could go that fast (and it can't, anyway).
This blew up in the best of ways! We've got @NVIDIAAI GB10 users of all skill levels, helping each other build and grow. Whether you're looking for help fine tuning or setting up your first GB10, you will inevitably find someone who is happy to help!
https://t.co/Z2m5vyYLLs
“Why can’t an OpenBSD installation just work correctly and be usable without a hundred hours of fiddling?”
Now it can. Start a Riot https://t.co/Jbm3fbsLj4
There's an invite-only Signal group for issues as well as https://t.co/S84NI0BYow
WARNING: Very BETA.