@VictorTaelin llmfit can be quite helpful to see if the model will fit, which quants to use, and estimated tok/s. Are you going to use it for exotic stuff like formal verification and lambda calculus? If so maybe create a mini-benchmark for your use-cases?
@tunguz Dear @AMD and @LisaSu, @realGeorgeHotz and @__tinygrad__ want to help with your "catastrophically weak software". Please reach out to them. You have a fiduciary responsibility to your shareholders to fix this mess.
@ecekamar I noticed in your December 13 blog post that Phi-4 was expected to be available on Hugging Face the following week. I haven't seen it listed, but very much look forward to having a look at the weights. Could you please provide an update on its release status? Thanks!
@1littlecoder@artificialguybr@teknium Typically it does, but there has been many issues with correctness and performance with the MPS backend in PyTorch. I've been trying to crack https://t.co/yVxGnnLHhE for a while, but not making a lot of progress unfortunately.
@__tinygrad__ Jensen actually picks up the bat phone when it rings. Then your problem magically goes away. This earns NVIDIA the top spot.
@LisaSu, @__tinygrad__ is calling.
@Kleos00 I was considering having a go at https://t.co/Fs8lPfbQsM or the Hutter Prize. But I don't think I can beat the Hutter Prize cause it's already 100x on a non-trivial dataset. At Neuralink nobody is responding to emails. I kind of lost interest.