Honored & excited to be part of the new #GenerativeAI center at @UTAustin @MLFoundations, with 600 H100 GPUs newly arriving - hook ’em Horns of #AI!
Quadratic attention has been indispensable for information-dense modalities such as language... until now.
Announcing Mamba: a new SSM arch. that has linear-time scaling, ultra long context, and most importantly--outperforms Transformers everywhere we've tried.
With @tri_dao 1/
The way people interact with the computer is going to change.
I am working on an LLM controller for your Mac
Inspired by LLM OS (@karpathy)
But I think an LLM-first OS will come in 1-2 years.
We need an intermediate step first: an OS expanded w/ LLM. This is how we get there.
Searching for collaborators…
🔥🔥Now one can prune LLaMA, from 7B to 65B, up to 70% sparsity, in one shot! 🚀
Kudos to the brilliant minds🤯 at VITA - @luu_yinn @KyriectionZhang @ShiweiLiu9 - for spearheading groundbreaking #llm #compression advancements!
A sensible piece by Nello Cristianini about AI existential risk.
Or lack thereof.
"If we’re going to label AI an ‘extinction risk’, we need to clarify how it could happen"
https://t.co/IUYqJZaqWW