My Codex usage is what happens when โjust one more promptโ stops being a plan and starts being a dependency ๐
๐ค I had to upgrade to Pro 20x just to keep the addiction running.
๐ Thanks, Sam @sama . ๐ ๐ธ๐ธ๐ธ
107 TFLOPS on Mac M3 with a 9070XT attached. Our GEMM with the AMD_LLVM backend is now beating hipBLASLt on the card. And I love how portable LLVM is, this is just brew install llvm@19.
A Microsoft paper suggests that GPT-4o-mini is a ~8B parameter model.
It means distillation works pretty well at @OpenAI.
Other model sizes from the paper:
- Claude 3.5 Sonnet: 175B
- GPT-4: 1.76T
- GPT-4o: 200B
- o1-preview: 300B
- o1-mini: 200B
@sama, would you consider open-sourcing GPT-4o-mini? It could run on our local devices.
We're excited to announce #AlphaFold 3 with @GoogleDeepMind in @Nature: our new AI model for predicting biomolecule structures with unprecedented breadth and accuracy.
Expanding beyond proteins to tackle DNA, RNA, small molecules to fuel advances in biology & drug design ๐งต
Have you ever wanted to train LLMs in pure C without 245MB of PyTorch and 107MB of cPython? No? Well now you can! With llm.c:
https://t.co/PoGTZIwASL
To start, implements GPT-2 training on CPU/fp32 in only ~1,000 lines of clean code. It compiles and runs instantly, and exactly matches the PyTorch reference implementation.
I chose GPT-2 to start because it is the grand-daddy of LLMs, the first time the LLM stack was put together in a recognizably modern form, and with model weights available.
@herbertong August 8th the best advertisement ๐๐๐ in my opinion would be to the launch ๐ of the robotaxi service in the tunnels dug by The Boring Companies in Las Vegas. ๐คฏWhat do you think @herbertong ๐ค?
Watch @symbolica Principle Scientist Dr. Paul Lessard talk about why category theory is the key to achieving a description of symbolic reasoning in machines and how we are using it to usher in a new era in machine learning.
Introducing Gemini 1.5: our next-generation model with dramatically enhanced performance. It also achieves a breakthrough in long-context understanding.
The first release is 1.5 Pro, capable of processing up to 1 million tokens of information. ๐งต https://t.co/qT0aXdFL0n
Introducing SDXL Turbo: A real-time text-to-image generation model.
SDXL Turbo achieves state-of-the-art performance with a new distillation technology, enabling single-step image generation with unprecedented quality, reducing the required step count from 50 to just one.
The code, research paper, and weights for non-commercial use are now available on our website.
You can test SDXL Turbo on Stability AIโs image editing platform @Clipdropapp, with a beta demonstration of the real-time text-to-image generation capabilities.
Learn more: https://t.co/L39rZWf9F7
@teslaeconomist [1/2] Probably during the first phase of the pilot line of the new open box car manufacturing design the engineers at Tesla are realizing that all the work (or quite all) of building the car could be done by Optimus in 12/18 months further development.
Optimus can now sort objects autonomously ๐ค
Its neural network is trained fully end-to-end: video in, controls out.
Come join to help develop Optimus (& improve its yoga routine ๐ง)
โ https://t.co/dBhQqg1qya