TLMs: Tiny LLMs and Agents on Edge Devices with @cormacb
https://t.co/u0fHD7j5kZ
Function Gemma ships at 270 million parameters and runs nearly 2,000 tokens per second prefill on a Pixel 7. Out of the box, it hits 46% accuracy on a fixed set of app intents. Fine tune on a synthetically generated dataset and that clears 90% on eight of ten functions.
Cormac walks through the two paths developers have for on device AI: a skill harness built on Gemma 4 with a restaurant roulette demo running fully on device. Then Eloquent, a production transcription app built by chaining two sub billion parameter models together.
cc @osanseviero
Check out this video on how to run Gemma 4 locally on an iPhone!
It runs completely offline and handles long context, meaning no data plan, no API calls, and no monthly fees required.
Our first successful Gemma 4 Runtime in London with @swyx@patloeber@nick_kango@cormacb and others! 💎Great to go out for a run and talk about Gemma, agents, evals and more
Here's a ridiculous result from the @OpenAI GPT-2 paper (Table 13) that might get buried --- the model makes up an entire, coherent news article about TALKING UNICORNS, given only 2 sentences of context.
WHAT??!!
This is really cool project, congrats to Lucian and all involved -- battery life increase from 6 weeks to 1.5 years!
https://t.co/QubaftqS8t https://t.co/C2HIOfDWCq
We are hiring in Seattle to join the https://t.co/LKBtDNhPbU crew that recently joined us through acquisition. Great opportunity to join a top class team working on one of the new frontiers of CS/ML. https://t.co/4R4677N1AM
My colleague Raghu has just released "Quantizing deep convolutional networks for efficient inference" - https://t.co/D87ziflTqd
This white paper covers practical quantization approaches for most common CNNs in @TensorFlow, I think it will be super useful!
It is starting to look like deep learning workflows of the future feature autotuned architectures running with autotuned compute schedules across arbitrary backends. I don't know if I should be excited or scared.