Launching a “train your own LLM” course next week with @fb_ldn! So excited about this one ☺️
One-liner: deeply understand how LLMs work by training one from scratch based on Karpathy’s nanochat, in a small group of technical founders/engineers, guided by an AI/ML expert — all in 3 weeks!
Basically we realized that Feynman was right and you can’t fully understand something until you create (or recreate) it; hence this course.
We will cover:
* what’s an LLM
* what are tokens, embeddings, activation functions
* what’s a transformer, what is attention, how it works
* what data an LLM is trained on, where this data comes from
* what training means (finding weights for neurons to activate)
* what is loss, why it improves with training
* what is fine-tuning
* what evals mean, how to evaluate an LLM after training
By the end of 3 weeks, you’ll have trained a gpt2-lvl LLM from scratch and acquired an intuitive + first-principles understanding of how these models are built & trained, and what their limitations are
Logistics: 3 live lectures (1hr each) + 3 office hours + async homework, small group of ~10 technical founders, myself and @fb_ldn as instructors (Fabian has an AI & ML background, co-founded Shipamax W17 and exited to WiseTech Global)
DM if interested!
@a16z Jenna & I (exited YC founders, also married) have also been removing our household admin using an agent called Hermo (https://t.co/LZXQnqZOBL). We’ve now opened Hermo up (currently in private launch) for others who want less mental load without having to set up anything technical
@jessegenet Jenna & I (exited YC founders, also married) have also been removing our household admin using an agent called Hermo (https://t.co/LZXQnqZOBL). We’ve now opened Hermo up (currently in private launch) for others who want less mental load without having to set up anything technical
Now you can hold the beating heart of an AI in your hands, on your laptop.
You can teach the sand to think.
You can watch it as it learns.
https://t.co/WWK7F5Vllc
Introducing...
Gemma 4 Multimodal Fine-Tuner for Apple Silicon
- LoRA fine-tunning toolkit for Gemma LLM
- runs locally on macOS via PyTorch and Metal
- streams data from Google Cloud to your machine
- fine-tune on audio, image and text
- easy-to-use CLI wizard
If you want to fine-tune the new Gemma 4 on text, images, or audio without renting an H100 or copying a terabyte of data to your laptop, this is the only toolkit that does it all on Apple Silicon.
I built a simple internal chatbot that answers your team's SOPs and FAQs using your existing Google Drive docs
in the walkthrough, I show:
– how it finds the right doc for any question
– how your team can “@” specific files (like in Slack)
– how to set it up and share it with your team in minutes
reply “chatbot” and I’ll send the full video + code (must be following)
Masters of Doom - super exciting book for anyone interested in the history of id software, @ID_AA_Carmack and the greatest games of all time Commander Keen, Doom and Quake
We raised $2.7 million to an #opensource#mlops framework for production-ready ML pipelines! Check out @zenml_io with our new README and docs! Give us a star and run your first pipeline today! So excited! https://t.co/njzGOS3bQu
What happened to all the MEAN/MERN stack devs that used to scream about MongoDB being a superior db to Postgres for every use case because “Postgres isn’t web scale”?
@British_Airways is it possible to get refunds for cancelled flights, please? COVID as excuse for not getting support via phone or chat is not acceptable anymore - it started February last year!
@HelloFreshUK I got a text message that I'll get another delivery even though I cancelled via chat. Your support bot is also ... not able to handle this
Loved discussing some of the latest topics from ICDAR 2021 in our Shipamax research group. If you want to be at the forefront of cutting-edge machine learning you can never stop learning :) I'm always humbled by the brain-power of our team.