For Danya ♥️
A legend, a teacher, a friend…
We never got to record a course with him, so we made one for him.
10 of his best games.
Free. Forever.
https://t.co/Nl5vJrmpV4
The paper and codes are available at:
🔗 https://t.co/oKsk8NMlsh
🔗 https://t.co/ZT2Yia1W3u
🙌 Big thanks to amazing co-authors: @Cote_Marc , @michaelalbada, @goodboyanush , Jack W. Stokes, Tong Wang, Amir Abdi, William Blum, @mageed
🧵4/4
Gukesh:
"99 out of 100 times I would lose. Just a lucky day!"
"First classical win against Magnus, I mean, not the way I wanted it to be, but okay I'll take it."
#NorwayChess
An attempt to explain (current) ChatGPT versions.
I still run into many, many people who don't know that:
- o3 is the obvious best thing for important/hard things. It is a reasoning model that is much stronger than 4o and if you are using ChatGPT professionally and not using o3 you're ngmi.
- 4o is different from o4. Yes I know lol. 4o is a good "daily driver" for many easy-medium questions. o4 is only available as mini for now, and is not as good as o3, and I'm not super sure why it's out right now.
Example basic "router" in my own personal use:
- Any simple query (e.g. "what foods are high in fiber"?) => 4o (about ~40% of my use)
- Any hard/important enough query where I am willing to wait a bit (e.g. "help me understand this tax thing...") => o3 (about ~40% of my use)
- I am vibe coding (e.g. "change this code so that...") => 4.1 (about ~10% of my use)
- I want to deeply understand one topic - I want GPT to go off for 10 minutes, look at many, many links and summarize a topic for me. (e.g. "help me understand the rise and fall of Luminar"). => Deep Research (about ~10% of my use). Note that Deep Research is not a model version to be picked from the model picker (!!!), it is a toggle inside the Tools. Under the hood it is based on o3, but I believe is not fully equivalent of just asking o3 the same query, but I am not sure.
All of this is only within the ChatGPT universe of models. In practice my use is more complicated because I like to bounce between all of ChatGPT, Claude, Gemini, Grok and Perplexity depending on the task and out of research interest.
Feeling lost in the world of LLM evaluation tools? 🧭 Our latest blog post is here to help you navigate your options with confidence! Discover how to pick the perfect tool for your needs! Read more: https://t.co/2BvqZvgSAA #AI#LLMEvaluation#Opik#DeepEval#ArizeAI#LangFuse
My first personal laptop is a free #Apple#Mac laptop. After 2 decades of working with computers, I am owning my first personal laptop. Can you #believe it ?
Delivered a day-long tutorial on "Foundation Models and Generative AI" at IJCB 2024 in Buffalo, USA!
We covered a broad spectrum of topics, from the fundamentals of Foundation Models and Generative AI to Prompt Engineering, AI Agents, Responsible AI, and real-world applications in Biometrics. The session also included some advanced concepts and hands-on activities.
It was a proud moment to team up with Anush (@goodboyanush), an IAB PhD alumnus, once again after quite some time to bring this tutorial to life. We hope the audience found it as enriching and enjoyable!
#IJCB2024 #GenerativeAI #FoundationModels #ArtificialIntelligence #MachineLearning #PromptEngineering #ResponsibleAI #DeepLearning
@GothamChess@GothamChess you have 4.88 million subscribers in YouTube. Ask everyone to donate $2 towards chess. And you can conduct the world championship! I am ready to donate my $2