change of plans: we are going to release o3 and o4-mini after all, probably in a couple of weeks, and then do GPT-5 in a few months.
there are a bunch of reasons for this, but the most exciting one is that we are going to be able to make GPT-5 much better than we originally thought. we also found it harder than we thought it was going to be to smoothly integrate everything. and we want to make sure we have enough capacity to support what we expect to be unprecedented demand.
Heading toward the next Silicon Valley? If you're eyeing the AI job market, the UAE might just be your next big opportunity. #AI #AIJobs #innovation #tech #Dubai #UAE #AIInvestments https://t.co/lGA22zhb9h
Beautiful Paper.
A comprehensive survey of post-training methods, including fine-tuning, reinforcement learning, and test-time scaling, for refining LLM reasoning.
Methods Explored in this Paper 🔧:
→ Systematically explores fine-tuning techniques that adapt LLMs for specific tasks, but acknowledges risks of overfitting and forgetting.
→ Reinforcement Learning from Human Feedback is examined for aligning LLMs with human preferences and improving response quality.
→ Test-time scaling strategies, such as chain of thought prompting and tree of thought, are discussed to enhance reasoning during inference without retraining, by dynamically adjusting computation based on query complexity.
→ The paper also investigates reward modeling, policy optimization algorithms like Proximal Policy Optimization and Direct Preference Optimization, and efficient fine-tuning approaches to optimize LLMs post-training.
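For readers who want something concrete for the Direct Preference Optimization item above, here is a minimal sketch of the per-example DPO loss in plain Python. It is not taken from the survey. The function name, argument names, and the `beta=0.1` default are illustrative assumptions, and the inputs are assumed to be summed log-probabilities of a chosen and a rejected response under the policy and under a frozen reference model.

```python
import math

def dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    All four inputs are log-probabilities (hypothetical values here).
    margin measures how much more the policy prefers the chosen response
    over the rejected one, relative to the reference model.
    """
    margin = (policy_chosen - ref_chosen) - (policy_rejected - ref_rejected)
    z = beta * margin
    # -log(sigmoid(z)) == softplus(-z), written in a numerically stable form.
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))

# Policy identical to the reference: margin is 0, loss is log(2).
neutral = dpo_loss(-12.0, -15.0, -12.0, -15.0)

# Policy has shifted probability toward the chosen response: loss drops.
improved = dpo_loss(-10.0, -17.0, -12.0, -15.0)

print(neutral, improved)
```

The loss only depends on the difference between policy and reference log-ratios, which is why no separate reward model appears in the sketch.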
@realDonaldTrump Zelenskyy never planned to sign the deal. He showed up to trigger you. It was a big smokescreen to disappear uncaught with money he stole and/or play a martyr to be able to steal even more. Don’t let this bastard escape
Let's get deeper into Transformers. The magic of residual connections: a powerful shortcut that can supercharge your model’s performance. A trick that works wonders! (as shown in calculations 👍 ) #deeplearning #transformers #gradientdescent https://t.co/kZ0d5r6jZ7
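A rough way to see why the shortcut helps, as a toy sketch rather than the calculation in the linked post: with a residual connection each layer computes y = x + f(x), so its local derivative is 1 + f'(x) instead of f'(x). The scalar tanh layer, the weight 0.5, and the depth of 20 below are all assumptions chosen for illustration.

```python
import math

def layer(x, w):
    """Toy scalar layer: f(x) = tanh(w * x)."""
    return math.tanh(w * x)

def layer_grad(x, w):
    """Derivative of f with respect to x: w * (1 - tanh(w * x)^2)."""
    return w * (1.0 - math.tanh(w * x) ** 2)

def input_gradient(x, weights, residual):
    """d(output)/d(input) for a stack of toy layers, via the chain rule."""
    grad = 1.0
    for w in weights:
        local = layer_grad(x, w)
        if residual:
            grad *= 1.0 + local   # y = x + f(x)  ->  dy/dx = 1 + f'(x)
            x = x + layer(x, w)
        else:
            grad *= local         # y = f(x)      ->  dy/dx = f'(x)
            x = layer(x, w)
    return grad

weights = [0.5] * 20  # 20 identical toy layers
plain = input_gradient(1.0, weights, residual=False)
skip = input_gradient(1.0, weights, residual=True)

print("plain stack gradient:   ", plain)
print("residual stack gradient:", skip)
```

In the plain stack every factor is at most 0.5, so the product shrinks toward zero with depth. In the residual stack every factor is at least 1, so the gradient reaching the input does not vanish.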
Llama 3.1.
• Comparable to GPT-4o and Claude 3.5 Sonnet, according to the benchmarks
• The weights are publicly available
• 128K context #meta #llama #ai #llm https://t.co/YnePaPCNSD
everyone talks about AI, but - hire me to show you how to use it :)
#nvidia #ai #hireme
“big gap between the revenue expectations implied by the AI infrastructure build-out, and actual revenue growth in the…” https://t.co/8j9LqfkPsO via @sequoia
See you tonight at 6pm AEDT to hear about #reproducible analysis with #workflowr. It’s not too late to sign up. Zoom link provided later today https://t.co/iCVBCKonWx
Reminder this event is now online only.
@JovMaksimovic #RStats @RLadiesGlobal