🎉 New paper 🎉 Introducing the Toolformer, a language model that teaches itself to use various tools in a self-supervised way. This significantly improves zero-shot performance and enables it to outperform much larger models. 🧰
🔗 Link: https://t.co/FvjzhysMze
MAI is a really cool team of kind, highly motivated and skilled people.
Our team worked with them in the final stretch of this model contributing some of our swe 🧙♀️
proud of our Froggy team 🐸 and expect further cool updates from us...
MAI-Thinking-1 is out!
Excited to share what we are building and how climbing from scratch (no distillation) actually works: simple recipes, rigorous science, self-distillation, patience, and great infra.
Check out our tech report has the full story of our RL climbs.
https://t.co/aLW40sWz4d
Excited to share as many details on what we @MicrosoftAI have been working on. Building a LLM from scratch is an awesome journey with pain and suffering battling unknowns but also many cool moments to see it (somehow) works out every stage! https://t.co/WTRRRRwUGu
🤖 Want an agent that can learn new tasks from only a handful of demonstrations and no weight updates?
🚀 Check out our new work on In-Context Learning for Sequential Decision-Making, where we show how we can use transformers to few-shot learn new Procgen and MiniHack tasks.
👋 If you want to learn more about it, come chat with us at the FMDM workshop @NeurIPSConf on Friday, December 15.
🙌 Kudos to @sharathraparthy who did an outstanding job leading this work, designing and running lots of experiments, and digging deep trying to understand the model’s behavior. 🧵👇
Excited to be giving an oral presentation at @NeurIPSConf on Toolformer: Language Models Can Teach Themselves to Use Tools [https://t.co/ECE4X3uJZa]!
When: Wednesday at 10:15am
Where: Ballroom A-C (level 2)
https://t.co/xe60Eoi4Oy
Inflection AI just announced Inflection-2, a HUGE new 175 billion parameter language model.
Capabilities exceed Google and Meta's top models and “is very close” to catching GPT-4.
The CEO also said the company’s next model will be 10x larger in six months.
Thrilled to announce that Inflection-2 is now the 2nd best LLM in the world! 💚✨🎉
It will be powering https://t.co/1RWFB5RHtF very soon. And available to select API partners in time. Tech report linked...
Come run with us!
https://t.co/8DZwP1Qnqo
It has been nothing short of incredible to be a part of this team and celebrate every accomplishment! And we’re still *just* getting started 🏃🏽♀️🏃🏽♀️🏃🏽♀️
Utterly insane weekend. So sad. Wishing everyone involved the very best.
In the meantime, we finished training Inflection-2 last night! ✨
It's now the 2nd best LLM in the world... & we're scaling MUCH further. Details v soon.
Come run with us!
In just over 100 days since launching Pi, we’ve just hit one billion messages exchanged. A huge milestone 🤯
Any predictions on how long it will take us to get to 2 billion?!
🚨New Paper 🚨
Self-Alignment with Instruction Backtranslation
- New method auto-labels web text with instructions & curates high quality ones for FTing
- Our model Humpback 🐋 outperforms LIMA, Claude, Guanaco, davinci-003 & Falcon-Inst
https://t.co/93qi4JDnpb
(1/4)🧵
Lost in the Middle: How Language Models Use Long Contexts
https://t.co/eHGjq1r9S5
Exciting work exploring the effectiveness of long context, led by @nelsonfliu and with Kevin Lin, Ashwin Paranajape, John Hewitt, @percyliang@Fabio_Petroni@MicheleBevila20
Excited to announce that we’ve raised $1.3B to build one of the largest clusters in the world and turbocharge the creation of Pi, your personal AI.
https://t.co/p5AfRXGPan
We’re proud to announce Inflection-1, the best-in-class LLM developed at Inflection!
Inflection-1, which powers https://t.co/e1SMbsrbJW, outperforms GPT-3.5, Chinchilla, and LLaMA on a number of academic benchmarks.
More details in our technical memo: https://t.co/rOVlEXepNN
One of our key sources of human data is no longer fully “human"!
We estimate that 33-46% of crowd workers on MTurk used large language models (LLMs) in a text production task - which may increase as ChatGPT and the like become more popular and powerful.
https://t.co/SJfKjDM6gX