Timo Schick @timo_schick - Twitter Profile

Pinned Tweet

over 3 years ago

🎉 New paper 🎉 Introducing the Toolformer, a language model that teaches itself to use various tools in a self-supervised way. This significantly improves zero-shot performance and enables it to outperform much larger models. 🧰 🔗 Link: https://t.co/FvjzhysMze

41

1K

282

485

510K

timo_schick retweeted

Enxhell Luzhnica @eluzhnica

3 days ago

Happy to share what we've been up to! https://t.co/bW2XWwcoQK

4

66

10

20

7K

timo_schick retweeted

Alessandro Sordoni @murefil

4 days ago

MAI is a really cool team of kind, highly motivated and skilled people. Our team worked with them in the final stretch of this model contributing some of our swe 🧙‍♀️ proud of our Froggy team 🐸 and expect further cool updates from us...

0

83

7

8

6K

timo_schick retweeted

Hanna Hajishirzi

@HannaHajishirzi

4 days ago

MAI-Thinking-1 is out! Excited to share what we are building and how climbing from scratch (no distillation) actually works: simple recipes, rigorous science, self-distillation, patience, and great infra. Check out our tech report has the full story of our RL climbs. https://t.co/aLW40sWz4d

HannaHajishirzi's tweet photo. MAI-Thinking-1 is out!

Excited to share what we are building and how climbing from scratch (no distillation) actually works: simple recipes, rigorous science, self-distillation, patience, and great infra.

Check out our tech report has the full story of our RL climbs.
https://t.co/aLW40sWz4d

24

866

127

381

118K

Who to follow

Ofir Press

@OfirPress

I push the AI frontier by building tough benchmarks with amazing people. SWE-bench, SWE-agent, SciCode, AlgoTune. Postdoc @Princeton. PhD @nlpnoah @UW.

Ethan Perez

@EthanJPerez

Alignment team lead at Anthropic

UW NLP

@uwnlp

The NLP group at the University of Washington.

timo_schick retweeted

Frank Xu @frankxu2004

4 days ago

Excited to share as many details on what we @MicrosoftAI have been working on. Building a LLM from scratch is an awesome journey with pain and suffering battling unknowns but also many cool moments to see it (somehow) works out every stage! https://t.co/WTRRRRwUGu

frankxu2004's tweet photo. Excited to share as many details on what we @MicrosoftAI have been working on. Building a LLM from scratch is an awesome journey with pain and suffering battling unknowns but also many cool moments to see it (somehow) works out every stage! https://t.co/WTRRRRwUGu https://t.co/e2xBU5sYHo

9

138

15

31

9K

timo_schick retweeted

Frank Xu @frankxu2004

3 days ago

What a ride to work with you on coding RL. Insanely fun

2

26

1

6

3K

timo_schick retweeted

Luca Soldaini 🎀

@soldni

4 days ago

Climbing with no distillation, like the Big Boys do, has been super fun! Read the tech report for a taste of our ̶s̶u̶f̶f̶e̶r̶i̶n̶g̶ journey

soldni's tweet photo. Climbing with no distillation, like the Big Boys do, has been super fun! Read the tech report for a taste of our ̶s̶u̶f̶f̶e̶r̶i̶n̶g̶ journey https://t.co/JotbX1tTjW

18

711

50

383

68K

Timo Schick @timo_schick

almost 2 years ago

Looking forward to my first non-virtual conference in almost 2 years. If you’re attending #ICML2024 and want to chat, drop me a message 😊

1

25

0

2

3K

timo_schick retweeted

Roberta Raileanu @robertarail

over 2 years ago

🤖 Want an agent that can learn new tasks from only a handful of demonstrations and no weight updates? 🚀 Check out our new work on In-Context Learning for Sequential Decision-Making, where we show how we can use transformers to few-shot learn new Procgen and MiniHack tasks. 👋 If you want to learn more about it, come chat with us at the FMDM workshop @NeurIPSConf on Friday, December 15. 🙌 Kudos to @sharathraparthy who did an outstanding job leading this work, designing and running lots of experiments, and digging deep trying to understand the model’s behavior. 🧵👇

0

70

11

28

11K

timo_schick retweeted

Jane Yu @JaneYuBear

over 2 years ago

Excited to be giving an oral presentation at @NeurIPSConf on Toolformer: Language Models Can Teach Themselves to Use Tools [https://t.co/ECE4X3uJZa]! When: Wednesday at 10:15am Where: Ballroom A-C (level 2) https://t.co/xe60Eoi4Oy

0

131

21

31

15K

timo_schick retweeted

Rowan Cheung

@rowancheung

over 2 years ago

Inflection AI just announced Inflection-2, a HUGE new 175 billion parameter language model. Capabilities exceed Google and Meta's top models and “is very close” to catching GPT-4. The CEO also said the company’s next model will be 10x larger in six months.

rowancheung's tweet photo. Inflection AI just announced Inflection-2, a HUGE new 175 billion parameter language model.

Capabilities exceed Google and Meta's top models and “is very close” to catching GPT-4.

The CEO also said the company’s next model will be 10x larger in six months. https://t.co/cdIo2zHtgl

9

739

60

147

272K

timo_schick retweeted

Mustafa Suleyman

@mustafasuleyman

over 2 years ago

Thrilled to announce that Inflection-2 is now the 2nd best LLM in the world! 💚✨🎉 It will be powering https://t.co/1RWFB5RHtF very soon. And available to select API partners in time. Tech report linked... Come run with us! https://t.co/8DZwP1Qnqo

71

995

109

230

547K

timo_schick retweeted

Anusha Bala @anushabalak

over 2 years ago

It has been nothing short of incredible to be a part of this team and celebrate every accomplishment! And we’re still *just* getting started 🏃🏽‍♀️🏃🏽‍♀️🏃🏽‍♀️

1

12

2

0

3K

timo_schick retweeted

Inflection AI @inflectionAI

over 2 years ago

🎉 Introducing Inflection-2, the 2nd best LLM in the world! Get ready to experience the future of AI with us. https://t.co/j8sUvZTMbH

49

884

166

243

314K

timo_schick retweeted

Mustafa Suleyman

@mustafasuleyman

over 2 years ago

Utterly insane weekend. So sad. Wishing everyone involved the very best. In the meantime, we finished training Inflection-2 last night! ✨ It's now the 2nd best LLM in the world... & we're scaling MUCH further. Details v soon. Come run with us!

69

1K

190

141

376K

timo_schick retweeted

Pi @pi

almost 3 years ago

In just over 100 days since launching Pi, we’ve just hit one billion messages exchanged. A huge milestone 🤯 Any predictions on how long it will take us to get to 2 billion?!

14

100

6

1

8K

timo_schick retweeted

Jason Weston

@jaseweston

almost 3 years ago

🚨New Paper 🚨 Self-Alignment with Instruction Backtranslation - New method auto-labels web text with instructions & curates high quality ones for FTing - Our model Humpback 🐋 outperforms LIMA, Claude, Guanaco, davinci-003 & Falcon-Inst https://t.co/93qi4JDnpb (1/4)🧵

jaseweston's tweet photo. 🚨New Paper 🚨
Self-Alignment with Instruction Backtranslation

- New method auto-labels web text with instructions & curates high quality ones for FTing

- Our model Humpback 🐋 outperforms LIMA, Claude, Guanaco, davinci-003 & Falcon-Inst

https://t.co/93qi4JDnpb
(1/4)🧵 https://t.co/9iU79bxDuo

13

651

138

423

358K

timo_schick retweeted

Maithra Raghu

@maithra_raghu

almost 3 years ago

Lost in the Middle: How Language Models Use Long Contexts https://t.co/eHGjq1r9S5 Exciting work exploring the effectiveness of long context, led by @nelsonfliu and with Kevin Lin, Ashwin Paranajape, John Hewitt, @percyliang @Fabio_Petroni @MicheleBevila20

maithra_raghu's tweet photo. Lost in the Middle: How Language Models Use Long Contexts

https://t.co/eHGjq1r9S5

Exciting work exploring the effectiveness of long context, led by @nelsonfliu and with Kevin Lin, Ashwin Paranajape, John Hewitt, @percyliang @Fabio_Petroni @MicheleBevila20 https://t.co/yaJGevoJvk

4

124

24

52

27K

timo_schick retweeted

Mustafa Suleyman

@mustafasuleyman

almost 3 years ago

Excited to announce that we’ve raised $1.3B to build one of the largest clusters in the world and turbocharge the creation of Pi, your personal AI. https://t.co/p5AfRXGPan

142

3K

309

454

929K

timo_schick retweeted

Inflection AI @inflectionAI

almost 3 years ago

We’re proud to announce Inflection-1, the best-in-class LLM developed at Inflection! Inflection-1, which powers https://t.co/e1SMbsrbJW, outperforms GPT-3.5, Chinchilla, and LLaMA on a number of academic benchmarks. More details in our technical memo: https://t.co/rOVlEXepNN

19

368

86

136

163K

timo_schick retweeted

Manoel @manoelribeiro

almost 3 years ago

One of our key sources of human data is no longer fully “human"! We estimate that 33-46% of crowd workers on MTurk used large language models (LLMs) in a text production task - which may increase as ChatGPT and the like become more popular and powerful. https://t.co/SJfKjDM6gX

manoelribeiro's tweet photo. One of our key sources of human data is no longer fully “human"!

We estimate that 33-46% of crowd workers on MTurk used large language models (LLMs) in a text production task - which may increase as ChatGPT and the like become more popular and powerful.

https://t.co/SJfKjDM6gX https://t.co/lRHp4tpfZF

40

2K

509

534

858K

Timo Schick

@timo_schick

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users