#instructGPT - Twitter Hashtag

over 2 years ago · Palo Alto

Paper title is all you need: The monumental 2020 #GPT3 is about "Language Models are Few-Shot Learners" (aka #PromptEngineering) and the 2022 #InstructGPT paper is about "Training language models to follow instructions with human feedback" (aka #RLHF). Happy birthday!@OpenAI #LLM

Jim Fan

@DrJimFan

over 2 years ago

Today is the 2-year birthday of InstructGPT, the mother of all modern LLMs. The AI circle has a time dilation effect - can't believe it's been 2 yrs! InstructGPT laid out the canonical recipe of pre-training -> supervised finetuning -> RLHF, a strategy that everyone is still following till this day (with a bit of variations like DPO). InstructGPT was likely OpenAI's last paper that detailed how they train the frontier models. Looking back, I think it marked the watershed moment when LLMs finally went from an academic curiosity (GPT-3) to an impactful product (ChatGPT). Some fun facts: - InstructGPT didn't invent RLHF. In fact, the blog linked to the OG RLHF work, also done by OpenAI's team in 2017. It was conceived to solve hard-to-specify tasks in simulated robotics. RLHF asked a human annotator to give 900 binary preferences, which helped a simple "hopper" robot learn backflips in sim: https://t.co/gZRPGk6CFO - InstructGPT was published at NeurIPS 2022 in New Orleans! I was presenting MineDojo at the conference and was quite surprised to see the OpenAI poster there. - The models came in 3 sizes: 1.3B, 6B, 175B. The labelers strongly preferred Instruct-1.3B to the old, prompt-engineered GPT-3-175B. Microsoft Phi-1, one of the best-known small LMs, was also 1.3B. - InstructGPT is a master class on how to present your research. The 3-step figure is crystal clear and becomes one of the most iconic visuals in AI. The intro section has no BS and gives 8 take-home messages in bold. The discussions on limitations and bias are grounded and honest. Your weekend read: https://t.co/50Ko4pTRks

DrJimFan's tweet photo. Today is the 2-year birthday of InstructGPT, the mother of all modern LLMs. The AI circle has a time dilation effect - can't believe it's been 2 yrs! InstructGPT laid out the canonical recipe of pre-training -> supervised finetuning -> RLHF, a strategy that everyone is still following till this day (with a bit of variations like DPO).

InstructGPT was likely OpenAI's last paper that detailed how they train the frontier models. Looking back, I think it marked the watershed moment when LLMs finally went from an academic curiosity (GPT-3) to an impactful product (ChatGPT).

Some fun facts:
- InstructGPT didn't invent RLHF. In fact, the blog linked to the OG RLHF work, also done by OpenAI's team in 2017. It was conceived to solve hard-to-specify tasks in simulated robotics. RLHF asked a human annotator to give 900 binary preferences, which helped a simple "hopper" robot learn backflips in sim: https://t.co/gZRPGk6CFO
- InstructGPT was published at NeurIPS 2022 in New Orleans! I was presenting MineDojo at the conference and was quite surprised to see the OpenAI poster there.
- The models came in 3 sizes: 1.3B, 6B, 175B. The labelers strongly preferred Instruct-1.3B to the old, prompt-engineered GPT-3-175B. Microsoft Phi-1, one of the best-known small LMs, was also 1.3B.
- InstructGPT is a master class on how to present your research. The 3-step figure is crystal clear and becomes one of the most iconic visuals in AI. The intro section has no BS and gives 8 take-home messages in bold. The discussions on limitations and bias are grounded and honest.

Your weekend read: https://t.co/50Ko4pTRks

18

807

153

399

113K

0

7

0

1

2K

Big Data & Society @BigDataSoc

over 2 years ago

New Article: Analyzing #AI as 'automated subjects' using #psychoanalysis & #criticalmediastudies. Magee et al.'s study delves into LLMs like #OpenAI's #InstructGPT, exploring their design, biases, and interactions with users. #AILanguage #CriticalAI https://t.co/N1DAaYVejM

2

18

6

3K

Karthik Valmeekam @karthikv792

almost 3 years ago

The all-new #InstructGPT-3.5, supposedly acing chess, still sucks at planning. 9% in blocks world (vs. 6.1% for GPT3.5). Add obfuscation, and it dives to 0.6%! You'd think an "1800 Elo chess player" would be a tad better at stacking blocks, no? 🙄 (w/ @mattdmarq & @rao2z)

karthikv792's tweet photo. The all-new #InstructGPT-3.5, supposedly acing chess, still sucks at planning. 9% in blocks world (vs. 6.1% for GPT3.5). Add obfuscation, and it dives to 0.6%!

You'd think an "1800 Elo chess player" would be a tad better at stacking blocks, no? 🙄
(w/ @mattdmarq & @rao2z) https://t.co/RxYtebny6V

Grant Slatton

@GrantSlatton

almost 3 years ago

The new GPT model, gpt-3.5-turbo-instruct, can play chess around 1800 Elo. I had previously reported that GPT cannot play chess, but it appears this was just the RLHF'd chat models. The pure completion model succeeds. https://t.co/UXEkAfEVQT See game & thoughts below:

GrantSlatton's tweet photo. The new GPT model, gpt-3.5-turbo-instruct, can play chess around 1800 Elo.

I had previously reported that GPT cannot play chess, but it appears this was just the RLHF'd chat models. The pure completion model succeeds.

https://t.co/UXEkAfEVQT

See game & thoughts below: https://t.co/pztWrIuKFD

96

1K

242

552

1M

3

29

3

4

12K

helloitsliam (MVP Alumni, MCT)

@helloitsliam

about 3 years ago · Virginia

BadGPT: Exploring Security Vulnerabilities of #ChatGPT via Backdoor Attacks to #InstructGPT https://t.co/QA54gTEimu

0

1

0

153

MIXI 中途採用公式アカウント

@mixi_job_info

about 3 years ago

会話AIロボット #Romi の仕組み、話題の #ChatGPT で用いられた学習方法である「#InstructGPT」について紹介してます↓↓ https://t.co/6YrFk3wtXl ※ #レバテックLAB 様（@levtech_inc）にて掲載いただきました！

0

2

3

0

743

André Rieu @superrieu

about 3 years ago

Vortrag eines Entwicklers von #OpenAI zur spezifischen Trainingsmethode des Reinforcement Learning from Human Feedback (#RLHF) und den #InstructGPT-Modellen, die ganz entscheidend zum spektakulären Erfolg von #ChatGPT beigetragen haben. #KI https://t.co/F3WJGN3YHE

1

2

0

287

Big Think

@bigthink

over 3 years ago

Unlike its predecessor, #InstructGPT is far less likely to wander into bizarre lies, emotional rants, and manipulative tangents. #openAI https://t.co/HoSDJIsn0A

0

6

3

4

4K

Muse™ - AI for Business @musetm_grenoble

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech @albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL

enricomolinari's tweet photo. Teaching #ChatGPT to interact with Humans

Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech

@albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL https://t.co/RwPtzuK4e0

0

10

7

0

825

0

1

3

0

458

Kay Hosoda＠AIエンジニア @kay_hosoda

over 3 years ago

Axross Recipeで新しいレシピを公開しました！ ChatGPTできることを紹介するレシピ | Kay Hosoda https://t.co/TA0xjG4V9s #AxrossRecipe #ChatGPT #InstructGPT #GPT #AI #チャットボット

0

4

1

1K

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech @mikeflache @Paula_Piccard @Shi4Tech @Xbond49 @tewoz @HaroldSinnott @Ronald_vanLoon @NafisAlam @pierrepinna @Khulood_Almani

0

12

11

2

1K

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech @evankirstel @HeinzvHoenen @FrRonconi @CurieuxExplorer @Hana_ElSayyed @PawlowskiMario @MargaretSiegien @ParisFinForum

0

22

14

0

1K

Paris Fintech Forum @ParisFinForum

over 3 years ago

RT @enricomolinari Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech @JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeM...

ParisFinForum's tweet photo. RT @enricomolinari https://t.co/ADOEIYBxdn Teaching #ChatGPT to interact with Humans

Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech

@JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeM...

0

2

1

0

172

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech @JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeMONANGE @aure79lien @rwang0 @terence_mills @avataverse

1

15

13

2

665

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech @albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL

0

10

7

0

825

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech @RagusoSergio @ingliguori @stratorob @KanezaDiane @nigewillson @kuriharan @enilev @Nicolas2Pinto @TheRudinGroup @njhochman @sonu_monika

enricomolinari's tweet photo. Teaching #ChatGPT to interact with Humans

Guodong Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech

@RagusoSergio @ingliguori @stratorob @KanezaDiane @nigewillson @kuriharan @enilev @Nicolas2Pinto @TheRudinGroup @njhochman @sonu_monika https://t.co/X67MChAeKH

0

2

3

0

320

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech @kalydeoo @sebbourguignon @labordeolivier @JeroenBartelse @pierrecappelli @globaliqx @FGraillot @Visible_Banking @richardturrin

enricomolinari's tweet photo. Teaching #ChatGPT to interact with

Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech

@kalydeoo @sebbourguignon @labordeolivier @JeroenBartelse @pierrecappelli @globaliqx @FGraillot @Visible_Banking @richardturrin https://t.co/f4973DThkm

0

7

3

0

542

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech @TheAdityaPatro @RLDI_Lamy @bimedotcom @EvaSmartAI @Sharleneisenia @GlenGilmore @CyrilCoste @TerenceLeungSF @JagersbergKnut

0

45

11

1

806

Enrico Molinari #VivaTech2025

@enricomolinari

over 3 years ago

Teaching #ChatGPT to interact with Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech @Nicochan33 @BetaMoroney @ipfconline1 @psb_dc @SpirosMargaris @mvollmer1 @dinisguarda @YuHelenYu @3itcom @pascal_bornet @Ym78200

0

26

14

1

1K

unimatrix0.x @UniMatrixZ0

over 3 years ago

"#OpenAI ’s is built on decades of research.": => 1980s–’90s: Recurrent Neural Networks #RNN => 2017: #Transformers => 2018–2019: #GPT and #GPT-2 => 2020: #GPT-3 => 2022: #InstructGPT, #OPT, #BLOOM => December 2022: #ChatGPT "Overnight" success https://t.co/F2ieO1PAhh

UniMatrixZ0's tweet photo. "#OpenAI ’s is built on decades of research.":

=> 1980s–’90s: Recurrent Neural Networks #RNN

=> 2017: #Transformers

=> 2018–2019: #GPT and #GPT-2

=> 2020: #GPT-3

=> 2022: #InstructGPT, #OPT, #BLOOM

=> December 2022: #ChatGPT

"Overnight" success

https://t.co/F2ieO1PAhh https://t.co/JsFps2Bul2

1

5

0

1

344

Giorgio Robino @solyarisoftware

over 3 years ago

https://t.co/2IRy6Kagpd Please register and participate (as annotator) to the amazing @ykilcher project that has the goal to build a crowd sourced #opendata #opensouce alternative to proprietary #chatGPT, based on the #instructgpt logic, with much more! #OpenAssitant #RLHF

Yannic Kilcher 🇸🇨

@ykilcher

over 3 years ago

It's surprisingly fun to collect data for OpenAssistant - Our open-source alternative to ChatGPT! Check out the video: https://t.co/Q0ZaezzjPs #openassistant #chatgpt

ykilcher's tweet photo. It's surprisingly fun to collect data for OpenAssistant - Our open-source alternative to ChatGPT!
Check out the video:
https://t.co/Q0ZaezzjPs
#openassistant #chatgpt https://t.co/uMCO2aA9ll

19

461

89

80

86K

2

8

1

0

855

Top Tweets for #instructGPT

Last Seen Hashtags on Sotwe

Trends for you

Most Popular Users