Top Tweets for #InstructGPT
Paper title is all you need: The monumental 2020 #GPT3 is about "Language Models are Few-Shot Learners" (aka #PromptEngineering) and the 2022 #InstructGPT paper is about "Training language models to follow instructions with human feedback" (aka #RLHF). Happy birthday!@OpenAI #LLM
Today is the 2-year birthday of InstructGPT, the mother of all modern LLMs. The AI circle has a time dilation effect - can't believe it's been 2 yrs! InstructGPT laid out the canonical recipe of pre-training -> supervised finetuning -> RLHF, a strategy that everyone is still following till this day (with a bit of variations like DPO).
InstructGPT was likely OpenAI's last paper that detailed how they train the frontier models. Looking back, I think it marked the watershed moment when LLMs finally went from an academic curiosity (GPT-3) to an impactful product (ChatGPT).
Some fun facts:
- InstructGPT didn't invent RLHF. In fact, the blog linked to the OG RLHF work, also done by OpenAI's team in 2017. It was conceived to solve hard-to-specify tasks in simulated robotics. RLHF asked a human annotator to give 900 binary preferences, which helped a simple "hopper" robot learn backflips in sim: https://t.co/gZRPGk6CFO
- InstructGPT was published at NeurIPS 2022 in New Orleans! I was presenting MineDojo at the conference and was quite surprised to see the OpenAI poster there.
- The models came in 3 sizes: 1.3B, 6B, 175B. The labelers strongly preferred Instruct-1.3B to the old, prompt-engineered GPT-3-175B. Microsoft Phi-1, one of the best-known small LMs, was also 1.3B.
- InstructGPT is a master class on how to present your research. The 3-step figure is crystal clear and becomes one of the most iconic visuals in AI. The intro section has no BS and gives 8 take-home messages in bold. The discussions on limitations and bias are grounded and honest.
Your weekend read: https://t.co/50Ko4pTRks

New Article: Analyzing #AI as 'automated subjects' using #psychoanalysis & #criticalmediastudies. Magee et al.'s study delves into LLMs like #OpenAI's #InstructGPT, exploring their design, biases, and interactions with users. #AILanguage #CriticalAI https://t.co/N1DAaYVejM
The all-new #InstructGPT-3.5, supposedly acing chess, still sucks at planning. 9% in blocks world (vs. 6.1% for GPT3.5). Add obfuscation, and it dives to 0.6%!
You'd think an "1800 Elo chess player" would be a tad better at stacking blocks, no? ๐
(w/ @mattdmarq & @rao2z)

The new GPT model, gpt-3.5-turbo-instruct, can play chess around 1800 Elo.
I had previously reported that GPT cannot play chess, but it appears this was just the RLHF'd chat models. The pure completion model succeeds.
https://t.co/UXEkAfEVQT
See game & thoughts below:

BadGPT: Exploring Security Vulnerabilities of #ChatGPT via Backdoor Attacks to #InstructGPT
https://t.co/QA54gTEimu
ไผ่ฉฑAIใญใใใ #Romi ใฎไป็ตใฟใ่ฉฑ้กใฎ #ChatGPT ใง็จใใใใๅญฆ็ฟๆนๆณใงใใใ#InstructGPTใใซใคใใฆ็ดนไปใใฆใพใโโ
https://t.co/6YrFk3wtXl
โป #ใฌใใใใฏLAB ๆง๏ผ@levtech_inc๏ผใซใฆๆฒ่ผใใใ ใใพใใ๏ผ
Vortrag eines Entwicklers von #OpenAI zur spezifischen Trainingsmethode des Reinforcement Learning from Human Feedback (#RLHF) und den #InstructGPT-Modellen, die ganz entscheidend zum spektakulรคren Erfolg von #ChatGPT beigetragen haben. #KI https://t.co/F3WJGN3YHE
Unlike its predecessor, #InstructGPT is far less likely to wander into bizarre lies, emotional rants, and manipulative tangents. #openAI https://t.co/HoSDJIsn0A
Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via
@enricomolinari
#InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL

Axross Recipeใงๆฐใใใฌใทใใๅ
ฌ้ใใพใใ๏ผ
ChatGPTใงใใใใจใ็ดนไปใใใฌใทใ | Kay Hosoda https://t.co/TA0xjG4V9s #AxrossRecipe #ChatGPT #InstructGPT #GPT #AI #ใใฃใใใใใ
Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@mikeflache @Paula_Piccard @Shi4Tech @Xbond49 @tewoz @HaroldSinnott @Ronald_vanLoon @NafisAlam @pierrepinna @Khulood_Almani

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@evankirstel @HeinzvHoenen @FrRonconi @CurieuxExplorer @Hana_ElSayyed @PawlowskiMario @MargaretSiegien @ParisFinForum

RT @enricomolinari Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeM...

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeMONANGE @aure79lien @rwang0 @terence_mills @avataverse

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL

Teaching #ChatGPT to interact with Humans
Guodong Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@RagusoSergio @ingliguori @stratorob @KanezaDiane @nigewillson @kuriharan @enilev @Nicolas2Pinto @TheRudinGroup @njhochman @sonu_monika

Teaching #ChatGPT to interact with
Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@kalydeoo @sebbourguignon @labordeolivier @JeroenBartelse @pierrecappelli @globaliqx @FGraillot @Visible_Banking @richardturrin

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@TheAdityaPatro @RLDI_Lamy @bimedotcom @EvaSmartAI @Sharleneisenia @GlenGilmore @CyrilCoste @TerenceLeungSF @JagersbergKnut

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@Nicochan33 @BetaMoroney @ipfconline1 @psb_dc @SpirosMargaris @mvollmer1 @dinisguarda @YuHelenYu @3itcom @pascal_bornet @Ym78200

"#OpenAI โs is built on decades of research.":
=> 1980sโโ90s: Recurrent Neural Networks #RNN
=> 2017: #Transformers
=> 2018โ2019: #GPT and #GPT-2
=> 2020: #GPT-3
=> 2022: #InstructGPT, #OPT, #BLOOM
=> December 2022: #ChatGPT
"Overnight" success
https://t.co/F2ieO1PAhh

https://t.co/2IRy6Kagpd
Please register and participate (as annotator) to the amazing @ykilcher project that has the goal to build a crowd sourced #opendata #opensouce alternative to proprietary #chatGPT, based on the #instructgpt logic, with much more!
#OpenAssitant #RLHF
It's surprisingly fun to collect data for OpenAssistant - Our open-source alternative to ChatGPT!
Check out the video:
https://t.co/Q0ZaezzjPs
#openassistant #chatgpt

Last Seen Hashtags on Sotwe
HappyStPatricksDay
Seen from Colombia
stepdaddy
Seen from Turkey
fart #animation
Seen from United Kingdom
ุชุงูุฌู
Seen from Germany
feminization
Seen from France
TyraChikocho
Seen from United Kingdom
livejasmin
Seen from Saudi Arabia
brighteyesbrightfuture
Seen from United States
futa
inle
Seen from Turkey
Most Popular Users

Elon Musk 
@elonmusk
240.2M followers

Barack Obama 
@barackobama
119.3M followers

Donald J. Trump 
@realdonaldtrump
111.6M followers

Cristiano Ronaldo 
@cristiano
109M followers

Narendra Modi 
@narendramodi
107M followers

Rihanna 
@rihanna
97.3M followers

NASA 
@nasa
92.1M followers

Justin Bieber 
@justinbieber
90.6M followers

KATY PERRY 
@katyperry
86.8M followers

Taylor Swift 
@taylorswift13
80.6M followers

Lady Gaga 
@ladygaga
72.2M followers

Kim Kardashian 
@kimkardashian
69.4M followers

YouTube 
@youtube
68.6M followers

Virat Kohli 
@imvkohli
68.6M followers

Bill Gates 
@billgates
63.4M followers

The Ellen Show
@theellenshow
62.5M followers

CNN 
@cnn
61.9M followers

Neymar Jr 
@neymarjr
61.1M followers

X 
@x
60.9M followers

Selena Gomez 
@selenagomez
59.9M followers















