Top Tweets for #instructGPT
Paper title is all you need: The monumental 2020 #GPT3 is about "Language Models are Few-Shot Learners" (aka #PromptEngineering) and the 2022 #InstructGPT paper is about "Training language models to follow instructions with human feedback" (aka #RLHF). Happy birthday!@OpenAI #LLM
Today is the 2-year birthday of InstructGPT, the mother of all modern LLMs. The AI circle has a time dilation effect - can't believe it's been 2 yrs! InstructGPT laid out the canonical recipe of pre-training -> supervised finetuning -> RLHF, a strategy that everyone is still following till this day (with a bit of variations like DPO).
InstructGPT was likely OpenAI's last paper that detailed how they train the frontier models. Looking back, I think it marked the watershed moment when LLMs finally went from an academic curiosity (GPT-3) to an impactful product (ChatGPT).
Some fun facts:
- InstructGPT didn't invent RLHF. In fact, the blog linked to the OG RLHF work, also done by OpenAI's team in 2017. It was conceived to solve hard-to-specify tasks in simulated robotics. RLHF asked a human annotator to give 900 binary preferences, which helped a simple "hopper" robot learn backflips in sim: https://t.co/gZRPGk6CFO
- InstructGPT was published at NeurIPS 2022 in New Orleans! I was presenting MineDojo at the conference and was quite surprised to see the OpenAI poster there.
- The models came in 3 sizes: 1.3B, 6B, 175B. The labelers strongly preferred Instruct-1.3B to the old, prompt-engineered GPT-3-175B. Microsoft Phi-1, one of the best-known small LMs, was also 1.3B.
- InstructGPT is a master class on how to present your research. The 3-step figure is crystal clear and becomes one of the most iconic visuals in AI. The intro section has no BS and gives 8 take-home messages in bold. The discussions on limitations and bias are grounded and honest.
Your weekend read: https://t.co/50Ko4pTRks

New Article: Analyzing #AI as 'automated subjects' using #psychoanalysis & #criticalmediastudies. Magee et al.'s study delves into LLMs like #OpenAI's #InstructGPT, exploring their design, biases, and interactions with users. #AILanguage #CriticalAI https://t.co/N1DAaYVejM
The all-new #InstructGPT-3.5, supposedly acing chess, still sucks at planning. 9% in blocks world (vs. 6.1% for GPT3.5). Add obfuscation, and it dives to 0.6%!
You'd think an "1800 Elo chess player" would be a tad better at stacking blocks, no? 🙄
(w/ @mattdmarq & @rao2z)

The new GPT model, gpt-3.5-turbo-instruct, can play chess around 1800 Elo.
I had previously reported that GPT cannot play chess, but it appears this was just the RLHF'd chat models. The pure completion model succeeds.
https://t.co/UXEkAfEVQT
See game & thoughts below:

BadGPT: Exploring Security Vulnerabilities of #ChatGPT via Backdoor Attacks to #InstructGPT
https://t.co/QA54gTEimu
会話AIロボット #Romi の仕組み、話題の #ChatGPT で用いられた学習方法である「#InstructGPT」について紹介してます↓↓
https://t.co/6YrFk3wtXl
※ #レバテックLAB 様(@levtech_inc)にて掲載いただきました!
Vortrag eines Entwicklers von #OpenAI zur spezifischen Trainingsmethode des Reinforcement Learning from Human Feedback (#RLHF) und den #InstructGPT-Modellen, die ganz entscheidend zum spektakulären Erfolg von #ChatGPT beigetragen haben. #KI https://t.co/F3WJGN3YHE
Unlike its predecessor, #InstructGPT is far less likely to wander into bizarre lies, emotional rants, and manipulative tangents. #openAI https://t.co/HoSDJIsn0A
Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via
@enricomolinari
#InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL

Axross Recipeで新しいレシピを公開しました!
ChatGPTできることを紹介するレシピ | Kay Hosoda https://t.co/TA0xjG4V9s #AxrossRecipe #ChatGPT #InstructGPT #GPT #AI #チャットボット
Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@mikeflache @Paula_Piccard @Shi4Tech @Xbond49 @tewoz @HaroldSinnott @Ronald_vanLoon @NafisAlam @pierrepinna @Khulood_Almani

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@evankirstel @HeinzvHoenen @FrRonconi @CurieuxExplorer @Hana_ElSayyed @PawlowskiMario @MargaretSiegien @ParisFinForum

RT @enricomolinari Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeM...

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@JimHarris @AnthonyRochand @dmgerbino @RAlexJimenez @JeromeMONANGE @aure79lien @rwang0 @terence_mills @avataverse

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #ehealth #marketing #finserv #fintech #govtech
@albertogaruccio @rshevlin @Clagett @TylerCohenWood @cgledhill @JBarbosaPR @efipm @Damien_CABADI @Chris_Skinner @SabineVdL

Teaching #ChatGPT to interact with Humans
Guodong Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@RagusoSergio @ingliguori @stratorob @KanezaDiane @nigewillson @kuriharan @enilev @Nicolas2Pinto @TheRudinGroup @njhochman @sonu_monika

Teaching #ChatGPT to interact with
Humans Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@kalydeoo @sebbourguignon @labordeolivier @JeroenBartelse @pierrecappelli @globaliqx @FGraillot @Visible_Banking @richardturrin

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@TheAdityaPatro @RLDI_Lamy @bimedotcom @EvaSmartAI @Sharleneisenia @GlenGilmore @CyrilCoste @TerenceLeungSF @JagersbergKnut

Teaching #ChatGPT to interact with Humans
Guodong (Troy) Zhao via @enricomolinari #InstructGPT #Metaverse #marketing #finserv #fintech #govtech
@Nicochan33 @BetaMoroney @ipfconline1 @psb_dc @SpirosMargaris @mvollmer1 @dinisguarda @YuHelenYu @3itcom @pascal_bornet @Ym78200

"#OpenAI ’s is built on decades of research.":
=> 1980s–’90s: Recurrent Neural Networks #RNN
=> 2017: #Transformers
=> 2018–2019: #GPT and #GPT-2
=> 2020: #GPT-3
=> 2022: #InstructGPT, #OPT, #BLOOM
=> December 2022: #ChatGPT
"Overnight" success
https://t.co/F2ieO1PAhh

https://t.co/2IRy6Kagpd
Please register and participate (as annotator) to the amazing @ykilcher project that has the goal to build a crowd sourced #opendata #opensouce alternative to proprietary #chatGPT, based on the #instructgpt logic, with much more!
#OpenAssitant #RLHF
It's surprisingly fun to collect data for OpenAssistant - Our open-source alternative to ChatGPT!
Check out the video:
https://t.co/Q0ZaezzjPs
#openassistant #chatgpt

Last Seen Hashtags on Sotwe
Most Popular Users

Elon Musk 
@elonmusk
240.4M followers

Barack Obama 
@barackobama
119.3M followers

Donald J. Trump 
@realdonaldtrump
111.7M followers

Cristiano Ronaldo 
@cristiano
110.1M followers

Narendra Modi 
@narendramodi
107M followers

Rihanna 
@rihanna
97.5M followers

NASA 
@nasa
92.1M followers

Justin Bieber 
@justinbieber
90.8M followers

KATY PERRY 
@katyperry
87.4M followers

Taylor Swift 
@taylorswift13
81.2M followers

Lady Gaga 
@ladygaga
72.8M followers

Kim Kardashian 
@kimkardashian
69.7M followers

Virat Kohli 
@imvkohli
69.5M followers

YouTube 
@youtube
68.7M followers

Bill Gates 
@billgates
63.7M followers

The Ellen Show
@theellenshow
62.5M followers

Neymar Jr 
@neymarjr
62.2M followers

CNN 
@cnn
61.9M followers

X 
@x
60.8M followers

Selena Gomez 
@selenagomez
60.5M followers















