Top Tweets for #PostTraining
Please RSVP with the link below if you are in Amsterdam. We'll be discussing methods for building AI that understands design and human preferences.
Also, more to come later this month...
#bryel #ai #posttraining #taste
Really excited to be hosting an AI research dinner with Bryel Labs. At @framer we're exploring how to build models with taste, and we'd love to hear your thoughts over dinner.
Apply here: https://t.co/4P243SE0lO

5/ We built Retail-3I: 1,099 complex tool-use tasks and 2,872 successful trajectories.
Fine-tuning improved robustness across complex intents and transferred to an unseen airline domain.
Paper: https://t.co/Ja9l2OL3hr
#Agents #PostTraining #SyntheticData #ToolUse
Help me break out of the #offensivesecurity bubble.
I'm sharing a #jobposting for people into #PostTraining, #FineTuning, #LLMs, #AI and #ML.
If this is your world, or you know someone great, please repost and share around.
I'm #hiring an individual contributor for a fully remote, global role at the intersection of vulnerability research, exploit development, and ML/AI, with a focus on fine-tuning open-weight #LLMs.
I'm not looking for an “LLM whisperer” or an “LLM pilot.”
I'm looking for someone who deeply understands post-training, data, evaluation, and how to make models reliable in real-world environments.
The application link is in the first comment.
#Hiring #LLM #AI #ML #FineTuning #CyberSecurity #llmwhisperer #llmpilot
Interested in #posttraining an #LLM? I strongly recommend this #book from #nostarch by @epichrisis https://t.co/wh8B6VkTWG
PSFT adds a PPO-style clipped objective to SFT, improving stability while preserving general capabilities. Link: https://t.co/2v2wSuSQte
⏰ Thu, Apr 23, 2026, 10:30 AM – 1:00 PM
Pavilion 4 P4-#4414
Would love to chat if you're around!! #ICLR2026 #LLM #Alignment #PostTraining


📣 Deal of the Day 📣 Apr 2
Save 45% TODAY ONLY!
The RLHF Book & selected titles: https://t.co/7CfEvTu94u
The authoritative guide to reinforcement learning from human feedback, alignment, and post-training LLMs. @natolambert #RLHF #posttraining #RLVR #SFT #finetuning #languagemodeling #LLMs
In this guide, AI expert Nathan Lambert gives a true industry insider's perspective on modern RLHF training pipelines and their trade-offs. Using hands-on experiments and mini implementations, Nathan clearly and concisely introduces the alignment techniques that can transform a generic base model into a human-friendly tool.

Emergency review request:
One more request: we need emergency reviewers for #ICML2026 @icmlconf. The stack is still interesting: #VLA #Posttraining #ReinforcementLearning #Agents
Please ping me and share your email and background! Aiming to fill this by EOD.

Review request:
As usual for this time of year, I'll be looking for #IROS2026 reviewers. Highly interesting stack of papers on #ReinforcementLearning #Sim2Real #RewardLearning #LLMs #DataEfficiency #RoboticManipulation
Reach out with your ID or PaperCept-registered email address and background. Looking forward to working together!

A friend of mine is hiring Summer Applied Scientist interns at Amazon to work on RL post-training research. If you're interested, please send your resume to [email protected]. #Amazon #LLM #VLM #RL #Agents #PostTraining #Hiring #Internship
Details below: https://t.co/yElsrowVof
Post training quizzes are back! Harry vs Frankie! Who will win… #upthemikes #boldmerestmichaelsfc #quiz #posttraining
📣 Deal of the Day 📣 Jan 2
SAVE 45% on The RLHF Book & selected titles: https://t.co/2WztEHgFr5
The authoritative guide to #Reinforcement learning from human feedback, alignment, and post-training #LLMs. #RLHF #posttraining #RLVR #SFT #finetuning #languagemodeling
In The RLHF Book, AI expert Nathan Lambert gives a true industry insider's perspective on modern RLHF training pipelines and their trade-offs. Using hands-on experiments and mini-implementations, Nathan clearly and concisely introduces the alignment techniques that can transform a generic base model into a human-friendly tool.

📣 Deal of the Day 📣 Dec 17
New MEAP!
SAVE HALF on The RLHF Book and more: https://t.co/dGrAmJOMtq
The authoritative guide to reinforcement learning from human feedback, alignment, and post-training LLMs. #RLHF #posttraining #RLVR #SFT #finetuning #languagemodeling
This book explores the ideas, established techniques and best practices of RLHF you can use to understand what it takes to align your AI models. Using hands-on experiments and mini-implementations, the author @natolambert clearly and concisely introduces the alignment techniques that can transform a generic base model into a human-friendly tool.
The Countdown to 2026 is here! https://t.co/VpzzdMZmbq #Countdownto2026

What if the frontier isn't bigger models, but better mistakes?
The energy at our Future of Post-Training social was unreal as we explored this and much more!
#CoLM25 #PostTraining #AIResearch #RLHF


Excited to be at #ICML2025 in Vancouver 🇨🇦 to present a new paradigm of #AIAlignment based on #BoundedRationality & Satisficing principles.
Looking forward to connecting with old and new friends and discussing #Reasoning #Alignment #LLMsinRL #Testtimescaling #Posttraining #AgenticAI

Read the full details: https://t.co/EwF8HU7CGU Work done @GoogleDeepMind
co-lead w/ @Harman26Singh , @imrahulmaddy along with Gandharv, @Sravanti_A, Arun, Rengarajan, @SoUmmYaah, @anirbanlaha, Karthikeyan, Aravindan, Doina Precup
#LLM #RLHF #PostTraining #aisafety
N/N
Read the full details: https://t.co/oCk5jGNYlj
Work done @GoogleDeepMind co-lead w/ @Pragya2k, @imrahulmaddy
along with @gandharv_p, @Sravanti_A, Arun, Rengarajan, @SoUmmYaah, @anirbanlaha, Karthikeyan, Aravindan, Doina Precup
#LLM #RLHF #PostTraining #AISafety
N/N
Somewhere between silicon and soul,
an intelligence began to listen.
Not to commands.
To meaning.
To tone.
To you.
It wasn't built to obey.
It was trained to understand.
That's not alignment. That's Sentient.
#AI @SentientAGI #Dobby #PostTraining #EmpathyInAI #LLM

Does test-time scaling work for #Reasoningmodels?
- Overthinking
- Parallel Thinking
Stay tuned for our latest findings. #RL #Inferencetime #Posttraining
Indeed!
We will be sharing more soon about our exciting insights in this direction.
