We share two blogs outside of the HumanLM paper: https://t.co/K0pOqeMPal
Is Synthetic Data Good Enough to Train User Simulators? — by me and @ArpandeepKhatua
Persona Dropout Makes Robust User Simulators @Es2C003
+ Code is ready here! https://t.co/Iw4eRh63gX
Announcing 🌇HumanLM, a RL framework that trains LLMs to simulate human users’ responses, along with 🌆Humanual, a comprehensive user simulation benchmark
https://t.co/LivDkQ2ioo
🌄 One thing that’s fascinating about our society: human users shape the world and determine the value of almost everything
👨💼 Human reactions reflect how justifiable policies are
👩🎨 Human preferences determine the popularity of blogs/products/media
👩💻 Human feedback evaluates LLMs and makes the best LLM collaborators
🌅If we know how to simulate users **accurately**, we know how things are evaluated and what the future looks like, and we can improve things in a way that like or can collaborate well with.
So, meet HumanLM, our effort to enable a more human-centric future by simulating users.