@florian_tramer They should customize the safeguard?message for recruitment targets. Instead of *conversation aborted* it should say “respond to recruiter to continue”
Bacteria move around using a molecular machine called the flagellar motor that rotates faster than the flywheel of a race car engine and switches directions in an instant. After 50 yrs, scientists have finally figured out how it works. “My lifelong quest is now fulfilled.” Link⤵️
@rmcentush Reminds me of that time I inadvertently stole all of my bf’s saved passwords, bc he let me log into his iPad to setup a HomePod. iCloud auto sync did the deed.
Prediction: unless patched, GPT's tendency to say delve will be assimilated into North American style English. Soon it won't stand out as odd anymore.
More broadly, RLHF'ed LLMs will shape cultural norms in unexpected ways.
@tszzl@emollick Contemporary AI makers of AI slop (eg it’s not x it’s y) are a lot closer to what real users prefer. They’re genuinely useful rethorical devices, albeit now a bit over used
@tszzl@emollick this I think is just down to improved RL rewards. “Delve” was essentially a form of reward hacking: a phrase that Nigerian RLHF raters rewarded (due to its use in Nigerian formal English ), but that wasn’t actually liked by most users.
@_arohan_ In the process there was exactly one time when I received readability feedback that asked me to change smth. And upon review, it was revealed that the code generated by Gemini in fact had correctly followed the guidelines, and the human had misinterpreted them.