You’d think these requirements would’ve existed earlier based on the number of hazing-related deaths over the years, but hooray for Biden signing an anti-hazing bill that mandates institutions to share hazing statistics. https://t.co/5M34nYnC9v
New from my team: Synthesis of research on appropriate reliance on generative AI, linked in the MSR blog below. @samirpassi and I will post interesting insights in the coming days.
@mustafasuleyman@Microsoft Welcome to Microsoft! Let's talk about building appropriate reliance on our consumer products - so people don't over-trust AI when it's wrong, and don't under-trust either. I lead a cross-company team on this.
Lex is right, Sam was wrong.
Our research shows that people don’t really check the work of AIs, once those AIs cross a certain threshold of quality. We found that consultants, for example, “fell asleep at the wheel” and did not look for AI errors. Training didn’t help, either.
The RAI Maturity Model is the most meaningful research project I've worked on. Prioritizing collaboration and communication across disciplines is essential for any org that wants to build AI responsibly. Read more at @mihaela_v's LinkedIn #RAI#maturity
Y'all, I've been doing a series of posts about the RAI Maturity Model on LinkedIn https://t.co/EWuRHaACfz
Today: building a *common language* across disciplines so you can work together.
Internship for PhD students: https://t.co/PxKqruAuBd - Work with my team & with MS Office. I am looking for a social scientist with experience in organizational communication, new employee experience, productivity, mixed-methods field studies.
Aether, founded by @erichorvitz and @BradSmi, was the first group to start thinking about Responsible AI at @Microsoft. Our deep thinking continues to influence the company's approach to RAI. See what's top of mind for Aether in Review of 2023 research: https://t.co/52lrtTMpTh
Proud to share Microsoft's AI & Productivity research initiative's release of its first public report. The research measures the impact AI-powered tools like Copilot have on productivity and we saw drastic gains in time saved on common info worker tasks https://t.co/XQkjQ3eNL7
Interesting work on how the power of prompting can lead a generalist LLM to be a specialist, outperforming a fine-tuned model for medical applications. New great research coming from people on my team at Microsoft.
1/8 We’ve published a study of the power of prompting to unleash expertise from GPT-4 on medical benchmarks without additional fine-tuning or expert-curated prompts: https://t.co/qKI2ELKVQa
Summary of results:
I also want to shout out to @mihaela_v Shipi Dhanorkar @samirpassi@RuotongWang1 Zoe Kahn for all your work helping develop the Responsible AI Maturity Model https://t.co/ah1UYULn5U and to the 90+ RAI consultants & AI practitioners we interviewed that made its creation possible!
Join me Wednesday as I talk with @HeyMarvinApp about how my colleagues at Microsoft & I created the Responsible AI Maturity Model. Hint: it involved analyzing a huge amount of complex qualitative data and I'll share many of our learnings. Register at: https://t.co/SFeuDSxLFr
New paper: Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
We survey over 250 papers to review challenges with RLHF with a focus on large language models. Highlights in thread 🧵
Today is the day that the world's first algorithmic bias auditing law goes into effect in NYC! Has anyone seen one in the wild yet? The law does not exactly make it easy to find them. https://t.co/4MKkB9qUgZ