Major personal update:
After a bit of hard work and a whole lot of extremely good luck, I'm happy to share that I'll be joining @UW (@uw_ischool) to start my PhD this Fall!
It has been an extremely rewarding journey so far and I look forward to the next phase!
Could social media make us less polarized instead of more?
We tested 5 algorithms on 3 platforms with 10,000 people for 6 months during the 2024 election, and found that the answer is yes.
🧵
🚨🎉Excited to announce that our paper “Grok in the Wild: Characterizing the Roles and Uses of Large Language Models on Social Media” is accepted at @icwsm 2026! In this paper, we investigate how, when, and to what effect Grok is used on X.
🧵1/n thread
🚨 New paper! 🔍
“Question the Questions: Auditing Representation in Online Deliberative Processes”
In deliberative polls, participants propose questions for experts but only a few make it to the panel. How representative are those chosen questions of everyone’s interests? 🧵👇
Our auditing tools are integrated into a deliberation platform @StanfordDDL used in 50+ countries, allowing moderators to measure and improve representation in real time (thanks, Harshvardhan Agarwal and others!) (6/7)
(please reshare) I'm recruiting multiple PhD students and Postdocs @uwcse @uwnlp
(https://t.co/I5wQsFnCLL). Focus areas incl. psychosocial AI simulation and safety, Human-AI collaboration.
PhD: https://t.co/ku40wCrpYh
Postdocs: https://t.co/K9HUIPJ5h6
Today we're releasing Community Alignment - the largest open-source dataset of human preferences for LLMs, containing ~200k comparisons from >3000 annotators in 5 countries / languages!
There was a lot of research that went into this... 🧵
Without the YC community this guy would still be operating and would have maybe never been caught
The startup guild of YC is a necessary invention to help founders be more successful than they would be alone
@Marco_Piani Your evaluation idea definitely makes sense for comparing AI-assisted vs human-written notes (we do something similar here: https://t.co/4rl7dSg5C7), but I'm not sure it directly solves the bottleneck issue?
@Marco_Piani Yeah, I agree that rating is a bottleneck, and I think there are a lot of interesting ways to address that issue. For instance, tools/algorithms to assist human raters, transferring ratings across similar notes, etc.