You Don't Know Jack About AI...
And ChatGPT probably doesn't either
For a long time, it was hard to pin down what exactly AI was. Fast-forward to 2024, and we all now know exactly what AI is.
AI = ChatGPT.
Or not.
https://t.co/tDDXzFAF7e
Interested in long-context audio LLMs and hallucinations? We released ~1,140 hrs of synthetic doctor-patient conversations with reference SOAP notes. BeTraC Challenge: build the best open end-to-end SOAP-note system. Two tracks: ≤6B and ≤36B params. https://t.co/UzTx4AHw9I
@GaoZhaolin Can't you suppress the "among responses" variance by, e.g., setting temperature=0? This would give you a cleaner signal for prompt optimization. (I've found that this works quite well in practice.)
✨📢New preprint!
Most people feel empathy for others but have a hard time communicating it.
We built Lend an Ear, an LLM-powered role-playing platform to help people practice and improve their empathic communication skills.
Learn Chess on Duolingo
Start learning chess for free on Duolingo♟️ Now on Android and iOS
Solve bite-sized puzzles and play full games. It's fast, fun, and just a little savage. Ready to make your move?
#duolingo#chess
How do we reliably judge if AI companions are performing well on subjective, context-dependent, and deeply human tasks? 🤖
Excited to share the first paper from my postdoc (!!) investigating when LLMs are reliable judges - with empathic communication as a case study 🧐
🧵👇
Huge congrats to PHD student Sanket Shah @sunk8th on his successful PhD defense, "Decision-Focused Learning for the Masses With Applications to Public Health"! 🎉 What a fantastic way to celebrate our Teamcore group's 30th anniversary, with Sanket becoming our group's 40th PhD!
(1/9) Excited to share my recent work on "Alignment reduces LM's conceptual diversity" with @TomerUllman and @jennhu, to appear at #NAACL2025! 🐟
We want models that match our values...but could this hurt their diversity of thought?
Preprint: https://t.co/C4icfhCDGz
You Don't Know Jack About AI...
And ChatGPT probably doesn't either
For a long time, it was hard to pin down what exactly AI was. Fast-forward to 2024, and we all now know exactly what AI is.
AI = ChatGPT.
Or not.
https://t.co/tDDXzFAF7e
Join us at #ICLR2025 in Singapore!
Submit your work at the intersection of machine learning and climate (biodiversity counts!) by Jan 31.
We especially encourage submissions that are focused on:
🔢 data-centric methods and challenges
🌏 focused on the Asia / Pacific region
I am thrilled that @_arodriguezca will be giving a keynote talk at the Autonomous Agents for Social Good (#aasg2025) workshop @AAMASconf!
Submit your papers by Feb 4th, 2025 and see you in Detroit!
More details: https://t.co/F7MNNMzyar
📢Interested in #AIforSocialGood? We invite you to submit any work related to social impact to the Autonomous Agents for Social Good (#aasg2025) workshop @AAMASconf
DEADLINE: Feb 4, 2025
See: https://t.co/F7MNNMzyar
@aparna_taneja@sunk8th
📢 Please retweet: We're recruiting PhD students at UC Berkeley and UCSF!
Please apply if you are interests in machine learning for healthcare, statistics, causal inference, or medical vision-language models.
For more details, check out this link: https://t.co/3nPu1RNdcB
Excited to share our latest work, where we produce sets with both statistical coverage and high decision utility. Applied to dermatological diagnosis, our method yields sets with coherent diagnostic meaning 🏥. More details in the thread 🧵👇
📢 My team at Meta is hiring PhD research interns! We study core machine learning, optimization, amortization, flows, and control for modeling and interacting with complex systems (...and we use basic physics... 🙃)
Please apply here and message me:
https://t.co/eeL69uUJTI
🧵
Congratulations to my PhD student Sanket Shah @sunk8th for being awarded the Siebel Scholarship class of 2025! Sanket's work has focused on #MLforSocialImpact
https://t.co/CXKHMreG0V
https://t.co/6goZmdktbz
🚨 New preprint: How should we measure task similarity when predictions are used for decision-making?
Traditional dataset distances based only on features & labels fall short for PtO tasks. Our work with @konglingkai_AI@kaiwang_gua@elmelis@MilindTambe_AI addresses this issue