Matthew Jörke

@mjoerke

PhD Student @stanford CS | HCI/AI, health behavior change | he/him

Stanford, CA

Joined May 2024

87 Following

121 Followers

24 Posts

Pinned Tweet

Matthew Jörke @mjoerke

2 months ago

I’m excited to share that Bloom, where we ran a four week study on LLM health coaching, just won a Best Paper Award at CHI! 🏆 Paper: https://t.co/MRSQntEosA Website: https://t.co/OOfGWqhcTf Interest form: https://t.co/pkTxc3Lpua Come see my talk! https://t.co/VfJFT7nXnf [1/11]

mjoerke's tweet photo. I’m excited to share that Bloom, where we ran a four week study on LLM health coaching, just won a Best Paper Award at CHI! 🏆

Paper: https://t.co/MRSQntEosA
Website: https://t.co/OOfGWqhcTf
Interest form: https://t.co/pkTxc3Lpua
Come see my talk! https://t.co/VfJFT7nXnf

[1/11] https://t.co/1ckPe8DgQx

18K

mjoerke retweeted

Vishakh Padmakumar

@vishakh_pk

16 days ago

People are increasingly worried that AI tools make us overreliant. But how do we actually measure this? We introduce Offloading Score, a measure of reliance based on the fraction of cognitive effort offloaded to AI while completing a task. In a controlled user study, Offloading Score detects increased reliance under time pressure, while several common alternatives do not. (1/9)

$vishakh_pk's tweet photo. People are increasingly worried that AI tools make us overreliant. But how do we actually measure this? We introduce Offloading Score, a measure of reliance based on the fraction of cognitive effort offloaded to AI while completing a task. In a controlled user study, Offloading Score detects increased reliance under time pressure, while several common alternatives do not. (1/9)$

213

100

77K

mjoerke retweeted

Diyi Yang

@Diyi_Yang

about 1 month ago

The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

Diyi_Yang's tweet photo. The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :)

With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

288

183

54K

mjoerke retweeted

Michael Y. Li

@michaelyli_

about 2 months ago

Can a language model learn, end-to-end, what to keep in its own KV cache and what to throw away? Can it learn to forget while it learns to reason? Deep learning's central lesson: capability emerges from end-to-end optimization, not heuristics/strong inductive biases. But for efficiency, we rely heavily on hand-designed approaches. 🗑️ Introducing Neural Garbage Collection (NGC): we train a language model to jointly reason and manage its own KV cache, using reinforcement learning with outcome-based task reward alone. No SFT, no proxy objectives, no summarization in natural language. New paper with @jubayer_hamid, Emily Fox, and @noahdgoodman!

michaelyli_'s tweet photo. Can a language model learn, end-to-end, what to keep in its own KV cache and what to throw away? Can it learn to forget while it learns to reason?

Deep learning's central lesson: capability emerges from end-to-end optimization, not heuristics/strong inductive biases. But for efficiency, we rely heavily on hand-designed approaches.

🗑️ Introducing Neural Garbage Collection (NGC): we train a language model to jointly reason and manage its own KV cache, using reinforcement learning with outcome-based task reward alone. No SFT, no proxy objectives, no summarization in natural language.

New paper with @jubayer_hamid, Emily Fox, and @noahdgoodman!

905

132

738

165K

Matthew Jörke @mjoerke

2 months ago

@FerryLee_AIPOCH The app integrates with Apple's HealthKit API so while we used Apple Watches in our study, the platform itself is not tied to any particular wearable! If your wearable/smart device can read/write to HealthKit, Bloom can read that data too

Matthew Jörke @mjoerke

2 months ago

18K

Matthew Jörke @mjoerke

2 months ago

If you’re at CHI, come see our presentation! https://t.co/XjlWxaw8ql And if you’re interested in chatting more, drop me a line at [email protected]. We’d love to hear from you! [11/11]

161

Matthew Jörke @mjoerke

2 months ago

We’re actively working on releasing Bloom to the public. If you’d like to try it out, please fill out our interest form: https://t.co/QihNodi7Qt If you’re interested in building on Bloom, our code is open source https://t.co/ngVIoDZmVQ [10/11]

209

mjoerke retweeted

Aishwarya Mandyam

@Aishwarya_R_M

7 months ago

✨I'm on the research scientist and postdoc job market! I'll be graduating from my PhD this academic year with a thesis that focuses on reinforcement learning and healthcare. ✨

334

52K

Matthew Jörke @mjoerke

about 2 years ago

@mbodhisattwa @oshaikh13 Thanks for sharing! Looking forward to reading this :)

Matthew Jörke @mjoerke

about 2 years ago

Many thanks to my co-authors @sapkotashardul_ , Lyndsea Warkenthien, Niklas Vainio, @PSchmiedmayer, @EmmaBrunskill, @landay!!

560

Matthew Jörke @mjoerke

about 2 years ago

In a user study with 16 participants, we find that GPTCoach can adhere to motivational interviewing principles and contextualize a user's wearable data to their unique circumstances. Participants also appreciated its supportive and non-judgmental tone.

654

Matthew Jörke @mjoerke

about 2 years ago

In a counterfactual comparison to vanilla GPT4, GPTCoach is more consistent with motivational interviewing, asking more open-ended questions and giving advice with permission.

mjoerke's tweet photo. In a counterfactual comparison to vanilla GPT4, GPTCoach is more consistent with motivational interviewing, asking more open-ended questions and giving advice with permission. https://t.co/nFroo84wOs

624

Matthew Jörke @mjoerke

about 2 years ago

We built GPTCoach, a GPT4-based chatbot that implements an evidence-based health coaching program, uses counseling strategies from motivational interviewing, and can query and visualize a user’s health data from a wearable through tool use.

mjoerke's tweet photo. We built GPTCoach, a GPT4-based chatbot that implements an evidence-based health coaching program, uses counseling strategies from motivational interviewing, and can query and visualize a user’s health data from a wearable through tool use. https://t.co/zVCgPn9Y2K

Matthew Jörke @mjoerke

about 2 years ago

Through formative interviews with 22 participants, we learned that *all* health experts adopted a facilitative approach that did not give unsolicited advice. Notably, this contrasts with how current LLMs are trained to answer questions and give advice.

423

Matthew Jörke

@mjoerke

Last Seen Users on Sotwe

Trends for you

Most Popular Users