Edward Chen

@edchene

CS PhD Student @StanfordAILab. AI Alignment/Safety.

Joined July 2013

1.5K Following

254 Followers

33 Posts

edchene retweeted

Julie Kallini ✨

@JulieKallini

25 days ago

Fast Byte Latent Transformer is accepted to ICML 2026! ⚡🥪 Byte-level LMs promise to free us from subword tokenizers, but decoding one byte at a time is super slow. We make BLT generation more efficient with BLT-D: text diffusion for parallel byte decoding. 1/

735

111

459

97K

edchene retweeted

Qinan Yu @qinan_yu

about 1 month ago

1/8 RLVR improves accuracy but does not always lead to causal and verifiable CoTs. Surprisingly, this happens even on reasoning-intensive tasks! But we can fix this with reward shaping and SFT-before-RL.

qinan_yu's tweet photo. 1/8 RLVR improves accuracy but does not always lead to causal and verifiable CoTs.

Surprisingly, this happens even on reasoning-intensive tasks!

But we can fix this with reward shaping and SFT-before-RL. https://t.co/maUpuANn63

240

207

34K

edchene retweeted

Ken Liu

@kenziyuliu

3 months ago

Can we build a blind, *unlinkable inference* layer where ChatGPT/Claude/Gemini can't tell which call came from which users, like a “VPN for AI inference”? Yes! Blog post below + we built it into open source infra/chat app and served >15k prompts at Stanford so far. How it helps with AI user privacy: # The AI user privacy problem If you ask AI to analyze your ChatGPT history today, it’s surprisingly easy to infer your demographics, health, immigration status, and political beliefs. Every prompt we send accumulates into an (identity-linked) profile that the AI lab controls completely and indefinitely. At a minimum this is a goldmine for ads (as we know now). A bigger issue is the concentration of power: AI labs can easily become (or asked to become) a Cambridge Analytica, whistleblow your immigration status, or work with health insurance to adjust your premium if they so choose. This is a uniquely worse problem than search engines because your average query is now more revealing (not just keywords), interactive, and intelligence is now cheap. Despite this, most of us still want these remote models; they’re just too good and convenient! (this is aka the "privacy paradox".) # Unlinkable inference as a user privacy architecture The idea of unlinkable inference is to add privacy while preserving access to the remote models controlled by someone else. A “privacy wrapper” or “VPN for AI inference”, so to speak. Concretely, it’s a blind inference middle layer that: (1) consists of decentralized proxies that anyone can operate; (2) blindly authenticates requests (via blind signatures / RFC9474,9578) so requests are provably sandboxed from each other and from user identity; (3) relays prompts over randomly chosen proxies that don’t see or log traffic (via client-side ephemeral keys or hosting in TEEs); and (4) the provider simply sees a mixed pool of anonymous prompts from the proxies. No state, pseudonyms, or linkable metadata. If you squint, an unlinkable inference layer is essentially a vendor for per-request, anonymous, ephemeral AI access credentials (for users or agents alike). It partitions your context so that user tracking is drastically harder. Obviously, unlinkability isn’t a silver bullet: the prompt itself still goes to the remote model and can leak privacy (so don't use our chat app for a therapy session!). It aims to combat *longitudinal tracking* as a major threat to user privacy, and its statistical power increases quickly by mixing more users and requests. Unlinkability can be applied at any granularity. For an AI chat app, you can unlinkably request a fresh ephemeral key for every session so tracking is virtually impossible. # The Open Anonymity Project We started this project with the belief that intelligence should be a truly public utility. Like water and electricity, providers should be compensated by usage, not who you are or what you do with it. We think unlinkable inference is a first step towards this “intelligence neutrality”. # Try it out! It’s quite practical - Chat app “oa-chat”: https://t.co/ELf8LvxFzX (<20 seconds to get going) - Blog post that should be a fun read: https://t.co/OwFmyFlZH5 - Project page: https://t.co/Swerz1xDE2 - GitHub: https://t.co/38CeKajCy2

830

158

794

384K

Edward Chen @edchene

3 months ago

@ShirleyYXWu Congrats Shirley!!

172

Who to follow

ilkhch

@ilkhch

Keywords: engineer-artificial intelligence-healthcare-diagnostics-ultrasound-deep learning-AI-professor-Co-Founder@PONS Opnions are my own...

Gepeng (Daniel) JI

@gepeng_ji

PhD student @ANU. I am working on interesting things in the world of computer vision, including medical image analysis and video understanding.

Nafiseh Ashgar

@ashkarnafiseh

Machine learning scientist. Interested in: Deep Learning, Computer Vision.

edchene retweeted

Vignesh Kothapalli

@kvignesh1420

4 months ago

Relational Foundation Models face a scaling problem: diverse training datasets are rarely public due to privacy constraints 🔒. 🚀 We are excited to introduce "PluRel": a framework that synthesizes diverse multi-table relational databases from scratch, unlocking scaling laws for RFMs. 🧵 Kudos to the amazing collaborators at @StanfordAILab @Kumo_ai_team , and @SAP : @_rishabhranjan_ @VHudovernik @vijaypradwi @johanneshoffart @guestrin @jure

19K

edchene retweeted

Shirley Wu

@ShirleyYXWu

4 months ago

Announcing 🌇HumanLM, a RL framework that trains LLMs to simulate human users’ responses, along with 🌆Humanual, a comprehensive user simulation benchmark https://t.co/LivDkQ2ioo 🌄 One thing that’s fascinating about our society: human users shape the world and determine the value of almost everything 👨‍💼 Human reactions reflect how justifiable policies are 👩‍🎨 Human preferences determine the popularity of blogs/products/media 👩‍💻 Human feedback evaluates LLMs and makes the best LLM collaborators 🌅If we know how to simulate users **accurately**, we know how things are evaluated and what the future looks like, and we can improve things in a way that like or can collaborate well with. So, meet HumanLM, our effort to enable a more human-centric future by simulating users.

ShirleyYXWu's tweet photo. Announcing 🌇HumanLM, a RL framework that trains LLMs to simulate human users’ responses, along with 🌆Humanual, a comprehensive user simulation benchmark

https://t.co/LivDkQ2ioo

🌄 One thing that’s fascinating about our society: human users shape the world and determine the value of almost everything
👨‍💼 Human reactions reflect how justifiable policies are
👩‍🎨 Human preferences determine the popularity of blogs/products/media
👩‍💻 Human feedback evaluates LLMs and makes the best LLM collaborators

🌅If we know how to simulate users **accurately**, we know how things are evaluated and what the future looks like, and we can improve things in a way that like or can collaborate well with.

So, meet HumanLM, our effort to enable a more human-centric future by simulating users.

595

103

573

118K

edchene retweeted

Sanmi Koyejo @sanmikoyejo

4 months ago

1/ Wonderful student projects from CS329H (Fall ’25) ML from Human Preferences at Stanford University! 🚀 @sangttruong @andreas_h0wpt and I introduced students to preference learning + alignment, culminating in final projects. Out of ~50, here are 5 standouts 👇

edchene retweeted

Liana @lianapatel_

9 months ago

Interested in building and benchmarking deep research systems? Excited to introduce DeepScholar-Bench, a live benchmark for generative research synthesis, from our team at Stanford and Berkeley! 🏆Live Leaderboard https://t.co/SdE1tRtrYJ 📚 Paper: https://t.co/E1CBUqVMjO 🛠️ Code: https://t.co/FUxmY9QmBE 🧵👇

lianapatel_'s tweet photo. Interested in building and benchmarking deep research systems?

Excited to introduce DeepScholar-Bench, a live benchmark for generative research synthesis, from our team at Stanford and Berkeley!

🏆Live Leaderboard https://t.co/SdE1tRtrYJ
📚 Paper: https://t.co/E1CBUqVMjO
🛠️ Code: https://t.co/FUxmY9QmBE

🧵👇

182

140

27K

edchene retweeted

Qinan Yu @qinan_yu

about 1 year ago

🎀 fine-grained, interpretable representation steering for LMs! meet RePS — Reference-free Preference Steering! 1⃣ outperforms existing methods on 2B-27B LMs, nearly matching prompting 2⃣ supports both steering and suppression (beat system prompts!) 3⃣ jailbreak-proof (1/n)

qinan_yu's tweet photo. 🎀 fine-grained, interpretable representation steering for LMs!
meet RePS — Reference-free Preference Steering!

1⃣ outperforms existing methods on 2B-27B LMs, nearly matching prompting
2⃣ supports both steering and suppression (beat system prompts!)
3⃣ jailbreak-proof

(1/n) https://t.co/aoDP9jjTIO

233

183

66K

edchene retweeted

Liana @lianapatel_

over 1 year ago

Some of the most exciting AI apps require LLM reasoning over large datasets at test time. For these types of NL questions, RAG or Text2SQL + your favorite LLM are simply not enough. Excited to announce our new leaderboard, from the TAG team at Stanford and Berkeley, to benchmark progress on these difficult types of NL questions over data. Make a submission & top our leaderboard: https://t.co/Y753p8oGhJ

lianapatel_'s tweet photo. Some of the most exciting AI apps require LLM reasoning over large datasets at test time.

For these types of NL questions, RAG or Text2SQL + your favorite LLM are simply not enough.

Excited to announce our new leaderboard, from the TAG team at Stanford and Berkeley, to benchmark progress on these difficult types of NL questions over data.

Make a submission & top our leaderboard: https://t.co/Y753p8oGhJ

12K

edchene retweeted

Sanmi Koyejo @sanmikoyejo

over 1 year ago

Thanks @RylanSchaeffer And thanks to my students and collaborators who make the work possible! Congrats to @chelseabfinn , @DorsaSadigh, and @mkwoot!

edchene retweeted

Yuhui Zhang

@Zhang_Yu_hui

over 1 year ago

🔍 Vision language models are getting better - but how do we evaluate them reliably? Introducing AutoConverter: transforming open-ended VQA into challenging multiple-choice questions! Key findings: 1️⃣ Current open-ended VQA eval methods are flawed: rule-based metrics correlate poorly with true performance (0.09 on VQAv2), while model-based eval has reproducibility issues (updates in GPT-4o versions constantly increase scores by 6% on MMVet). 2️⃣ To address this challenge, we propose AutoConverter, an agentic framework that automatically converts open-ended VQA to multiple-choice questions. It generates distractors matching/exceeding human difficulty, with only 3% of generated questions incorrect. 3️⃣ Using AutoConverter, we built VMCBench: 9,018 multiple-choice questions from 20 datasets testing 33 VLMs in a unified format! 🎯 Our goal: Make VLM evaluation more reliable, efficient & scalable https://t.co/BSnOW8QmYr Joint work with a really fantastic team: @hhhhh2033528 (co-lead) @leoliuym @XiaohanWang96 @jmhb0 @elaine__sui @ChenyuW64562111 @AkliluJosiah2 @Ale9806_ @anjiangw advised by @lschmidt3 @yeung_levy!

Zhang_Yu_hui's tweet photo. 🔍 Vision language models are getting better - but how do we evaluate them reliably? Introducing AutoConverter: transforming open-ended VQA into challenging multiple-choice questions!

Key findings:

1️⃣ Current open-ended VQA eval methods are flawed: rule-based metrics correlate poorly with true performance (0.09 on VQAv2), while model-based eval has reproducibility issues (updates in GPT-4o versions constantly increase scores by 6% on MMVet).

2️⃣ To address this challenge, we propose AutoConverter, an agentic framework that automatically converts open-ended VQA to multiple-choice questions. It generates distractors matching/exceeding human difficulty, with only 3% of generated questions incorrect.

3️⃣ Using AutoConverter, we built VMCBench: 9,018 multiple-choice questions from 20 datasets testing 33 VLMs in a unified format!

🎯 Our goal: Make VLM evaluation more reliable, efficient & scalable

https://t.co/BSnOW8QmYr

Joint work with a really fantastic team: @hhhhh2033528 (co-lead) @leoliuym @XiaohanWang96 @jmhb0 @elaine__sui @ChenyuW64562111 @AkliluJosiah2 @Ale9806_ @anjiangw advised by @lschmidt3 @yeung_levy!

153

33K

edchene retweeted

Liana @lianapatel_

over 1 year ago

We've been building LOTUS at Stanford and Berkeley to make LLM-powered data processing fast, easy and declarative. LOTUS is an open-source query engine that makes programming as easy as writing Pandas and optimizes your programs for up to 400x speedups. To celebrate the holidays, we're excited to share our release of LOTUS 1.0.0 with a batch of new updates that make reasoning over your data faster, easier and better than ever! Code: https://t.co/qQVJ6Vg6fi 🧵👇

lianapatel_'s tweet photo. We've been building LOTUS at Stanford and Berkeley to make LLM-powered data processing fast, easy and declarative.

LOTUS is an open-source query engine that makes programming as easy as writing Pandas and optimizes your programs for up to 400x speedups.

To celebrate the holidays, we're excited to share our release of LOTUS 1.0.0 with a batch of new updates that make reasoning over your data faster, easier and better than ever!

Code: https://t.co/qQVJ6Vg6fi

🧵👇

183

160K

edchene retweeted

Luke Bailey

@LukeBailey181

over 1 year ago

Can interpretability help defend LLMs? We find we can reshape activations while preserving a model’s behavior. This lets us attack latent-space defenses, from SAEs and probes to Circuit Breakers. We can attack so precisely that we make a harmfulness probe output this QR code. 🧵

368

218

59K

edchene retweeted

Yuhui Zhang

@Zhang_Yu_hui

over 1 year ago

🤔 Why are VLMs (even GPT-4V) worse at image classification than CLIP, despite using CLIP as their vision encoder? Presenting VLMClassifier at #NeurIPS2024: ⏰ Dec 11 (Wed), 11:00-14:00 📍 East Hall #3710 Key findings: 1️⃣ VLMs dramatically underperform CLIP (>20% gap) 2️⃣ After testing 6 hypotheses, we found it's not architecture or training objective—it's lack of alignment data 3️⃣ Solution: adding classification data makes VLMs SOTA classifiers + improves their general capabilities! 🔗 https://t.co/dRVxwvaupk Joint work w/ @AlyssaUnell, @XiaohanWang96, Dhruba Ghosh, Yuchang Su, @lschmidt3, @yeung_levy at @StanfordAILab!

Zhang_Yu_hui's tweet photo. 🤔 Why are VLMs (even GPT-4V) worse at image classification than CLIP, despite using CLIP as their vision encoder?

Presenting VLMClassifier at #NeurIPS2024:
⏰ Dec 11 (Wed), 11:00-14:00
📍 East Hall #3710

Key findings:
1️⃣ VLMs dramatically underperform CLIP (>20% gap)
2️⃣ After testing 6 hypotheses, we found it's not architecture or training objective—it's lack of alignment data
3️⃣ Solution: adding classification data makes VLMs SOTA classifiers + improves their general capabilities!

🔗 https://t.co/dRVxwvaupk

Joint work w/ @AlyssaUnell, @XiaohanWang96, Dhruba Ghosh, Yuchang Su, @lschmidt3, @yeung_levy at @StanfordAILab!

13K

edchene retweeted

Nicole Meister @nicole__meister

over 1 year ago

Prior work has used LLMs to simulate survey responses, yet their ability to match the distribution of views remains uncertain. Our new paper [https://t.co/DleesiPbif] introduces a benchmark to evaluate how distributionally aligned LLMs are with human opinions. 🧵

nicole__meister's tweet photo. Prior work has used LLMs to simulate survey responses, yet their ability to match the distribution of views remains uncertain.

Our new paper [https://t.co/DleesiPbif] introduces a benchmark to evaluate how distributionally aligned LLMs are with human opinions.

🧵 https://t.co/Q2dpSpZg5Q

155

28K

edchene retweeted

Rose @rose_e_wang

over 1 year ago

AI has the potential to transform real-world domains. But can AI actually improve outcomes in live interactions? We conducted the first large-scale intervention of a Human-AI Approach that has statistically significant positive learning gains w/ 900 tutors & 1,800 K12 students.

rose_e_wang's tweet photo. AI has the potential to transform real-world domains. But can AI actually improve outcomes in live interactions?

We conducted the first large-scale intervention of a Human-AI Approach that has statistically significant positive learning gains w/ 900 tutors & 1,800 K12 students.

365

212

81K

edchene retweeted

Liana @lianapatel_

almost 2 years ago

Want to answer NL questions over your data? Introducing Table Augmented Generation (TAG)! Joint work w/ the amazing @matei_zaharia @guestrin @profjoeyg @_asimbiswal @sid_jha1 @AmogKamsetty @LynnLiu41887950 📚 Paper: https://t.co/VJZPz1PUpT 🛠️ Code: https://t.co/PO5sd8bgaQ 🧵

lianapatel_'s tweet photo. Want to answer NL questions over your data?

Introducing Table Augmented Generation (TAG)!

Joint work w/ the amazing @matei_zaharia @guestrin @profjoeyg @_asimbiswal @sid_jha1 @AmogKamsetty @LynnLiu41887950

📚 Paper: https://t.co/VJZPz1PUpT
🛠️ Code: https://t.co/PO5sd8bgaQ

🧵 https://t.co/9mCJIyLRUq

198

175

27K

edchene retweeted

Aaron Lou

@aaron_lou

over 2 years ago

Announcing Score Entropy Discrete Diffusion (SEDD) w/ @chenlin_meng @StefanoErmon. SEDD challenges the autoregressive language paradigm, beating GPT-2 on perplexity and quality! Arxiv: https://t.co/EA9FpO1ieo Code: https://t.co/0J3kKtHTgO Blog: https://t.co/pCsDYBy1Dw 🧵1/n

680

131

393

164K

Edward Chen

@edchene

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users