Fei Fang @fangf07 - Twitter Profile

fangf07 retweeted

8 days ago

A recent study found an LLM scored 95% on a healthcare benchmark. Deployed with real patients, it dropped to 34%. In our new work, we argue the problem isn't the benchmark, but the implicit assumptions buried in evaluation. Paper: https://t.co/mi445QtJvM 🧵 1/n

NaveenJRaman's tweet photo. A recent study found an LLM scored 95% on a healthcare benchmark. Deployed with real patients, it dropped to 34%.

In our new work, we argue the problem isn't the benchmark, but the implicit assumptions buried in evaluation.

Paper: https://t.co/mi445QtJvM

🧵 1/n https://t.co/96ZbfotWfd

3

25

13

2

2K

Fei Fang @fangf07

10 days ago

Excited to share this AAAI blog post on our new paper assignment algorithm used for AAAI 2026 (with Michael Cui, Chenxin Dai, and @YixuanEvenXu) and the resulting statistics. Thanks to AAAI 2026 Program Chairs Matt Taylor and Chad Jenkins, and Conference Chair @k_leyton_brown

AAAI

@RealAAAI

14 days ago

Curious about the paper assignment algorithm used for AAAI 2026? The new algorithm substantially improved the robustness of large-scale paper–reviewer assignments, eliminating clear forms of strategic behavior and increasing diversity, while retaining nearly all of the assignment quality achieved by standard methods. Read more: https://t.co/8DqCnvlIlT

0

6

0

3

2K

0

5

2

0

604

Fei Fang @fangf07

16 days ago

Check out our recent work on Humanization by Iterative Paraphrasing (HIP)! We find that commercial AI-text detectors often classify text from base LLMs as human-written, HIP leverages this observation to improve detector evasion.

YixuanEvenXu @YixuanEvenXu

16 days ago

🔗 Check out the full paper and code below: Paper: https://t.co/i9F6kVvc2B, Code: https://t.co/b5WgtTugAD. Work by @YixuanEvenXu, @fjzzq2002, @AdtRaghunathan, @fangf07, @zicokolter (6/6)

0

10

3

8

1K

0

495

fangf07 retweeted

YixuanEvenXu @YixuanEvenXu

16 days ago

🤖 AI text detectors are widely deployed in education and integrity workflows, but what are they actually tracking? We report a surprising finding: text from base models is overwhelmingly judged as human by GPTZero and Pangram. 👇 (1/6)

YixuanEvenXu's tweet photo. 🤖 AI text detectors are widely deployed in education and integrity workflows, but what are they actually tracking?

We report a surprising finding: text from base models is overwhelmingly judged as human by GPTZero and Pangram. 👇 (1/6) https://t.co/M7Mop24NtF

3

61

13

31

13K

Who to follow

Yuandong Tian

@tydsh

Co-founder of @Recursive_SI. ex-Meta FAIR Director. ex-Google. Reasoning, Optimization and Understanding LLM. Novelist in spare time. PhD in @CMU_Robotics.

Sharon Li

@SharonYixuanLi

Associate Professor @WisconsinCS. Making AI reliable for the open world. Program Chairing #ICML2026

Zhou Yu

@Zhou_Yu_AI

Founder of https://t.co/9KM4uFScMi, Associate Professor at Columbia. Making ai agent design and deployment easy, safe, and fast! Forbes 30 under 30.

Fei Fang @fangf07

about 2 months ago

Introducing our new work on interpretable AI!

Naveen Raman @NaveenJRaman

about 2 months ago

Training concept-based models relies on concept selection which is labor-intensive and slow. We introduce Decision-Relevant Selection (DRS), a principled algorithm for automatic concept selection in RL. Paper: https://t.co/TYtOJFaE4D Website: https://t.co/NOTmQLFI8q 🧵 1/n

NaveenJRaman's tweet photo. Training concept-based models relies on concept selection which is labor-intensive and slow.

We introduce Decision-Relevant Selection (DRS), a principled algorithm for automatic concept selection in RL.

Paper: https://t.co/TYtOJFaE4D
Website: https://t.co/NOTmQLFI8q

🧵 1/n https://t.co/DHtIkd7NJq

2

66

14

50

10K

0

3

0

3

682

Fei Fang @fangf07

3 months ago

Our AI Pokémon Benchmark released!

Seth Karten

@sethkarten

3 months ago

https://t.co/BxGonf2ena

18

376

48

278

78K

0

15

0

1

1K

Fei Fang @fangf07

3 months ago

Our recent work on AI for public mental health with @NaveenJRaman in collaboration with @hongshenus 's team, @viscidula team and @cspnj !

CMU School of Computer Science @SCSatCMU

3 months ago

SCS researchers have developed an AI-powered chatbot, PeerCoPilot, designed both with and specifically for people working in behavioral health. 👉 https://t.co/MxwTquMHfe

0

3

2

0

2K

0

8

1

0

940

fangf07 retweeted

CMU School of Computer Science @SCSatCMU

3 months ago

SCS researchers have developed an AI-powered chatbot, PeerCoPilot, designed both with and specifically for people working in behavioral health. 👉 https://t.co/MxwTquMHfe

0

3

2

0

2K

fangf07 retweeted

YixuanEvenXu @YixuanEvenXu

4 months ago

🧬 Distillation enables efficient emulation of LLMs, but verifying provenance remains a critical challenge. Introducing Antidistillation Fingerprinting (ADFP): A principled approach that aligns signals with student learning dynamics. 👇 (1/6)

YixuanEvenXu's tweet photo. 🧬 Distillation enables efficient emulation of LLMs, but verifying provenance remains a critical challenge.

Introducing Antidistillation Fingerprinting (ADFP): A principled approach that aligns signals with student learning dynamics. 👇 (1/6) https://t.co/rGH1mfMDHm

1

45

12

20

10K

Fei Fang @fangf07

6 months ago

At 4pm, we will have our panel discussion on AI education. Panelists include our invited speakers Serene Bioth, @eunicemjun as well as Milind Tamar @MilindTambe_AI and Leo Porter

Fei Fang @fangf07

6 months ago

As a co-chair for the NeurIPS 2025 Education Program, we are excited about the One-Day Event on AI Education which will take place tomorrow Tue 2 Dec 10am - 5pm PST at Upper Level Room 9. More details here: https://t.co/PgMYxkWJQy @adityagrover_ @NaveenJRaman

0

9

2

1

4K

0

1

0

568

Fei Fang @fangf07

6 months ago

We will have two invited talks given by Serena Booth and Eunice Jun @eunicemjun on communicating AI to non-experts.

Fei Fang @fangf07

6 months ago

As a co-chair for the NeurIPS 2025 Education Program, we are excited about the One-Day Event on AI Education which will take place tomorrow Tue 2 Dec 10am - 5pm PST at Upper Level Room 9. More details here: https://t.co/PgMYxkWJQy @adityagrover_ @NaveenJRaman

0

9

2

1

4K

0

3

1

0

1K

Fei Fang @fangf07

6 months ago

The first session at 10am tomorrow will be an interactive session led by Julien Besset on "Cut Through the Noise: How to Write an Effective Elevator Pitch". It will equip you with practical tools to translate your research into a short, effective, and accessible overview.

Fei Fang @fangf07

6 months ago

As a co-chair for the NeurIPS 2025 Education Program, we are excited about the One-Day Event on AI Education which will take place tomorrow Tue 2 Dec 10am - 5pm PST at Upper Level Room 9. More details here: https://t.co/PgMYxkWJQy @adityagrover_ @NaveenJRaman

0

9

2

1

4K

0

1

0

547

Fei Fang @fangf07

6 months ago

As a co-chair for the NeurIPS 2025 Education Program, we are excited about the One-Day Event on AI Education which will take place tomorrow Tue 2 Dec 10am - 5pm PST at Upper Level Room 9. More details here: https://t.co/PgMYxkWJQy @adityagrover_ @NaveenJRaman

0

9

2

1

4K

fangf07 retweeted

Seth Karten

@sethkarten

6 months ago

How do we close the gap between specialist RL and generalist LLM agents? We're benchmarking it in Pokémon. Join us at the PokeAgent Challenge competition workshop @ NeurIPS 2025. 📍 Dec 7, 8AM in San Diego 🎮 Track 1: Competitive Pokémon (game-theoretic reasoning) 🗺️ Track 2: Speedrunning (long-horizon planning) Speakers from Google DeepMind, NYU, CMU, UT Austin, Princeton.

sethkarten's tweet photo. How do we close the gap between specialist RL and generalist LLM agents?

We're benchmarking it in Pokémon. Join us at the PokeAgent Challenge competition workshop @ NeurIPS 2025.

📍 Dec 7, 8AM in San Diego
🎮 Track 1: Competitive Pokémon (game-theoretic reasoning)
🗺️ Track 2: Speedrunning (long-horizon planning)
Speakers from Google DeepMind, NYU, CMU, UT Austin, Princeton.

7

59

19

11

10K

Fei Fang @fangf07

7 months ago

I’m so proud of you!

Stephanie Milani

@steph_milani

7 months ago

📣 Honored to be selected as Honorable Mention for the @SCSatCMU Distinguished Dissertation Award!! Thanks to my advisor @fangf07 & committee Geoff Gordon, @hongshenus, @katjahofmann, & @OriolVinyalsML (+ other mentors and collaborators) for their support 🖤 & congrats to Juncheng, Tim, and Brian 🎉

steph_milani's tweet photo. 📣 Honored to be selected as Honorable Mention for the @SCSatCMU Distinguished Dissertation Award!!

Thanks to my advisor @fangf07 & committee Geoff Gordon, @hongshenus, @katjahofmann, & @OriolVinyalsML (+ other mentors and collaborators) for their support 🖤

& congrats to Juncheng, Tim, and Brian 🎉

9

112

5

13

12K

1

9

0

693

fangf07 retweeted

YixuanEvenXu @YixuanEvenXu

12 months ago

✨ Did you know that NOT using all generated rollouts in GRPO can boost your reasoning LLM? Meet PODS! We down-sample rollouts and train on just a fraction, delivering notable gains over vanilla GRPO. (1/7)

$YixuanEvenXu's tweet photo. ✨ Did you know that NOT using all generated rollouts in GRPO can boost your reasoning LLM? Meet PODS! We down-sample rollouts and train on just a fraction, delivering notable gains over vanilla GRPO. (1/7) https://t.co/xqjt9nILxb$

6

138

16

109

18K

fangf07 retweeted

Yi Wu @jxwuyi

about 1 year ago

We release fully async RL system AReaL-boba² for LLM & SOTA code RL w. Qwen3-14B! @Alibaba_Qwen #opensource 🚀system&algorithm co-design → 2.77x faster ✅ 69.1 on LiveCodeBench 🔥 multi-turn RL ready 🔗 Project: https://t.co/YUa03ppHBp 📄 Paper: https://t.co/DMZ7YDLt6L 1/3👇

jxwuyi's tweet photo. We release fully async RL system AReaL-boba² for LLM & SOTA code RL w. Qwen3-14B! @Alibaba_Qwen #opensource
🚀system&algorithm co-design → 2.77x faster
✅ 69.1 on LiveCodeBench
🔥 multi-turn RL ready
🔗 Project: https://t.co/YUa03ppHBp
📄 Paper: https://t.co/DMZ7YDLt6L
1/3👇 https://t.co/gzAdYoYQcr

7

153

40

73

132K

Fei Fang @fangf07

about 1 year ago

I’m so proud of you!

Stephanie Milani

@steph_milani

about 1 year ago

Another life update!! 🎉 I’m joining @JHUCompSci as an Assistant Professor starting Fall 2026! Apply to work with me on reinforcement learning, foundation models, & human-centered AI. Let’s build better AI agents 🤖🙆‍♀️🦀 Before that, I’ll join @NYU_Courant as an Assistant Professor/Faculty Fellow. Excited to spend a year in NYC!

steph_milani's tweet photo. Another life update!! 🎉

I’m joining @JHUCompSci as an Assistant Professor starting Fall 2026! Apply to work with me on reinforcement learning, foundation models, & human-centered AI. Let’s build better AI agents 🤖🙆‍♀️🦀

Before that, I’ll join @NYU_Courant as an Assistant Professor/Faculty Fellow. Excited to spend a year in NYC!

71

644

21

94

63K

1

11

0

2

2K

fangf07 retweeted

Stephanie Milani

@steph_milani

about 1 year ago

Another life update!! 🎉 I’m joining @JHUCompSci as an Assistant Professor starting Fall 2026! Apply to work with me on reinforcement learning, foundation models, & human-centered AI. Let’s build better AI agents 🤖🙆‍♀️🦀 Before that, I’ll join @NYU_Courant as an Assistant Professor/Faculty Fellow. Excited to spend a year in NYC!

71

644

21

94

63K

Fei Fang @fangf07

about 1 year ago

Excited to be at #aamas2025 ! - My keynote talk at C-MAS workshop today: 2-2:45pm, Maquette A - Will attend panel at ALA workshop today: 4:30-5:30pm, Salon 2 - Siyu Liu (PhD advised by @___tiffanyb___ ) will present our joint paper on Friday 10:45am, Salon 3

0

6

0

567

Fei Fang

@fangf07

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users