Stephen Bates @stats_stephen - Twitter Profile

Pinned Tweet

over 4 years ago

📰 Excited to share our new work on risk control in prediction! Multiple testing leads to practical calibration algorithms with PAC guarantees for any statistical error rate. Works with any model + data distribution! https://t.co/OCCQUdZsCi #Statistics #MachineLearning

Anastasios Nikolas Angelopoulos

@ml_angelopoulos

over 4 years ago

Thrilled to share Learn then Test, a tool to calibrate any model to control risk (eg. IOU, recall in object detection). No assns on model/data. See arXiv https://t.co/kql7BfyMFb + Colab https://t.co/tZnpo2l6mn ✍️w/@stats_stephen, E.J. Candes, M.I. Jordan, @lihua_lei_stat! 🧵1/n

3

86

22

23

0

3

51

12

17

0

Stephen Bates @stats_stephen

2 months ago

Announcing the Statistical Frameworks for Uncertainty in Agentic Systems workshop at ICML '26!

Mahmoud Hegazy @oumatheu

2 months ago

Excited that our ICML 2026 workshop Statistical Frameworks for Uncertainty in Agentic Systems got accepted 🎉 @icmlconf #icml2026 We want to bring together people thinking about uncertainty and agentic systems.

oumatheu's tweet photo. Excited that our ICML 2026 workshop Statistical Frameworks for Uncertainty in Agentic Systems got accepted 🎉 @icmlconf #icml2026

We want to bring together people thinking about uncertainty and agentic systems. https://t.co/kdYqmz2PMk

1

37

4

24

12K

0

18

1

9

3K

stats_stephen retweeted

Cai Zhou

@zhuci19

2 months ago

(1/5) Modern reasoning systems rely on test-time scaling: CoT, self-consistency, MCTS... But two challenges remain: 1️⃣ Confidence signals shift across tasks/prompts 2️⃣ Stopping decisions are typically static and heuristic We ask: Can we adapt confidence within each reasoning trajectory — while still preserving statistical guarantees? Calibrating LLM reasoning in test-time scaling is not new. But what if calibration itself could adapt online — at test time — to the specific reasoning trajectory of each instance? Our new paper proposes a Test-Time Training framework for calibrating generalizable LLM reasoning, enabling instance-level adaptation with distribution-level robustness. Paper: https://t.co/FtCD6gIZcN

2

147

24

131

17K

stats_stephen retweeted

Anastasios Nikolas Angelopoulos

@ml_angelopoulos

3 months ago

Today I'm sharing a preprint on conformal risk control for non-monotonic losses, a paper three years in the making. The key idea: validity of conformal can be reframed as a consequence of algorithmic stability. Therefore, any stable algorithm inherits a conformal guarantee. 🧵

ml_angelopoulos's tweet photo. Today I'm sharing a preprint on conformal risk control for non-monotonic losses, a paper three years in the making.

The key idea: validity of conformal can be reframed as a consequence of algorithmic stability. Therefore, any stable algorithm inherits a conformal guarantee.

🧵 https://t.co/d9wDTy8Hkw

3

96

13

25

9K

Who to follow

Anastasios Nikolas Angelopoulos

@ml_angelopoulos

Measuring intelligence @arena. Statistics, model evaluation. Formerly @Berkeley_EECS, @stanford_ee, student researcher @GoogleDeepMind.

Lihua Lei

@lihua_lei_stat

Assistant Professor at @StanfordGSB

Valeriy M., PhD, MBA, CQF

@predict_addict

Stephen Bates @stats_stephen

6 months ago

Postdoc opportunity — If you do ML/stat/applied math/… and want to work at the frontier of biology , come join us! 🤖 🧬

Eric and Wendy Schmidt Center @Schmidt_Center

7 months ago

Interested in pursuing #machinelearning, #appliedmathematics, #statistics, or #computationalresearch to work on biomedical problems at the @broadinstitute? Apply to become a @Schmidt_Center postdoctoral associate: https://t.co/VXqP62c19w

Schmidt_Center's tweet photo. Interested in pursuing #machinelearning, #appliedmathematics, #statistics, or #computationalresearch to work on biomedical problems at the @broadinstitute? Apply to become a @Schmidt_Center postdoctoral associate: https://t.co/VXqP62c19w https://t.co/tMEsBQypZj

0

9

6

4

3K

0

8

0

2

1K

stats_stephen retweeted

Eric and Wendy Schmidt Center @Schmidt_Center

7 months ago

Interested in pursuing #machinelearning, #appliedmathematics, #statistics, or #computationalresearch to work on biomedical problems at the @broadinstitute? Apply to become a @Schmidt_Center postdoctoral associate: https://t.co/VXqP62c19w

0

9

6

4

3K

stats_stephen retweeted

Eric and Wendy Schmidt Center @Schmidt_Center

6 months ago

🎉 Our new machine learning challenge – Obesity ML Competition: Tackling Metabolic Diseases – is officially open! Register, watch our introduction videos and lecture series, and begin coding today: https://t.co/h59QRwlJdG @broadinstitute @crunchDAO

Schmidt_Center's tweet photo. 🎉 Our new machine learning challenge – Obesity ML Competition: Tackling Metabolic Diseases – is officially open! Register, watch our introduction videos and lecture series, and begin coding today: https://t.co/h59QRwlJdG

@broadinstitute @crunchDAO https://t.co/FgfRKucYHt

0

27

8

6

3K

Stephen Bates @stats_stephen

6 months ago

Exciting research internship!

Clara Fannjiang @clara_fannjiang

6 months ago

we're hiring a Ph.D. intern! join us @genentech in South San Francisco for a summer advancing ML & statistical approaches for clinical trial design & analysis 📉💊DMs are open, feel free to reach out! 🔗https://t.co/4LRO9UkpnW

1

170

31

99

29K

1

17

1

5

6K

stats_stephen retweeted

Clara Fannjiang @clara_fannjiang

6 months ago

we're hiring a Ph.D. intern! join us @genentech in South San Francisco for a summer advancing ML & statistical approaches for clinical trial design & analysis 📉💊DMs are open, feel free to reach out! 🔗https://t.co/4LRO9UkpnW

1

170

31

99

29K

stats_stephen retweeted

Edgar Dobriban @EdgarDobriban

6 months ago

I wrote a review paper about statistical methods in generative AI; specifically, about using statistical tools along with genAI models for making AI more reliable, for evaluation, etc. See here: https://t.co/0aq8hJqXzo! I have identified four main areas where statistical thinking can be helpful. These are just a subset of what is out there; other topics have been well-covered in other reviews. 1. Designing "statistical wrappers" around a model, for instance, changing behavior of a trained model (e.g., abstaining), where a score, e.g., an "unsafety score" is too high. The key connection to statistics is to use the quantiles of the loss (on a calibration set) to set the critical threshold, thus enabling conformal-type high probability guarantees. 2. Closely related, methods for uncertainty quantification, which enable the model to express uncertainty in an answer. A crucial component here is "calibration", whereby the uncertainty is required to reflect reality. 3. Statistical methods for AI evaluation: Specifically, tools for statistical inference (e.g., confidence intervals) on model performance. Exciting recent work proposes careful statistical models for leveraging a very small high-quality dataset, possibly combined with much larger low-quality datasets, for accurate evaluation. 4. Experiment design and interventions. Careful AI experiments to understand and steer models may require interventions such as modifying experimental settings in a controlled manner. This brings up connections to classical experimental design in statistics. This connection has largely remained implicit so far, and my review aims to make it more explicit; hoping that experimental design principles will become useful here. This review references the work of many, including @HamedSHassani @obastani @tatsu_hashimoto @yuekai_sun @CsabaSzepesvari @ml_angelopoulos @stats_stephen @yaniv_romano @yaringal @KilianQW @_onionesque +their teams, and some work that I was also involved in. Hopefully, my review will be helpful to orient yourself in this exciting area. Nonetheless, since the area is rapidly expanding, it is possible that I missed important references. Please feel free to let me know of anything that I should add/change!

EdgarDobriban's tweet photo. I wrote a review paper about statistical methods in generative AI; specifically, about using statistical tools along with genAI models for making AI more reliable, for evaluation, etc. See here: https://t.co/0aq8hJqXzo!

I have identified four main areas where statistical thinking can be helpful. These are just a subset of what is out there; other topics have been well-covered in other reviews.

1. Designing "statistical wrappers" around a model, for instance, changing behavior of a trained model (e.g., abstaining), where a score, e.g., an "unsafety score" is too high. The key connection to statistics is to use the quantiles of the loss (on a calibration set) to set the critical threshold, thus enabling conformal-type high probability guarantees.

2. Closely related, methods for uncertainty quantification, which enable the model to express uncertainty in an answer. A crucial component here is "calibration", whereby the uncertainty is required to reflect reality.

3. Statistical methods for AI evaluation: Specifically, tools for statistical inference (e.g., confidence intervals) on model performance. Exciting recent work proposes careful statistical models for leveraging a very small high-quality dataset, possibly combined with much larger low-quality datasets, for accurate evaluation.

4. Experiment design and interventions. Careful AI experiments to understand and steer models may require interventions such as modifying experimental settings in a controlled manner. This brings up connections to classical experimental design in statistics. This connection has largely remained implicit so far, and my review aims to make it more explicit; hoping that experimental design principles will become useful here.

This review references the work of many, including @HamedSHassani @obastani @tatsu_hashimoto @yuekai_sun @CsabaSzepesvari @ml_angelopoulos @stats_stephen @yaniv_romano @yaringal @KilianQW @_onionesque +their teams, and some work that I was also involved in.

Hopefully, my review will be helpful to orient yourself in this exciting area. Nonetheless, since the area is rapidly expanding, it is possible that I missed important references. Please feel free to let me know of anything that I should add/change!

11

484

97

375

34K

stats_stephen retweeted

Aaron Roth @Aaroth

7 months ago

If you work at the intersection of CS and economics (or think your work is of interest to those who do!) consider submitting to the ESIF Economics and AI+ML meeting this summer at Cornell: https://t.co/ZpCrofc8Fn

2

126

36

70

17K

stats_stephen retweeted

Cai Zhou

@zhuci19

8 months ago

(1/5) Beyond Next-Token Prediction, introducing Next Semantic Scale Prediction! Our @NeurIPSConf NeurIPS 2025 paper HDLM is out! Check out the new language modeling paradigm: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models. It largely generalizes Masked Diffusion Models (MDM), and provides the progressively denoising capability for each token in the semantic level. Minimal computation overheads, much better results! arxiv: https://t.co/CwGqnUptzX code: https://t.co/asiDuxKw8w

zhuci19's tweet photo. (1/5) Beyond Next-Token Prediction, introducing Next Semantic Scale Prediction! Our @NeurIPSConf NeurIPS 2025 paper HDLM is out! Check out the new language modeling paradigm: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models.

It largely generalizes Masked Diffusion Models (MDM), and provides the progressively denoising capability for each token in the semantic level. Minimal computation overheads, much better results!

arxiv: https://t.co/CwGqnUptzX

code: https://t.co/asiDuxKw8w

7

343

57

218

50K

stats_stephen retweeted

Sherrie Wang @sherwang

9 months ago

Happy to share that our paper on how to obtain reliable statistical inferences from satellite-based maps is now published in Remote Sensing of Environment!

sherwang's tweet photo. Happy to share that our paper on how to obtain reliable statistical inferences from satellite-based maps is now published in Remote Sensing of Environment! https://t.co/ZlCYCyuiUk

12

640

90

257

24K

stats_stephen retweeted

U.S. National Science Foundation

@NSF

12 months ago

Today, NSF announced an add’l 500 NSF Graduate Research Fellowship Program awardees for the 2025-2026 cohort, bringing the total to approx 1,500. #NSFGRFP supports grad students as they pursue their dreams, build STEM skills, & become the next generation of innovators & leaders.

NSF's tweet photo. Today, NSF announced an add’l 500 NSF Graduate Research Fellowship Program awardees for the 2025-2026 cohort, bringing the total to approx 1,500. #NSFGRFP supports grad students as they pursue their dreams, build STEM skills, & become the next generation of innovators & leaders. https://t.co/kZI2fkF8XI

15

742

130

42

79K

stats_stephen retweeted

Jessica Hullman @JessicaHullman

about 1 year ago

📢If you're interested in conformal prediction, algorithms w/predictions, robust stats & connections between them from a theory perspective, join us for a workshop at #COLT2025 in Lyon 🇫🇷 June 30! Submit a poster description by May 25, more here: https://t.co/gXa88zx53F

0

37

8

11

5K

stats_stephen retweeted

Massachusetts Institute of Technology (MIT)

@MIT

about 1 year ago

Imagine a world without MIT.

55

632

148

135

80K

Stephen Bates @stats_stephen

about 1 year ago

@Micro_Yunha @MIT @MITBiology @MITEECS @MIT_SCC Welcome Yunha!

0

1

0

230

stats_stephen retweeted

Sharon Li

@SharonYixuanLi

about 1 year ago

Our paper notifications are out! Congratulations to the authors and look forward to an exciting lineup of discussions. Stay tuned for more details! #ICLR2025

2

139

14

35

20K

stats_stephen retweeted

COPSS @COPSSNews

about 1 year ago

🙌🎉Our 2025 recipient of the COPSS Presidents' Award, is Lester Mackey! This award is given annually to a young member of the statistical community in recognition of outstanding contributions to the profession of statistics.

COPSSNews's tweet photo. 🙌🎉Our 2025 recipient of the COPSS Presidents' Award, is Lester Mackey! This award is given annually to a young member of the statistical community in recognition of outstanding contributions to the profession of statistics. https://t.co/iCOTiKvmyn

8

120

22

7

25K

Stephen Bates @stats_stephen

about 1 year ago

@COPSSNews @LesterMackey Congratulations @LesterMackey! Wonderful news :)

1

0

414

stats_stephen retweeted

Sherrie Wang @sherwang

over 1 year ago

📢 We are hiring a postdoc to work on remote sensing of soil carbon and land degradation! 🌱🗺️ The position will be hosted by the Earth Intelligence Lab & @mitenergy, with an earliest start date of April 2025. To apply: https://t.co/Hx0U91DL3x

1

87

36

24

9K

Stephen Bates

@stats_stephen

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users