Shaoshi Zhang @ZShaoshi - Twitter Profile

Pinned Tweet

14 days ago

For years, we've known that running a standard t-test on cross-validation folds violates sample independence. We wanted to see how widespread this issue actually is. The result? 97% of the studies used an invalid statistical test. 🧵👇

Thomas Yeo @bttyeo

14 days ago

In a meta-analysis of 210 biomedical AI studies that statistically compared models under cross-validation, 97% used invalid statistical tests. Here's our new preprint https://t.co/OG58Vkeu49 led by @tianchuzeng @kkli20111 @ZShaoshi @ten_photos 1/N

bttyeo's tweet photo. In a meta-analysis of 210 biomedical AI studies that statistically compared models under cross-validation, 97% used invalid statistical tests.

Here's our new preprint https://t.co/OG58Vkeu49 led by @tianchuzeng @kkli20111 @ZShaoshi @ten_photos 1/N https://t.co/PEnqXQxcsJ

7

277

96

168

116K

1

11

5

0

3K

ZShaoshi retweeted

Eric Topol

@EricTopol

6 days ago

The p-tau217 breakthrough blood test replicated again, predicting Alzheimer's disease in a large cohort mean age 61. The cover of the new issue is telling @TheLancet https://t.co/Qre6mCkpMV

EricTopol's tweet photo. The p-tau217 breakthrough blood test replicated again, predicting Alzheimer's disease in a large cohort mean age 61. The cover of the new issue is telling @TheLancet https://t.co/Qre6mCkpMV https://t.co/ke8SULhdvI

18

873

269

285

86K

ZShaoshi retweeted

Tal Golan @TalGolanNeuro

13 days ago

This looks like a straightforward, highly applicable solution to the long-standing problem of valid inference for K-fold CV performance differences. The trade-off is smaller training sets from the split-half step and having to rerun K-fold CV many times.

2

33

6

15

4K

ZShaoshi retweeted

Imaging Neuroscience @ImagingNeurosci

13 days ago

New paper in Imaging Neuroscience by Ru Kong, B.T. Thomas Yeo, et al: Network-based near-scalp personalized brain stimulation targets https://t.co/86oCkw42cf

ImagingNeurosci's tweet photo. New paper in Imaging Neuroscience by Ru Kong, B.T. Thomas Yeo, et al:

Network-based near-scalp personalized brain stimulation targets

https://t.co/86oCkw42cf https://t.co/BbxExpdvBa

0

15

6

7

2K

Who to follow

Ruby Kong

@rubykong92

Neuroscience & machine learning

Ting Xu

@TingsterX

brain/evolution/development/math/arts/cartoon/surfing @ChildMindInst, @ChildMindRnD

Ke Xie

@KeXie20

PhD candidate in MICA Lab @BorisBernhardt @TheNeuro_MNI @McGill. Neuroimaging | Brain Networks | Brain Dynamics | Connectomics | Epilepsy

ZShaoshi retweeted

Thomas Yeo @bttyeo

13 days ago

Here's bonus slides on cross-validation tests, separate from our preprint. Covering: 1. paired (sign-flip) permutation test 2. label-swap permutation test 3. sample-level vs fold-averaged stats 4. a common misapplication of the corrected t-test 5. three bootstrap variants 1/N

1

43

25

36

7K

ZShaoshi retweeted

Francisco Pereira @fpereira

13 days ago

@bttyeo @tianchuzeng @kkli20111 @ZShaoshi @ten_photos This is fantastic! I'm glad to have something to point people to in reviews beyond Demšar, 2006 (and Benavoli 2017 for the Bayesian perspective).

1

5

2

1

527

ZShaoshi retweeted

Shan Siddiqi @shansiddiqi.bsky.social @shansiddiqi

13 days ago

Apparently I was doing cross-validation wrong. Thanks @bttyeo and @ZShaoshi for helping us fix it.

0

15

3

7

4K

ZShaoshi retweeted

Gary Marcus, MIT PhD and NYU Professor Emeritus

@GaryMarcus

14 days ago

Biomedical AI may be headed for a replication crisis. (This work below is not about AI-generated reports; it’s about studies of biomedicine that use ML in their methods, and how they are evaluted.)

9

53

4

23

11K

ZShaoshi retweeted

Jake Vogel @_JakeVogel_

14 days ago

Omg I've been commenting about this in manuscript reviews for years. Thank goodness there's actually a paper to cite now!! Thanks @bttyeo !

2

18

3

4

5K

ZShaoshi retweeted

Rajan Kashyap @Rajankashya

14 days ago

Eye opener 👀

0

5

3

4

882

ZShaoshi retweeted

Lijun AN | 安丽军 @anlijuncn

14 days ago

Proud to participate in this study! We should keep rigorous in AI-Biomedical research, we also observe some concerning trends in AI+biomarker studies… Congratulations @tianchuzeng Tian Fang and @ZShaoshi

1

12

5

2

2K

ZShaoshi retweeted

Juan (Helen) Zhou @HelenJuanZhou

14 days ago

Important work. Worth to take a look if you are doing AI in biomedical research.

0

17

5

4

2K

ZShaoshi retweeted

Thomas Yeo @bttyeo

14 days ago

Once again, @ten_photos came to the rescue - we prayed to him for a better statistical test for k-shot learning (since the corrected t-test is overly conservative in that scenario), and he answered our prayers with a new test that also covers classical cross-validation.

1

17

10

12

3K

ZShaoshi retweeted

Sina Mansour L. @Sina_Mansour_L

14 days ago

@bttyeo @tianchuzeng @kkli20111 @ZShaoshi @ten_photos Can't stress this enough 👇 If you use ML to compare predictive models in your research (neuroscience, genetics, you name it), this paper is a must read! 👀 The majority of work in this space (mine included 🙋) misses critical nuances when reporting comparative stats.

0

8

2

944

ZShaoshi retweeted

Dhurandhar B @bornspectator42

14 days ago

My quibble: This is traditional ML *not* AI in the generative sense it means now till eternity. But yeah this is a thing. Metric chasing brought this on. Reviewers reward higher metric values & not well cross-validated results. We've been told AuC<0.8 not worth submitting. 🙄

1

3

1

667

ZShaoshi retweeted

Crémieux

@cremieuxrecueil

14 days ago

Oh my god, almost no biomedical AI papers had proper validations. This feels like field-wide malpractice.

7

383

28

135

53K

ZShaoshi retweeted

Tianchu @tianchuzeng

14 days ago

So glad this is finally public. Grateful to my wonderful co-authors for the long journey.

0

9

5

2

1K

Shaoshi Zhang @ZShaoshi

14 days ago

It’s incredible to see this study come to fruition! Shout out to the amazing @tianchuzeng and @kkli20111 who spearheaded this work and huge thank you to all other coauthors!

0

2

0

82

Shaoshi Zhang @ZShaoshi

14 days ago

For years, we've known that running a standard t-test on cross-validation folds violates sample independence. We wanted to see how widespread this issue actually is. The result? 97% of the studies used an invalid statistical test. 🧵👇

Thomas Yeo @bttyeo

14 days ago

In a meta-analysis of 210 biomedical AI studies that statistically compared models under cross-validation, 97% used invalid statistical tests. Here's our new preprint https://t.co/OG58Vkeu49 led by @tianchuzeng @kkli20111 @ZShaoshi @ten_photos 1/N

7

277

96

168

116K

1

11

5

0

3K

ZShaoshi retweeted

Hesheng Liu

@hesheng3

16 days ago

Lesion network mapping (LNM) has been powerful in linking symptoms and brain functional circuits, but ongoing debates highlight that it is still hard to isolate symptom-specific effects. We came up with a new method, robust LNM (rLNM) — a unified framework combining null models and selective specificity to reveal reliable, symptom-specific networks from background structure. https://t.co/6WHpBRNuQn @bttyeo @foxmdphd @ndosenbach @club_scan

hesheng3's tweet photo. Lesion network mapping (LNM) has been powerful in linking symptoms and brain functional circuits, but ongoing debates highlight that it is still hard to isolate symptom-specific effects. We came up with a new method, robust LNM (rLNM) — a unified framework combining null models and selective specificity to reveal reliable, symptom-specific networks from background structure. https://t.co/6WHpBRNuQn
@bttyeo @foxmdphd @ndosenbach @club_scan

5

117

43

54

21K

ZShaoshi retweeted

Nico Dosenbach @ndosenbach

about 1 month ago

Function & cytoarchitecture don't overlap ... they're orthogonal. Prefrontal cortex is tiled with chains of functional patches mostly known from face processing. Multi-modal parcellations are wrong ... & other insights hidden by group-averaging fMRI data: https://t.co/WEo2Cf9N26

ndosenbach's tweet photo. Function & cytoarchitecture don't overlap ... they're orthogonal. Prefrontal cortex is tiled with chains of functional patches mostly known from face processing. Multi-modal parcellations are wrong ... & other insights hidden by group-averaging fMRI data: https://t.co/WEo2Cf9N26 https://t.co/lJHznCYutL

2

105

35

62

13K

Shaoshi Zhang

@ZShaoshi

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users