Mingze Dong @mingze7316 - Twitter Profile

Stanford AI+Biomedicine Seminar @Stanford_AI_Bio

5 months ago

We are excited to have @Mingze7316 presenting “Stack: In-context learning of single cell biology” tomorrow!

📍CoDa E160 | 2/3 2:30pm | Stanford + Zoom https://t.co/cvXDNSL5q1

Mingze Dong @Mingze7316

5 months ago

@maxkreuzz @abhinadduri @yusufroohani @davey_burke @dhrvji So it's inevitable given current bio data. But since biological foundation models only need to capture biology (not general reasoning), we may not need comparable scale for it to be useful. Plus, we believe we've set up the right framework for scaling as more data comes. (2/2)

Mingze Dong @Mingze7316

5 months ago

@maxkreuzz @abhinadduri @yusufroohani @davey_burke @dhrvji Thanks! You're right about the scale gap. All current human single-cell data is ~10¹⁰ tokens in Stack, while modern LLMs train on >10¹³. Our model scales accordingly: 10⁸ to 10⁹ params vs >10¹¹ to 10¹² for SOTA LLMs, about 10⁶× smaller in total, hence the resource difference. (1/2)
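
The 10⁶× figure can be reproduced with a quick back-of-the-envelope check. The sketch below assumes that "smaller in total" refers to the product of parameter count and training tokens, a common rough proxy for training compute; the tweet does not state this formula, and the function name is purely illustrative.

```python
from math import log10

def scale_gap_orders(params_small, tokens_small, params_large, tokens_large):
    """Orders of magnitude separating (params * tokens) of two models.

    Assumption: params * tokens is used as a rough proxy for total scale.
    """
    small = log10(params_small) + log10(tokens_small)
    large = log10(params_large) + log10(tokens_large)
    return large - small

# Lower and upper ends of the ranges quoted in the tweet.
low_end = scale_gap_orders(1e8, 1e10, 1e11, 1e13)
high_end = scale_gap_orders(1e9, 1e10, 1e12, 1e13)
print(round(low_end), round(high_end))
```

Both ends of the quoted ranges give a gap of about six orders of magnitude: roughly 10³ from the data and 10³ from the parameter count.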

Mingze7316 retweeted

Arc Institute @arcinstitute

5 months ago

Predicting cell state in previously unseen conditions such as disease or in response to a drug has typically required retraining for each new biological context. Today, Arc is releasing Stack, a foundation model that learns to simulate cell state under novel conditions directly at inference time, no fine-tuning required.

Mingze Dong @Mingze7316

5 months ago

Super proud to present Stack — a foundation model that brings in-context learning to leverage and engineer cellular contexts, through innovations grounded in single-cell biology. Huge thanks to @yusufroohani @abhinadduri @dhrvji @davey_burke and Arc team! A great summary below:

Yusuf Roohani @yusufroohani

5 months ago

Why define conditions, donors or even *tasks* when we can just use cells themselves to guide model output

Presenting Stack, in-context learning using just cells!

Use cell context -> enhance its embedding
Engineer cell context -> modify its state

Led by the brilliant @Mingze7316 https://t.co/4MzMrmP4xK

Mingze Dong @Mingze7316

7 months ago

Open to DMs / chats about AI for science and academic job opportunities! See my previous work on theoretically grounded single-cell and spatial omics AI models: https://t.co/NOkX8R23NX — with more to come.

Mingze Dong @Mingze7316

7 months ago

I’ll be at #NeurIPS from Wed–Sun presenting our work https://t.co/DC8E7ZwSBf !
We build a high-dim linear model that explains all kinds of phenomena in mask-based pretraining, and from this framework propose R²MAE that improves pretraining across language, DNA, and single-cell. https://t.co/LlDA1q2MMb

Mingze Dong @Mingze7316

about 1 year ago

Thrilled to share that our work is featured by @NatureComms as an editor's highlight in Computational and Theoretical Biology! Check the link below: https://t.co/z2y7IJpcp9

Mingze Dong @Mingze7316

about 1 year ago

Out in @NatureComms! We tackle a core challenge in spatial omics: reliably disentangling spatial interactions from intrinsic cell properties, which requires identifiability. We built an identifiable deep learning framework, SIMVI (with proofs!), to solve this: https://t.co/JnJnDRQnIF

Mingze Dong @Mingze7316

about 1 year ago

By identifiability, SIMVI uniquely enables inference of “spatial effects” at a single-cell level, empowering biological discoveries. Please refer to our manuscript (and the 44-page SI) for more details and applications. Many thanks for the support @YaleCBB @RongFan8 @Klugerlab!

Mingze Dong @Mingze7316

about 1 year ago

@Ella_Maru Thanks for the question! Short answer: Yes. If the lineage is space-independent, intrinsic variation would capture it and disentangle it from the niche; if space-dependent, our relevant case study (Fig. 5) shows SIMVI can reveal spatially dependent states and differentiate them from niches.

Mingze Dong @Mingze7316

over 1 year ago

Many thanks to all co-authors whose contributions make this work possible! Please check our manuscript for details and more results: https://t.co/Y0sUV0ewoN N/N.

Mingze Dong @Mingze7316

over 1 year ago

Thrilled to share our preprint: https://t.co/Y0sUV0f4el. Long story short: we found a way (scShift) to leverage massive single-cell atlases to build powerful zero-shot biological state extractors. Its performance scales with dataset diversity after an “emergence threshold”. 1/N

Mingze Dong @Mingze7316

over 1 year ago

Summary: scShift demonstrates 4 important properties for next-generation single-cell models: 1) zero-shot, 2) disentanglement, 3) scaling, and 4) unsupervised. It facilitates analyses of biological states at all levels. The novel idea may lead to various future extensions. 11/N
