Rahul Dhodapkar

8 months ago

Exciting to see our collaboration with @Google highlighted here — using AI to generate and test new biological hypotheses!

13

292

16

24

33K

🥇 The #1 website for AI tools. ➡️ Submit your AI tool: https://t.co/5cnnp2uURj 🚀 Sponsor us: https://t.co/We1agMQ9UO

8 months ago

So proud to be a part of this groundbreaking effort - just the beginning of many discoveries, and new ways to improve health for us all

Sundar Pichai

@sundarpichai

8 months ago

An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells. With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.

537

22K

3K

4K

7M

0

6

0

292

rahuldhodapkar retweeted

Sundar Pichai

@sundarpichai

8 months ago

An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells. With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.

537

22K

3K

4K

7M

Who to follow

There's An AI For That

@theresanaiforit

🕸️Dr.T, PhD

@chydorina

PhD in Biology (not an MD). Cognitive health and chronic disease recovery.

Yale Professor | Founder & CEO @CellTypeInc (YC W26) | AI + Drug Discovery

rahuldhodapkar retweeted

almost 2 years ago

🚀 Beyond excited to announce our release of the #Cell2Sentence (C2S) API and new foundation models! 🎉 Our C2S API makes it incredibly easy to convert #singlecell data into cell sentences, perform inference with LLM-based C2S models, fine-tune them, and convert cell sentences back into expression data—all in one seamless workflow. 🧬 We're releasing powerful new 410M parameter models designed for diverse tasks, including cell type prediction, cell generation, cell annotation, and cell embedding! 🌟 But there’s more: We provide the first foundation model that can encode multiple cells in context, opening up completely new possibilities in single-cell analysis! 🦄 Check out our tutorials to get started, explore the models on Hugging Face, and read the manuscript for more details. We can’t wait to see the innovative applications the community will dream up with these new tools. Stay tuned—more updates are on the way! 🔗 https://t.co/Lpk9flF2UF 📝 https://t.co/cbjFB5TmJr 🤗 https://t.co/MJ95K1r7Fo

david_van_dijk's tweet photo. 🚀 Beyond excited to announce our release of the #Cell2Sentence (C2S) API and new foundation models! 🎉 Our C2S API makes it incredibly easy to convert #singlecell data into cell sentences, perform inference with LLM-based C2S models, fine-tune them, and convert cell sentences back into expression data—all in one seamless workflow. 🧬

We're releasing powerful new 410M parameter models designed for diverse tasks, including cell type prediction, cell generation, cell annotation, and cell embedding! 🌟

But there’s more: We provide the first foundation model that can encode multiple cells in context, opening up completely new possibilities in single-cell analysis! 🦄

Check out our tutorials to get started, explore the models on Hugging Face, and read the manuscript for more details. We can’t wait to see the innovative applications the community will dream up with these new tools. Stay tuned—more updates are on the way!

🔗 https://t.co/Lpk9flF2UF
📝 https://t.co/cbjFB5TmJr
🤗 https://t.co/MJ95K1r7Fo

7

244

62

129

38K

over 2 years ago

Excited to share this work - a new way to apply foundation models to graph structured data. Please reach out if interested in bringing any of these techniques to your data or use case!

over 2 years ago

💡 Want to leverage the power of foundation models in graphs? 🔥 Introducing Foundation-Informed Message-Passing (FIMP), a framework for applying any pre-trained transformer-based foundation model to Graph Neural Networks! https://t.co/JEJTNV90L9

david_van_dijk's tweet photo. 💡 Want to leverage the power of foundation models in graphs?

🔥 Introducing Foundation-Informed Message-Passing (FIMP), a framework for applying any pre-trained transformer-based foundation model to Graph Neural Networks!

https://t.co/JEJTNV90L9 https://t.co/18LtO01geH

3

116

33

63

15K

0

3

0

511

over 2 years ago

Proud to be a part of this fantastic effort!

over 2 years ago

Delighted to share our latest work on #longCOVID - sex differences in symptoms and immune signatures. Led by @SilvaJ_C @taka_takehiro @wood_jamie_1 et al. with @LeyingGuan & @PutrinoLab. We find a striking inverse correlation btw testosterone levels and symptom burden👇🏼 (1/) https://t.co/XUGHcnVJKJ

53

2K

870

1K

537K

2

43

2

5

13K

rahuldhodapkar retweeted

over 2 years ago

Delighted to share our latest work on #longCOVID - sex differences in symptoms and immune signatures. Led by @SilvaJ_C @taka_takehiro @wood_jamie_1 et al. with @LeyingGuan & @PutrinoLab. We find a striking inverse correlation btw testosterone levels and symptom burden👇🏼 (1/) https://t.co/XUGHcnVJKJ

53

2K

870

1K

537K

over 2 years ago

@ylecun @yaroslavvb The problem with this assertion is that there are many other places where information can be encoded in the zygote beyond germline sequence - e.g. the physical orientation of DNA within the nucleus, subcellular sequestration of premade proteins etc. These are >>8MB

0

20

over 2 years ago

It's been over a year now since I first proposed cell2sentence (https://t.co/WDVEbEsumq) - a universal framework that allows *any LLM* to interface with single cell data. Now, together with @david_van_dijk and some incredibly talented students, I'm excited to share major progress

over 2 years ago

Major Cell2Sentence update 🎉🔬! We’ve been thrilled to see the attention Cell2Sentence has received from the single-cell community. Now, we’re excited to release our first update of Cell2Sentence (C2S) - a framework to leverage LLMs to train foundational single-cell models, directly in text. What’s new & out: Updated preprint with latest results https://t.co/cbjFB5TmJr First full cell model available on the HuggingFace hub https://t.co/3kcQzUo7Tm Updated codebase for data transformation & training https://t.co/E8VaXmgYWf We now fine-tune language models to generate entire cells, predict combinatorial cell labels, and generate textual data insights directly from cell sentences. We train GPT-2 and Pythia models on a large multi-tissue dataset containing 36M cells from @cellxgene as well as an immune tissue dataset containing 270k cells. C2S LMs achieve SOTA performance in single-cell data generation. C2S models trained for combinatorial label prediction settings excel in low-data regimes, outperforming single-cell foundation model baselines. We also show that C2S models benefit from natural language pre-training and always outperform models trained from scratch on cell sentences. C2S provides a straightforward approach to adapting LLMs for single-cell data analysis, leveraging their natural language capabilities to generate and derive insights from single cells. We are convinced that C2S’ approach of integrating data modalities through text is the way forward for single-cell foundation models, from representing multi-omics data to generating clinical insights, all in a human readable format. We’re excited to start building a community around Cell2Sentence! If you also think that C2S will be the framework for single-cell foundation models, and are interested in contributing, reach out to us! We welcome any collaborations and discussions. Huge thanks to our collaborator @aminkarbasi and the C2S team (@danielflevine, @sachalevy3, @SyedARizvi5688, @nazreenpm, Xingyu Chen, @dzhang03, @GhadermarziSina, Ruiming Wu, Ivan Vrkic, Anna Zhong, Daphne Raskin, Insu Han, @aho_fonseca, @josueortc) for their hard work on C2S! Special thanks to @rahuldhodapkar, who co-supervises this project.

3

179

43

102

51K

0

5

0

1

542

Nature Methods @naturemethods

over 2 years ago

Some very cool insights here into the intersection between human labeling and other distance-based "unsupervised" approaches to classification! Exciting work!

Maria Brbic @mariabrbic

over 2 years ago

How to infer human labelling of a given dataset in a model-agnostic way? Check our new method HUME accepted at @NeurIPSConf as #spotlight!🌟 HUME provides a new view to tackle unsupervised learning. Kudos to my fantastic PhD student @artygadetsky! Paper https://t.co/ILNk2mJQm0

mariabrbic's tweet photo. How to infer human labelling of a given dataset in a model-agnostic way?

Check our new method HUME accepted at @NeurIPSConf as #spotlight!🌟 HUME provides a new view to tackle unsupervised learning.

Kudos to my fantastic PhD student @artygadetsky!

Paper https://t.co/ILNk2mJQm0 https://t.co/hVAZ6nz5Qs

1

102

21

46

24K

0

1

0

339

rahuldhodapkar retweeted

over 2 years ago

CINEMA-OT, developed by @david_van_dijk, @mingze7316 and colleagues is a causal-inference based method for analyzing the effects of single cell perturbation experiments. @ishizukalab, @ellenfoxman, @rahuldhodapkar, @aho_fonseca https://t.co/5p0bOHvwfT

naturemethods's tweet photo. CINEMA-OT, developed by @david_van_dijk, @mingze7316 and colleagues is a causal-inference based method for analyzing the effects of single cell perturbation experiments. @ishizukalab, @ellenfoxman, @rahuldhodapkar, @aho_fonseca

https://t.co/5p0bOHvwfT https://t.co/ZHVEQLUCcQ

0

52

16

13

18K

over 2 years ago

Happy to share this collaboration with @david_van_dijk @ishizukalab @EllenFoxman - a new causal method to infer perturbational effects with single cell resolution! Amazing work by @Mingze7316

over 2 years ago

Thrilled to announce that CINEMA-OT is now published at Nature Methods! https://t.co/i7lx2Zz9bc

4

220

62

47

43K

1

7

0

1

2K

over 2 years ago

Extremely excited to share our work on #LongCovid, now out in #Nature! I'm honored to be part of an amazing team contributing to our knowledge of a disease affecting so many lives worldwide. Very clear that this disease has *objectively measurable* immune characteristics.

over 2 years ago

So pleased to report that our Mount Sinai-Yale long COVID (MY-LC) paper with @putrinolab & others is now published!! Proud of the hard work of all who contributed. We found biological signatures that can distinguish people with vs. without #longCOVID (1/) https://t.co/t8ARWBKLsQ

123

5K

2K

1M

0

71

12

7

14K

almost 3 years ago

Very proud to share this collaboration with @david_van_dijk and team, where we show a new fundamental approach that allows language-pretrained LLMs to be used *without architectural modifications* to learn from #singlecell data. Please check it out!

almost 3 years ago

Single Cells as text? We developed Cell2Sentence, a method that allows training of Large Language Models on single-cell data! https://t.co/IGc9TFXcTM With @danielflevine @SyedARizvi5688 @sachalevy3 @rahuldhodapkar @YaleSEAS @YaleMed #AI #ML #NLP #genomics #CompBio #singlecell

david_van_dijk's tweet photo. Single Cells as text? We developed Cell2Sentence, a method that allows training of Large Language Models on single-cell data!
https://t.co/IGc9TFXcTM
With @danielflevine @SyedARizvi5688 @sachalevy3 @rahuldhodapkar
@YaleSEAS @YaleMed #AI #ML #NLP #genomics #CompBio #singlecell https://t.co/eNHlkE3GyD

3

307

86

117

55K

0

17

0

3

4K

rahuldhodapkar retweeted

Madhav Dhodapkar @MadhavDhodapkar

almost 3 years ago

Introducing BrainLM 🧠🤖the first foundation model for #fMRI analysis trained on 6,700 hours of brain activity data! Fine-tune for specialized tasks or leverage zero-shot inference capabilities! @WuTsaiYale @YaleCompsci @YaleCBB @YaleMed https://t.co/MUobqXULfb

david_van_dijk's tweet photo. Introducing BrainLM 🧠🤖the first foundation model for #fMRI analysis trained on 6,700 hours of brain activity data! Fine-tune for specialized tasks or leverage zero-shot inference capabilities!
@WuTsaiYale @YaleCompsci @YaleCBB @YaleMed
https://t.co/MUobqXULfb https://t.co/t0BWtsuk3k

6

352

99

133

97K

rahuldhodapkar retweeted

almost 3 years ago

Then and NowHarnessing Immunity in Myeloma: Wine That Keeps Getting Better? https://t.co/HyjiOcnBeW

2

20

10

4

6K

rahuldhodapkar retweeted

about 3 years ago

A new study in @SciImmunology led by @AnisBarmada & Jon Klein @YaleIBIO with @lucasite_lab @InciYildirim11 @YalePediatrics teams explored immune signatures of people who developed myocarditis after mRNA vaccines. Here is what we found. 🧵 (1/) https://t.co/HpWvWxGeQy

29

669

250

187

238K

rahuldhodapkar retweeted

Rahul Satija @satijalab

about 3 years ago

We are excited to release Seurat v5- with new methods for multimodal, spatially resolved, and massively scalable single-cell analysis. https://t.co/7BMGF7x1wV

satijalab's tweet photo. We are excited to release Seurat v5- with new methods for multimodal, spatially resolved, and massively scalable single-cell analysis. https://t.co/7BMGF7x1wV https://t.co/kUC4GOnYm0

7

1K

269

114

116K

about 3 years ago

Perhaps this is a good way to avoid the bias of fixating on the genes we already "know" and the processes we are already familiar with!

0

1

0

118

about 3 years ago

I've been playing around with using #ChatGPT to help think about and process differential expression gene lists and found that very simple prompts are able to do reasonably well in generating high-level overviews of known gene functions, just pasting from #Seurat `FindMarkers`

rahuldhodapkar's tweet photo. I've been playing around with using #ChatGPT to help think about and process differential expression gene lists and found that very simple prompts are able to do reasonably well in generating high-level overviews of known gene functions, just pasting from #Seurat `FindMarkers` https://t.co/D9NqnNrh7R

1

3

0

1

467