Alan Aw

@youngtableaux

Postdoc @PennMedicine, via @UCBerkeley (Statistics) and @Stanford (Applied Math)

Philadelphia, PA

Joined May 2014

737 Following

255 Followers

198 Posts

Pinned Tweet

Alan Aw @youngtableaux

over 4 years ago

Excited to share our work with @spence_jeffrey_ and @yun_s_song on detecting sample exchangeability in the absence of point labels! https://t.co/1IGZyDwChK Our work is a blend of stat theory, computation and application to real data, so I’ll try to explain each of these. 1/n

3

52

17

3

0

youngtableaux retweeted

Yun S. Song @yun_s_song

9 months ago

We are excited to share GPN-Star, a cost-effective, biologically grounded genomic language modeling framework that achieves state-of-the-art performance across a wide range of variant effect prediction tasks relevant to human genetics. https://t.co/FTm3byYp67 (1/n)

yun_s_song's tweet photo. We are excited to share GPN-Star, a cost-effective, biologically grounded genomic language modeling framework that achieves state-of-the-art performance across a wide range of variant effect prediction tasks relevant to human genetics.
https://t.co/FTm3byYp67
(1/n) https://t.co/rQKHqi7c4E

17

528

162

291

86K

Alan Aw @youngtableaux

9 months ago

We are grateful to be part of a kind and supportive scientific community. Discussions with Jessica Li (@jsb_ucla), Arjun Raj (@arjunrajlab), and members of the truly singular Song Lab (@yun_s_song) all helped shape this work. Thank you --- we'll pay it forward 🙏

0

2

0

0

185

Alan Aw @youngtableaux

9 months ago

Check out our work and software, led by the incredibly meticulous and capable @fanding_zhou! This is the culmination of work-in-progress presented a while ago at the 2022 Bioconductor Conference (see https://t.co/xo2HNuu414).

Fanding Zhou @fanding_zhou

9 months ago

Gene expression changes aren’t just about mean shifts — variability shifts matter too, especially for aging. We're thrilled to introduce QRscore, a flexible non-parametric framework for detecting shifts in mean and variance across conditions. https://t.co/jfrK9JpJsE (1/7)

fanding_zhou's tweet photo. Gene expression changes aren’t just about mean shifts — variability shifts matter too, especially for aging. We're thrilled to introduce QRscore, a flexible non-parametric framework for detecting shifts in mean and variance across conditions. https://t.co/jfrK9JpJsE (1/7) https://t.co/YvHMajWnlI

1

32

7

17

6K

1

18

4

9

4K

Who to follow

Professor of EECS and Statistics @UCBerkeley. Mathematical and computational biologist. bsky handle: https://t.co/bW59tUBr8m

assistant professor, department of integrative biology at uw-madison. population genetics, human evolution. tweets are, regrettably, my own.

Gonzalo Benegas

Research Scientist at Open Athena | AI for Science

Alan Aw @youngtableaux

9 months ago

Thus with large n, we can hedge against the risk of model misspecification and maintain high statistical power. Even if we think that the distribution of the data is negative binomial. This theoretical insight underpins our method. #methodsmatter

1

1

0

0

192

youngtableaux retweeted

Yun S. Song @yun_s_song

over 1 year ago

Coincidentally, another article from my lab on DNA language models got published on the same day as GPN-MSA. It's freely available for 50 days from this link: https://t.co/q8aWPIRGz8 Genomic language models: opportunities and challenges Please share with your colleagues.

1

61

17

20

6K

youngtableaux retweeted

Biology+AI Daily @BiologyAIDaily

over 1 year ago

A DNA language model based on multispecies alignment predicts the effects of genome-wide variants @NatureBiotech 1. GPN-MSA, a DNA language model leveraging multispecies alignment (MSA), sets a new benchmark in predicting variant deleteriousness for both coding and noncoding regions of the human genome. 2. This model uses evolutionary insights from 100 vertebrate species to enhance predictions, outperforming existing DNA language models like Nucleotide Transformer and genome-wide predictors such as CADD. 3. GPN-MSA achieves superior results in classifying ClinVar, COSMIC, and OMIM variants, highlighting its utility in clinical and research contexts, including rare disease diagnosis and cancer genomics. 4. Unlike prior models, GPN-MSA is computationally efficient, requiring just 3.5 hours of training on 4 NVIDIA A100 GPUs while delivering high-accuracy predictions across 9 billion possible human variants. 5. It excels in distinguishing rare from common variants, leveraging conservation and genomic context, making it a robust tool for genome-wide variant effect prediction (VEP). 6. GPN-MSA's insights extend to functional impact predictions, capturing regulatory and epigenetic markers critical for gene expression and regulation studies. 7. The model paves the way for integrating DNA sequence modeling with functional genomics, offering potential breakthroughs in precision medicine, drug discovery, and population genetics. @gsbenegas @cralbors @youngtableaux @chengzhong_ye @yun_s_song 💻Code: https://t.co/GNDhXAVbZw 📜Paper: https://t.co/ooNtfYUBmA #GenomeVariants #DNAAnalysis #Bioinformatics #PrecisionMedicine #RareDisease #VariantPrediction

BiologyAIDaily's tweet photo. A DNA language model based on multispecies alignment predicts the effects of genome-wide variants @NatureBiotech

1. GPN-MSA, a DNA language model leveraging multispecies alignment (MSA), sets a new benchmark in predicting variant deleteriousness for both coding and noncoding regions of the human genome.

2. This model uses evolutionary insights from 100 vertebrate species to enhance predictions, outperforming existing DNA language models like Nucleotide Transformer and genome-wide predictors such as CADD.

3. GPN-MSA achieves superior results in classifying ClinVar, COSMIC, and OMIM variants, highlighting its utility in clinical and research contexts, including rare disease diagnosis and cancer genomics.

4. Unlike prior models, GPN-MSA is computationally efficient, requiring just 3.5 hours of training on 4 NVIDIA A100 GPUs while delivering high-accuracy predictions across 9 billion possible human variants.

5. It excels in distinguishing rare from common variants, leveraging conservation and genomic context, making it a robust tool for genome-wide variant effect prediction (VEP).

6. GPN-MSA's insights extend to functional impact predictions, capturing regulatory and epigenetic markers critical for gene expression and regulation studies.

7. The model paves the way for integrating DNA sequence modeling with functional genomics, offering potential breakthroughs in precision medicine, drug discovery, and population genetics.

@gsbenegas @cralbors @youngtableaux @chengzhong_ye @yun_s_song
💻Code: https://t.co/GNDhXAVbZw
📜Paper: https://t.co/ooNtfYUBmA

#GenomeVariants #DNAAnalysis #Bioinformatics #PrecisionMedicine #RareDisease #VariantPrediction

0

33

10

14

3K

youngtableaux retweeted

Yun S. Song @yun_s_song

over 1 year ago

Happy New Year! Our GPN-MSA paper is finally published, under a slightly different title from the preprint. Please check it out and share it with your colleagues. https://t.co/CKvTG2EZS2 1/4

6

116

43

42

19K

youngtableaux retweeted

Yutong V Wang 王雨桐 @yu_tong_wang

over 2 years ago

Thrilled to present at #MLCB2023, about our latest work on #SpatialTranscriptomics - ggPair: A deep learning approach to unveil novel ligand-receptor interactions in cell-cell communication. 🧬 Catch my talk at 12:50 PM PT this Friday. All the talks are live-streamed on YouTube.

1

37

3

12

11K

youngtableaux retweeted

Yun S. Song @yun_s_song

over 2 years ago

We recently posted a preprint describing GPN-MSA, a DNA language model that leverages whole-genome alignments across multiple species while taking only a few hours to train. This thread summarizes its performance on the human genome. https://t.co/QyRBRqOXAX 1/12

yun_s_song's tweet photo. We recently posted a preprint describing GPN-MSA, a DNA language model that leverages whole-genome alignments across multiple species while taking only a few hours to train. This thread summarizes its performance on the human genome.
https://t.co/QyRBRqOXAX
1/12 https://t.co/s8tkkgx3nt

7

328

93

152

94K

youngtableaux retweeted

Yun S. Song @yun_s_song

over 2 years ago

After lengthy anticipation, we finally got to compare PrimateAI-3D with our Cross-Protein Transfer (CPT) model https://t.co/Sop1kUF8u6 Below is a summary of our findings, including a comparison of ESM-1b and ESM-1v protein language models. 1/8

2

82

23

28

20K

youngtableaux retweeted

Richard Shuai @richardwshuai

almost 3 years ago

Can current genomic sequence-to-expression models explain expression variation across individuals based on their personal genome? In the Ioannidis Lab at UC Berkeley, we evaluated 4 state-of-the-art models for this (Enformer, Basenji2, ExPecto, Xpresso) https://t.co/WVpPMWQNcs

2

87

26

38

24K

youngtableaux retweeted

Yun S. Song @yun_s_song

over 3 years ago

Interested in a fast, accurate method for estimating the parameters of a complex phylogenetic model of molecular evolution (e.g. a general rate matrix describing the co-evolution of protein contact sites in 3D)? If so, please check out our preprint: https://t.co/VnuGnBjByS (1/10)

1

110

32

30

22K

youngtableaux retweeted

Yun S. Song @yun_s_song

over 3 years ago

Predicting the effects of missense variants is a central problem in human genome interpretation. We are thrilled to share our preprint on using cross-protein transfer (CPT) learning to improve zero-shot prediction of disease variant effects: https://t.co/Q4JLkLwNh8 (1/8)

yun_s_song's tweet photo. Predicting the effects of missense variants is a central problem in human genome interpretation. We are thrilled to share our preprint on using cross-protein transfer (CPT) learning to improve zero-shot prediction of disease variant effects:
https://t.co/Q4JLkLwNh8
(1/8) https://t.co/hyoeJuXkJU

3

430

122

122

0

youngtableaux retweeted

Ruchir Rastogi @rrastogi02

almost 4 years ago

Excited to share my undergraduate work on pathogenicity prediction of stopgain (nonsense) variants: https://t.co/ytnt2oKFMR

1

10

3

0

0

youngtableaux retweeted

🇲🇽 Leonardo Collado-Torres @lcolladotor

almost 4 years ago

Alan Aw @youngtableaux talked about the challenge of identifying marker genes in #scRNAseq and a new non parametric test for doing so Work in progress 🏗 at https://t.co/gU6HQq8xpn #BioC2022

0

3

2

1

0

youngtableaux retweeted

Yun S. Song @yun_s_song

over 4 years ago

Very glad to see this work by @gsbenegas published in eLife. https://t.co/Ryud4zq0Wi We analyzed alternative splicing across diverse cell types in mice using scRNA-seq data from Tabula Muris and BICCN. Many thanks to the reviewers for their thorough feedback.

1

18

5

3

0

youngtableaux retweeted

Berkeley AI Research

over 4 years ago

The BAIR REU is back! @Berkeley_AI is currently recruiting undergraduates from HBCUs and PBIs for a hands-on summer 2022 research experience in artificial intelligence. More details about eligibility and support at https://t.co/JGy4kFF4Sx

0

51

24

4

0

youngtableaux retweeted

Ryan Chung @rykchung

over 4 years ago

Really excited to share a project I’ve been working on with Ryo Yamamoto and the Sudmant Lab. Uncovering tissue-specific changes in gene expression with age. We look at age-correlated gene expression and increasing expression heterogeneity with age. https://t.co/6pDBgtAwg8

0

19

7

2

0

Last Seen Users on Sotwe

Trends for you

Most Popular Users