Taotao Tan @doubletaotan - Twitter Profile

5 months ago

.@HGGAdvances' latest article provides additional evidence that germline HLA-II heterozygosity is inversely associated w/ lung cancer risk: https://t.co/urR6dUjMDM #ASHG @doubleTaoTan

GeneticsSociety's tweet photo. .@HGGAdvances' latest article provides additional evidence that germline HLA-II heterozygosity is inversely associated w/ lung cancer risk: https://t.co/urR6dUjMDM #ASHG @doubleTaoTan https://t.co/OF7Y8pFl0G

0

3

0

585

Taotao Tan @doubleTaoTan

7 months ago

@DuaneJRich Ah, indeed! Thanks, DJ!

0

1

0

16

Taotao Tan @doubleTaoTan

7 months ago

@karpathy @TheVixhal 🤣🤣

0

16

Taotao Tan @doubleTaoTan

about 1 year ago

@nntaleb Most of the exercises are designed for Gaussian probabilistics tho

0

50

Who to follow

Charleston Chiang

@CharlestonCWKC

Associate Professor @usc_cge @uscpphs @qcb_usc. Genetic epidemiology & population genetics. A city, mountain, candy, dance & drosophila gene.

Luke O'Connor

@Luke0connor

Human genetics and applied mathematics - genetic architecture and genealogical algorithms - Assistant Professor @HarvardDBMI

Chandranil Das

@ChandranilDas5

Pursuing MBBS at NRSMCH KOLKATA

Taotao Tan @doubleTaoTan

over 1 year ago

Functional genomicists and statistical geneticists live in two different worlds

0

1

0

111

Taotao Tan @doubleTaoTan

over 1 year ago

@minouye271 Very nice work! Is it worth adding some convolution layers to account for LD? At least it should have some benefits of reducing # of parameters

0

13

doubleTaoTan retweeted

Ruchir Rastogi @rrastogi02

over 1 year ago

Happy to share new work with @anikethjreddy, where we aim to improve Enformer's gene expression predictions on personal genomes. https://t.co/wqfLxgO8wM

1

79

18

37

23K

Taotao Tan @doubleTaoTan

over 1 year ago

@A_A_Zaidi Great point! I should fix this soon

0

108

Taotao Tan @doubleTaoTan

over 1 year ago

Just want to share some personal notes on several stats-gen topics: https://t.co/tNHH9fJcLL

0

16

3

15

3K

Taotao Tan @doubleTaoTan

over 1 year ago

@SashaGusevPosts I still find that "assuming each variant explains a tiny bit of variance" is quite clever. This way, the standard error for each marker is the same (condition on true effect size), which is 1/sqrt(N). By using iterated expectation, the expected chi2 can be worked out

doubleTaoTan's tweet photo. @SashaGusevPosts I still find that "assuming each variant explains a tiny bit of variance" is quite clever. This way, the standard error for each marker is the same (condition on true effect size), which is 1/sqrt(N). By using iterated expectation, the expected chi2 can be worked out https://t.co/Tt6Hg0Znss

1

0

673

Taotao Tan @doubleTaoTan

over 1 year ago

@epigenci Yep, was it XGboost or something

0

59

Taotao Tan @doubleTaoTan

over 1 year ago

@nmancuso_ @KaiYuan1990 That’s great context. In 60s where genotyping is unavailable, this method can still be used for predicting traits. All we need is to keep track of the pedigree. Quite brilliant idea!

0

1

0

135

Taotao Tan @doubleTaoTan

over 1 year ago

Is it possible to predict PRS using the genetic relationship matrix (GRM)? The GRM can be viewed as applying a linear kernel to the standardized genotype. Therefore, techniques such as kernel regression can be used. Moreover, one can modify GRM such that pairwise interactions

1

7

1

3

2K

Taotao Tan @doubleTaoTan

over 1 year ago

@KaiYuan1990 far as I can see, is the reduced dimensionality. Further, we can be creative in designing the kernel, even to incorporate interaction terms. This can be done by using the polynomial kernel, for example. The dimensionality will remain unchanged.

1

0

114

Taotao Tan @doubleTaoTan

over 1 year ago

@KaiYuan1990 Just some raw thoughts: PRS is modeling the joint effect size for each genetic marker, which is very high-dimensional. The kernel method says this linear model is identical to using a similarity matrix K (or GRM) as the predictor. This is called "duality". The benefit, as

doubleTaoTan's tweet photo. @KaiYuan1990 Just some raw thoughts:
PRS is modeling the joint effect size for each genetic marker, which is very high-dimensional. The kernel method says this linear model is identical to using a similarity matrix K (or GRM) as the predictor. This is called "duality". The benefit, as https://t.co/s7rlmC6JCS

1

0

114

Taotao Tan @doubleTaoTan

over 1 year ago

LD is also computed in a block-by-block manner, which means if two variants are far away, their correlation is ignored. Would these ignored correlation, or LD, cumulatively play a role in determining the joint effect size?

0

143

Taotao Tan @doubleTaoTan

over 1 year ago

A lot of stat-gen applications used a matched LD panel and summary statistics as a substitution for individual-level genotype and phenotype. e.g. the joint effect size \beta can be estimated with (X'X)^{-1} X' y, which equals D^{-1} \beta_{GWAS}. The D matrix, or the LD 1/n

1

0

413

Taotao Tan @doubleTaoTan

over 1 year ago

panel, is often obtained from the 1000 Genome project, assuming the populations are similar enough. But how much variability would be induced by using an external LD matrix? I mean, we are estimating the entire LD matrix with only a few hundred samples...

1

0

223

Taotao Tan

@doubleTaoTan

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users