Yanjun Han @yanjun_han - Twitter Profile

CDS Asst. Prof. Yanjun Han (@yanjun_han) and colleagues at NYU and MIT explain why transformers trained on synthetic data excel at empirical Bayes (EB) problems. By using universal priors, these models adapt to new data through posterior contraction. https://t.co/qpsp65Z9Wf

yanjun_han retweeted

Paata Ivanisvili

@PI010101

3 months ago

Just posted a preprint on arXiv (with P. Durcik, J. Roos, X. Xie) settling the Kahn–Park conjecture on the Hamming cube: https://t.co/cKYXvZc2lu I first learned about the problem through Gil Kalai’s (@GilKalaiblog) blog: https://t.co/L6Og6nYt3f (I should also add that I was asking about a related question, "Sharp L1 Poincare inequality for boolean functions", 8-9 years ago https://t.co/e359tw5x70 and the preprint now solves both of them; the two problems do not imply each other, but they are connected). In addition, it confirms the low-noise limit for balanced functions predicted by the Hellinger conjecture on noisy Boolean channels in information theory. The paper shows C_{11b}=0.5 in our GitHub repo: https://t.co/xF9TsWmkf9 and answers my recent experimental AI challenge: https://t.co/SfBpybe57q (For the record: I received many interesting submissions -- all incorrect, often due to surprisingly simple mistakes. I still don’t know whether there is a proof avoiding the argument in our paper.) But AI did correctly identify the bottleneck that one needs to show C_{11b}=0.5, which is (now was) an open problem. Finally, many thanks to @AIMathematics -- this project began at our SQuaREs program in San Jose supported by AIM. Without AIM, this would not have happened 🙏

Yanjun Han @yanjun_han

3 months ago

RT @zhun_deng: 🔥 The key isn’t more compute — it’s better allocation. Test-time self-consistency improves reasoning by sampling multiple t…

Yanjun Han @yanjun_han

4 months ago

@GuanyangW Yes! Both rely on the classical idea of posterior contraction

Yanjun Han @yanjun_han

4 months ago

New paper alert: https://t.co/WrwNSmdZSp Why do pretrained transformers succeed at empirical Bayes? Rather than analyzing architecture or training dynamics, we ask a statistical question: how can a fixed training prior perform well under arbitrary test distributions? [1/2]

Yanjun Han @yanjun_han

4 months ago

Our answer: Universal priors exist, just because of the classical phenomenon of posterior contraction! A pretrained estimator under such priors adapts to different test distributions and generalizes to different lengths. Comments welcome! [2/2]

Yanjun Han @yanjun_han

4 months ago

@weijie444 Huge congrats and well deserved!

Yanjun Han @yanjun_han

9 months ago

@weijie444 Thanks Weijie!

Yanjun Han @yanjun_han

9 months ago

A key technical challenge is to show a quantitative mean-field approximation of the best permutation-invariant decision rule by a simple rule. We managed to apply the tools we developed last year https://t.co/n6cYf5vhLo to give tight results in Gaussian and Poisson models! [3/3]

Yanjun Han @yanjun_han

9 months ago

New preprint out: https://t.co/YGGrdf1uTd! For the good old problem of distribution estimation, we use empirical Bayes and nonparametric MLE, two cornerstones of 20th century statistics, to propose a new, efficient, parameter-free, and competitively optimal estimator. [1/3]

Yanjun Han @yanjun_han

9 months ago

Technically, we resolved a decade-old competitive gap in https://t.co/PqGswPis7F, an award-winning paper in NeurIPS 2015. Yihong Wu brought this question to me years ago, and we are so happy to solve it along with two amazing collaborators Jon Niles-Weed and Yandi Shen! [2/3]

yanjun_han retweeted

NYU Center for Data Science

@NYUDataScience

over 1 year ago

The Fall 2025 CDS PhD Program application is now open! Apply now: https://t.co/68c3Dzlyv2 Information on our Fall 2025 PhD Admissions Information Sessions is coming soon! #datascience #ai #artificialintelligence #machinelearning

yanjun_han retweeted

Edward Kennedy @edwardhkennedy

almost 2 years ago

there are surprisingly many open problems when it comes to theory/methods in causal inference. check out this talk by Siva Balakrishnan for an excellent & comprehensive summary of the state of the art: https://t.co/BixhF4jImP https://t.co/pVIslnRatf

Yanjun Han @yanjun_han

almost 2 years ago

I haven’t enjoyed the mathematics in a paper this much in a long time: https://t.co/n6cYf5vhLo Summary: an example of performing method-of-moment type analysis for high-dimensional mixtures. Joint work with my amazing colleague Jonathan Niles-Weed.

Yanjun Han @yanjun_han

about 3 years ago

Excited for the new journey!

NYU Center for Data Science

@NYUDataScience

about 3 years ago

Yanjun Han (@yanjun_han) will be joining CDS this fall as an Assistant Professor of Mathematics and Data Science. Read about Yanjun and his work on the mathematics of data science on the CDS blog! https://t.co/sO94YltryB

Yanjun Han @yanjun_han

over 3 years ago

@kc_shineth Just apply directly and mention your research interests & my name!

Yanjun Han @yanjun_han

over 3 years ago

Apply and work with me if you are interested in the math of data!

NYU Center for Data Science

@NYUDataScience

over 3 years ago

Applications for the NYU Data Science PhD program are now open! To apply/find more information, please visit our PhD Admissions page: https://t.co/W6tktEoICE. We're excited to welcome the next cohort of leading researchers in data science! #datascience

yanjun_han retweeted

Andrea Montanari @Andrea__M

over 4 years ago

The Nobel prize to Giorgio Parisi is such a joy and a multiply well deserved recognition: In random order: (1) Stochastic quantization; (2) The KPZ equation; (3) Matrix models; (4) Mean field spin glasses (!); (5) Random constraint satisfaction problems. 1/2
