Jake Silberg @JakeSilberg - Twitter Profile

17 days ago

🚀 Today, we’re excited to introduce SimpleTES for scaling the scientific discovery loop. 🧵 I always ask myself: what are we actually scaling in scientific discovery? Most LLM discovery methods focus on test-time scaling generation — more tokens, more agents, more turns. But science advances through the evaluation-driven loops: propose → evaluate → refine → repeat. SimleTES captures this idea, discovering SOTA solutions across 21 scientific problems! Key discoveries: 🏎️ 2.17x faster lasso solver than glmnet — the gold-standard LASSO solver, engineered for decades. ⚛️ 24.5% fewer quantum routing overhead on IBM Q20 — superior than previous standard library LightSABRE. 📐 0.380868 on Erdős Minimum Overlap — outperforming previous solutions from mixed-frontier ensembles or humans. 🧬 0.74 on Tabula Muris (scRNA-seq denoising) — new SOTA, generalizing to unseen tissue types without retraining. #LLM #AI4Science #ScalingLaws #SimpleTES #MachineLearning

haotian_yeee's tweet photo. 🚀 Today, we’re excited to introduce SimpleTES for scaling the scientific discovery loop.

🧵 I always ask myself: what are we actually scaling in scientific discovery?

Most LLM discovery methods focus on test-time scaling generation — more tokens, more agents, more turns.
But science advances through the evaluation-driven loops: propose → evaluate → refine → repeat.

SimleTES captures this idea, discovering SOTA solutions across 21 scientific problems!

Key discoveries:
🏎️ 2.17x faster lasso solver than glmnet — the gold-standard LASSO solver, engineered for decades.
⚛️ 24.5% fewer quantum routing overhead on IBM Q20 — superior than previous standard library LightSABRE.
📐 0.380868 on Erdős Minimum Overlap — outperforming previous solutions from mixed-frontier ensembles or humans.
🧬 0.74 on Tabula Muris (scRNA-seq denoising) — new SOTA, generalizing to unseen tissue types without retraining.

#LLM #AI4Science #ScalingLaws #SimpleTES #MachineLearning

10

150

43

94

56K

JakeSilberg retweeted

Rahul Thapa @connect_thapa

17 days ago

In AI for scientific discovery, the bottleneck isn't always generation — it's quite often evaluation. How do you design evaluators close to gold? Prevent reward hacking? And critically, how do you scale the evaluation-driven loop to reach genuinely novel discoveries?

1

8

2

1K

JakeSilberg retweeted

Martin Pacesa @MartinPacesa

about 1 month ago

Extremely excited about the results of @adaptyvbio RBX1 binder design competition! 𝑩𝒊𝒏𝒅𝑪𝒓𝒂𝒇𝒕2 performed very well, with 3 out of 7 designs binding to the disordered tail. Overall, only 9 binders worked out of 322 tested, 2.8% hit rate! Proud of the BC2 team ♥️

MartinPacesa's tweet photo. Extremely excited about the results of @adaptyvbio RBX1 binder design competition! 𝑩𝒊𝒏𝒅𝑪𝒓𝒂𝒇𝒕2 performed very well, with 3 out of 7 designs binding to the disordered tail. Overall, only 9 binders worked out of 322 tested, 2.8% hit rate! Proud of the BC2 team ♥️ https://t.co/1FGURwEYHy

9

218

38

55

12K

JakeSilberg retweeted

Kyle Swanson @KyleWSwanson

about 1 month ago

SyntheMol-RL has now been published! SyntheMol-RL is a reinforcement learning model for synthesizable small molecule drug design. We used it to design antibiotic candidates for the bacteria S. aureus with hits validated in vitro and in vivo in mice. 1/6 https://t.co/SuCMHik2tB

KyleWSwanson's tweet photo. SyntheMol-RL has now been published! SyntheMol-RL is a reinforcement learning model for synthesizable small molecule drug design. We used it to design antibiotic candidates for the bacteria S. aureus with hits validated in vitro and in vivo in mice. 1/6 https://t.co/SuCMHik2tB https://t.co/Ci03ocfhzt

5

81

15

24

41K

Who to follow

NO FINANCIAL ADVISE send your donations, I will use 💸 for a dog 🐶 shelter in 🇹🇷 https://t.co/OSeQ21mEfk

Jake Silberg @JakeSilberg

about 1 month ago

@zhang_ouyang Congrats!! Really great work

1

0

67

JakeSilberg retweeted

Haotian Ye

@haotian_yeee

2 months ago

Finally getting to share one of my favorite projects. ICLR Oral! 🏆 It’s so strange how rigid video tokenization is. Think about it: why should a still landscape cost the same amount of tokens as a busy street? We built InfoTok. We went back to basics with Shannon’s information theory to make tokens "adaptive" in a principled way. Its 2.3x better compression and 11x faster inference demonstrates the magic of the old-school theory ✨ Check it out: https://t.co/0PeYtaVY1y

10

295

43

168

49K

Jake Silberg @JakeSilberg

3 months ago

@bookclubpodhq @dcsandbrook This episode was fantastic. I would've been happy for it go on a whole second hour!

0

2

0

46

JakeSilberg retweeted

Nitya Thakkar @nityathakkar_

3 months ago

Excited to share that our paper has been published in Nature Machine Intelligence! We conducted a randomized controlled trial at ICLR 2025 with 20,000+ reviews to test whether LLM feedback improves peer review quality. Link: https://t.co/ioXpqRJEyN

3

115

24

36

34K

Jake Silberg @JakeSilberg

4 months ago

@arpitrage Doesn't upzoning square the circle? Allowing 4 units on a SFH lot means each individual unit is cheaper for new buyers/renters, but the plot as a whole has higher value for the seller?

0

3

0

147

Jake Silberg @JakeSilberg

4 months ago

@DdelAlamo I have a pet theory that a GAN-style discriminator auxiliary head for post-training a diffusion model could be helpful, given some of the differences between generated and natural proteins (see the distances on page 5 https://t.co/Pxx4jEvnDd) but haven't tested this yet

0

1

0

1

31

JakeSilberg retweeted

Caleb Lareau

@CalebLareau

4 months ago

To make a long story short, we uncover dozens of regions of our genome that control whether the virus persists or is cleared quickly. Further, we show that persistent EBV may serve as a biomarker of complex diseases-- from respiratory disease to autoimmunity.

2

32

4

5

3K

Jake Silberg @JakeSilberg

4 months ago

@CalebLareau @MSKCancerCenter Congrats on the awesome work! This is a fascinating read. I see you found associations with RA and SLE. Just curious, did you look for an association with Celiac as well?

0

1

0

176

Jake Silberg @JakeSilberg

4 months ago

@pengzhangzhi1 @ShuibaiZ69721 @jarridrb @AlexanderTong7 @mmbronstein @bose_joey This is a very cool paper! Great work and congrats!

1

2

0

151

Jake Silberg @JakeSilberg

4 months ago

@NielsRogge @ericzakariasson Do you notice a pattern of when this happens? My proposal is after every compacting it should re-read its https://t.co/dVBqPIbO6j where I tell it what env to use, or something like that. I find it will randomly forget env name later into long conversations

0

1

0

293

Jake Silberg @JakeSilberg

4 months ago

@bcherny Does Claude Code re-read it's https://t.co/KhvfvudzZH (or some equivalent) after compacting? I find it might forget some odd things during a long convo (e.g., what conda env it should be using)

0

24

Jake Silberg @JakeSilberg

5 months ago

@DdelAlamo just had the same experience, lmk if you get it

1

0

203

Jake Silberg @JakeSilberg

5 months ago

@smithhenryd @HannesStaerk @nate_diamant @brianltrippe Great talk!!

0

3

1

0

2K

JakeSilberg retweeted

Haotian Ye

@haotian_yeee

6 months ago

🤔Want a principled way to RL your diffusion model? Check Data-regularized Reinforcement Learning (DDRL)! Post-train @nvidia #Cosmos World Foundation models with a million GPU hours! 🤯 Novel formulation ➡️ Theoretically integrates SFT into RL ➡️ Robust to Reward Hacking 🛑 Details: https://t.co/1A9q8ho2xb #DDRL #Diffusion #RL #NVIDIA #Cosmos

4

269

75

183

77K

Jake Silberg @JakeSilberg

7 months ago

@ludocomito Really nice visualizations, especially the walkthrough of the N and O codes!

1

0

135

Jake Silberg @JakeSilberg

7 months ago

@sedielem Another fun tweak here is intentionally biasing the training distribution, e.g., SolubleMPNN only trained on soluble proteins, so that "natural" structures passed through the model intentionally come out more soluble than the original input

0

1

0

65

Jake Silberg

@JakeSilberg

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users