Sergio Charles

Verified account

@eigentopology

co-founder @thesis_labs (yc f25) | automating ai research | prev. @google, @nvidia, @stanford

San Francisco, CA

Joined November 2018

500 Following

489 Followers

787 Posts

Pinned Tweet

about 2 months ago

@thesis_labs gives you an AI researcher that runs experiments for you: train models, track runs, and iterate on results autonomously at scale. Get $100 in free GPU credits by signing up at https://t.co/sC0x5KaskV and commenting how you'd use Thesis. We're onboarding 20 research teams this month. Reply if you want in. #AIResearch #MachineLearning #AI #AutoResearch

19

79

10

47

30K

7 days ago

Human intelligence emerged from scarcity, not abundance. ASI will follow the same path.

0

0

0

0

19

8 days ago

@AdaFang_ This is great! A delegation of auto scientists seems like the way to invent 'novel' ideas.

0

0

0

0

345

9 days ago

Just the start

9 days ago

Today we're announcing ESMFold2, an open scientific engine to power prediction, design, and discovery across protein biology. The new model delivers state of the art performance on protein interactions, especially antibodies, a critical modality for therapeutics. We have designed and validated miniprotein binders and single chain antibodies across five therapeutic targets that are important in cancer and immunology. We are seeing very high success rates, and affinities at levels consistent with therapeutic activity. We’re also releasing an atlas of 6.8 billion proteins, and 1.1 billion predicted structures. ESMFold2 is built on a state of the art language model that has been trained on billions of protein sequences. A world model of protein biology emerges through language modeling. We’ve used the techniques of mechanistic interpretability developed to understand large language models to understand the concepts ESM uses to represent proteins. The model’s representation space has a compositional organization of features across scales, levels of complexity, and abstraction, that reflects and mirrors the understanding of protein biology developed through a century of empirical science. This understanding emerges without prior knowledge, just from language modeling of protein sequences. Language models are becoming a powerful substrate to understand and program biology. The design of protein interactions is one of the most fundamental problems in biophysics, and has critical implications for the discovery of new medicines. A simple gradient based search with the model was able to discover high-affinity protein binders. I'm excited by the potential this has to accelerate basic science and the understanding of proteins. And especially for the new avenues it opens up for therapeutic design and medicine.

74

2K

446

706

592K

0

0

0

0

69

Who to follow

friends and family

Verified account

@friendsandfam_

A home for makers and founders @ Stanford

Verified account

@benchmark // @modal @warpdotdev @stanford

eigentopology retweeted

@Tiberiu_Musat_

9 days ago

Why does deep learning generalize? What does weight decay really do? Can algorithmic information theory address these questions? In my latest preprint, I give a proof that the minimum neural weight norm matches the minimum program length (aka Kolmogorov Complexity), up to a logarithmic factor. In other words, the neural network with the smallest possible weight norm (that fits the data) must encode the shortest program (that fits the data). The result only holds for fixed-precision neural nets: infinite precision nets can store infinite information with finite (small) weights. https://t.co/eMZIGQDf2f

Tiberiu_Musat_'s tweet photo. Why does deep learning generalize? What does weight decay really do? Can algorithmic information theory address these questions?

In my latest preprint, I give a proof that the minimum neural weight norm matches the minimum program length (aka Kolmogorov Complexity), up to a logarithmic factor. In other words, the neural network with the smallest possible weight norm (that fits the data) must encode the shortest program (that fits the data).

The result only holds for fixed-precision neural nets: infinite precision nets can store infinite information with finite (small) weights.

https://t.co/eMZIGQDf2f

29

1K

152

1K

143K

9 days ago

@wenhaocha1 @karpathy Amazing!

0

0

0

0

19

10 days ago

@Andy_ShuoYang @CShorten30 Really cool! What was your process for speeding up the operators?

1

3

0

1

1K

eigentopology retweeted

14 days ago

Liftoff of Starship!

1K

32K

6K

2K

2M

eigentopology retweeted

16 days ago

If this is not obvious enough... > Mar 25, Sakana’s AI Scientist makes Nature > May 13, RSI raises $650M to build self-improving AI > May 13, Adaption ships AutoScientist for model training > May 19, Karpathy joins Anthropic pretraining > May 19, Nature drops 3 AI-scientist papers in one day The next frontier is not bigger models. It’s AI researchers building AI researchers.

10

214

16

108

17K

16 days ago

@itsEmZee_ Absolutely! This is what we’re building towards at @thesis_labs

0

1

0

0

918

16 days ago

@jparkjmc @sama +1 @sama we need tokens to automate ai research

0

0

0

0

203

22 days ago

@HenryL_AI @Recursive_SI Thesis automated AI research and you can use it today! https://t.co/fnAF3UR6aH

1

1

0

0

187

22 days ago

@hallerite @thesis_labs

0

0

0

0

20

22 days ago

@DimitrisPapail @braindersnn This is the future of research. Where do you write your methods and analysis section?

0

0

0

0

788

23 days ago

@eliebakouch When @thesis_labs adds slack

0

0

0

0

40

26 days ago

@gpusteve I prefer manually moving around electrons myself

1

3

2

2

35

26 days ago

@harshbhatt7585 What are your takeaways? They look quite similar upon first inspection

0

0

0

0

290

27 days ago

@ScienceMagazine incredible.

0

0

0

0

37

28 days ago

I'd think bio could get away with less data than language because of stronger priors: SE(3), conservation, geometry, physics. But AF2 -> AF3 dropping IPA kind of went against the idea of inductive biases leading to better sample efficiency. Imo the real difficulty is that biology is multi-scale: atoms -> proteins -> cells -> organisms.

1

3

0

0

400

28 days ago

@rishabh16_ SE(3) is the best Lie group

0

0

0

0

34

Last Seen Users on Sotwe

Trends for you

Most Popular Users