Jacob C Tanner @JacobCTanner1 - Twitter Profile

Jacob C Tanner @JacobCTanner1

9 months ago

The murder of Charlie Kirk is a tragedy. I’m deeply saddened by this. We need to learn how to talk to each other again.

0

1

0

396

Jacob C Tanner @JacobCTanner1

about 1 year ago

https://t.co/Y5742vr96b

0

32

JacobCTanner1 retweeted

The Culturist

@the_culturist_

over 1 year ago

The Lord of the Rings does not take place on an imaginary planet — it's Earth. Middle-earth is our forgotten past, before recorded history, when Eden (Valinor) was a real place. The truth of Tolkien's world will blow your mind... 🧵

the_culturist_'s tweet photo. The Lord of the Rings does not take place on an imaginary planet — it's Earth.

Middle-earth is our forgotten past, before recorded history, when Eden (Valinor) was a real place.

The truth of Tolkien's world will blow your mind... 🧵 https://t.co/FyvEgVUvCK

401

24K

3K

15K

2M

JacobCTanner1 retweeted

Core Francisco Park

@corefpark

over 1 year ago

New paper! “In-Context Learning of Representations” What happens to an LLM’s internal representations in the large context limit? We find that LLMs form “in-context representations” to match the structure of the task given in context! 1/n

20

1K

179

1K

136K

Who to follow

Fabrizio Damicelli

@fabridamicelli

Python | Machine Learning | Data Science | https://t.co/cpCC43oPbu | https://t.co/917XhGuUiY

MunjungKim

@munjung_k

UVA datascience PhD student

Joaquín Goñi

@jgonicor

Associate Professor. Head of the CONNplexity Lab. Purdue University.

JacobCTanner1 retweeted

The Culturist

@the_culturist_

over 1 year ago

The fall of Rome is widely misunderstood. It wasn't invasion, disease or famine that truly brought it to its knees. Rome collapsed because the birth rate did… (thread) 🧵

the_culturist_'s tweet photo. The fall of Rome is widely misunderstood.

It wasn't invasion, disease or famine that truly brought it to its knees.

Rome collapsed because the birth rate did… (thread) 🧵 https://t.co/IxbMVp69hv

547

16K

3K

7K

3M

JacobCTanner1 retweeted

All The Right Movies

@ATRightMovies

over 1 year ago

LOTR: THE FELLOWSHIP OF THE RING was released 23 years ago this week. An adaptation of Tolkien’s classic novel, and the first entry in Peter Jackson’s The Lord of the Rings trilogy, the story of how it was made is proof that one does not simply walk into Mordor… 1/76

ATRightMovies's tweet photo. LOTR: THE FELLOWSHIP OF THE RING was released 23 years ago this week. An adaptation of Tolkien’s classic novel, and the first entry in Peter Jackson’s The Lord of the Rings trilogy, the story of how it was made is proof that one does not simply walk into Mordor…

1/76 https://t.co/wAe3Y861Q3

89

15K

2K

4K

2M

JacobCTanner1 retweeted

Mengsen Zhang @Mengsen

over 1 year ago

Review "metastability demystified" is finally out @SpringerNature @NatRevNeurosci (https://t.co/wiBuG3ut1I), led by @FranHancock1 & @_fernando_rosas w/ contributions from me & colleagues of many distinct perspectives. We identify the converging mechanisms & common misconceptions.

Mengsen's tweet photo. Review "metastability demystified" is finally out @SpringerNature @NatRevNeurosci (https://t.co/wiBuG3ut1I), led by @FranHancock1 & @_fernando_rosas w/ contributions from me & colleagues of many distinct perspectives. We identify the converging mechanisms & common misconceptions. https://t.co/Hcg34EzDRA

0

82

23

30

5K

JacobCTanner1 retweeted

Jonathan Gorard @getjonwithit

over 1 year ago

The apparent "philosophical problems" of quantum mechanics are not unique to QM at all: they are in fact the same problems that arise whenever one attempts to construct an abstract model of reality. We can see these problems already in high school-level mechanics. (1/14)

getjonwithit's tweet photo. The apparent "philosophical problems" of quantum mechanics are not unique to QM at all: they are in fact the same problems that arise whenever one attempts to construct an abstract model of reality. We can see these problems already in high school-level mechanics. (1/14) https://t.co/fdrBtHHmhQ

111

7K

730

5K

768K

JacobCTanner1 retweeted

Stéphane Deny @StphTphsn1

over 1 year ago

Very interesting! how does it relate to this work, studying phase transitions in the dynamics of diffusion? https://t.co/vhbin1lwm3

0

143

26

82

8K

JacobCTanner1 retweeted

The Culturist

@the_culturist_

over 1 year ago

Past societies produced so much beauty because they knew that math and beauty are deeply connected. It all started when Pythagoras discovered something mind-blowing about reality: The universe is not made of matter — but music... (thread) 🧵

the_culturist_'s tweet photo. Past societies produced so much beauty because they knew that math and beauty are deeply connected.

It all started when Pythagoras discovered something mind-blowing about reality:

The universe is not made of matter — but music... (thread) 🧵 https://t.co/F04pbQmGu9

514

28K

5K

14K

2M

JacobCTanner1 retweeted

Griffiths Computational Cognitive Science Lab @cocosci_lab

over 1 year ago

(1/5) Very excited to announce the publication of Bayesian Models of Cognition: Reverse Engineering the Mind. More than a decade in the making, it's a big (600+ pages) beautiful book covering both the basics and recent work: https://t.co/5dnLpcMQzu

cocosci_lab's tweet photo. (1/5) Very excited to announce the publication of Bayesian Models of Cognition: Reverse Engineering the Mind. More than a decade in the making, it's a big (600+ pages) beautiful book covering both the basics and recent work: https://t.co/5dnLpcMQzu https://t.co/QSo91mCzcJ

20

2K

445

2K

176K

JacobCTanner1 retweeted

Franklyn Wang

@frank_liquid

over 1 year ago

Doubling o1-preview performance on ARC-AGI with one simple trick 🚀 tldr: by providing human-like representations to o1, we are able to substantially increase performance on @arcprize.

frank_liquid's tweet photo. Doubling o1-preview performance on ARC-AGI with one simple trick 🚀

tldr: by providing human-like representations to o1, we are able to substantially increase performance on @arcprize.

23

977

84

712

217K

JacobCTanner1 retweeted

James Zou @james_y_zou

over 1 year ago

📢Thrilled to introduce the #VirtualLab: a team of AI scientist agents (AI chemist, AI reviewer...). Virtual Lab is led by an AI professor w/ feedback from human scientist. The Lab created new nanobodies that we experimentally validated to bind to recent #covid variants🚀🧵

james_y_zou's tweet photo. 📢Thrilled to introduce the #VirtualLab: a team of AI scientist agents (AI chemist, AI reviewer...). Virtual Lab is led by an AI professor w/ feedback from human scientist.

The Lab created new nanobodies that we experimentally validated to bind to recent #covid variants🚀🧵

21

815

178

513

191K

JacobCTanner1 retweeted

Oliver Habryka @ohabryka

over 1 year ago

I compiled all the emails released as part of the Musk v. Altman lawsuit in chronological order (link in reply). IMO a really valuable read. Extremely consequential decisions made in these emails.

ohabryka's tweet photo. I compiled all the emails released as part of the Musk v. Altman lawsuit in chronological order (link in reply).

IMO a really valuable read. Extremely consequential decisions made in these emails. https://t.co/DhKcRQM6JN

56

3K

229

3K

869K

JacobCTanner1 retweeted

Paul Thompson

@PTenigma

over 1 year ago

If you are interested in Generative #AI, or statistical physics, you will know that you can use latent diffusion models to make synthetic images (or videos), but these methods are a bit slow (I explain here how the Fokker-Planck formulation and Langevin diffusion are related): https://t.co/x4lsJZDdhq About an hour ago some Russian scientists from Skoltech posted a quite considerable leap forward in this field, computing these maps much faster + in one step: https://t.co/92AluHBU03

PTenigma's tweet photo. If you are interested in Generative #AI, or statistical physics, you will know that you can use latent diffusion models to make synthetic images (or videos), but these methods are a bit slow (I explain here how the Fokker-Planck formulation and Langevin diffusion are related):
https://t.co/x4lsJZDdhq
About an hour ago some Russian scientists from Skoltech posted a quite considerable leap forward in this field, computing these maps much faster + in one step:
https://t.co/92AluHBU03

4

908

139

781

108K

JacobCTanner1 retweeted

nature

@Nature

over 1 year ago

“We’re very excited to see what people do with this” AlphaFold3 is open at last https://t.co/cdvACli8r5

20

2K

588

344

128K

JacobCTanner1 retweeted

Maksym Andriushchenko

@maksym_andr

over 1 year ago

🚨 So, why do we need weight decay in modern deep learning? 🚨 The camera-ready version of our NeurIPS 2024 paper is now on arXiv (a major update compared to the first version). Weight decay is traditionally viewed as a regularization method, but its effect in the overtraining regime is quite subtle and its interaction with the implicit regularization effect of SGD plays a crucial role. In the undertraining regime (e.g., in LLM pretraining), however, the effect of weight decay is totally different: it sets an implicit learning rate schedule for AdamW and enables stable training with bfloat16 precision. This explains why weight decay is still widely used for LLM training with standard optimizers, such as AdamW. This is joint work with @dngfra, @adityavardhanv, @tml_lab.

maksym_andr's tweet photo. 🚨 So, why do we need weight decay in modern deep learning? 🚨

The camera-ready version of our NeurIPS 2024 paper is now on arXiv (a major update compared to the first version).

Weight decay is traditionally viewed as a regularization method, but its effect in the overtraining regime is quite subtle and its interaction with the implicit regularization effect of SGD plays a crucial role.

In the undertraining regime (e.g., in LLM pretraining), however, the effect of weight decay is totally different: it sets an implicit learning rate schedule for AdamW and enables stable training with bfloat16 precision. This explains why weight decay is still widely used for LLM training with standard optimizers, such as AdamW.

This is joint work with @dngfra, @adityavardhanv, @tml_lab.

11

692

106

789

74K

JacobCTanner1 retweeted

Denise J. Cai, Ph.D. @denisejcai

over 1 year ago

🌟 Cai Lab @Nature paper alert! In new work led by @mysteriousjoe_, we find that rest periods after learning not only stabilize new memories BUT ALSO integrate new memories with older ones from days past! (1/9) Read it here: https://t.co/Ur8dbGfuP3

denisejcai's tweet photo. 🌟 Cai Lab @Nature paper alert! In new work led by @mysteriousjoe_, we find that rest periods after learning not only stabilize new memories BUT ALSO integrate new memories with older ones from days past! (1/9)

Read it here: https://t.co/Ur8dbGfuP3 https://t.co/XwdrDmLWQY

38

584

134

150

81K

JacobCTanner1 retweeted

Machine Learning Street Talk

@MLStreetTalk

over 1 year ago

I finally got to meet @fchollet in person recently to interview him about @arcprize, intelligence vs memorization, human cognitive development, learning abstractions, limits of pattern recognition and consciousness development. These are the best bits. Full show released tomorrow

9

507

55

344

79K

JacobCTanner1 retweeted

Alan Jeffares @Jeffaresalan

over 1 year ago

There are many things we don’t understand about deep learning. Our new NeurIPS paper (w/ @AliciaCurth) makes the mistake of trying to tackle too many of them 😅 A simplified model of deep learning describes double descent, grokking, gradient boosting & linear mode connectivity🧵

Jeffaresalan's tweet photo. There are many things we don’t understand about deep learning. Our new NeurIPS paper (w/ @AliciaCurth) makes the mistake of trying to tackle too many of them 😅

A simplified model of deep learning describes double descent, grokking, gradient boosting & linear mode connectivity🧵 https://t.co/5N9nAXAMBu

15

746

129

908

110K

Jacob C Tanner

@JacobCTanner1

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users