Daniel Whettam @DWhettam - Twitter Profile

DWhettam retweeted

4 months ago

New paper on a long-shot I've been obsessed with for a year:

How much are AI reasoning gains confounded by expanding the training corpus 10000x? How much LLM performance is down to "local" generalisation (pattern-matching to hard-to-detect semantically equivalent training data)? https://t.co/utpmbXIK93

32

964

131

738

225K

DWhettam retweeted

Alan Jeffares @Jeffaresalan

11 months ago

Our new ICML 2025 oral paper proposes a new unified theory of both Double Descent and Grokking, revealing that both of these deep learning phenomena can be understood as being caused by prime numbers in the network parameters 🤯🤯 🧵[1/8]

13

935

76

941

131K

DWhettam retweeted

Kevin Nejad

@kevin_nejad

11 months ago

After a year of review, our paper is now out in @NatureComms ! I really think our model/theory offers one of the most promising frameworks yet for learning in neocortical circuits - perhaps not in its final form, but the core principle of ssl feels right. Hoping experimentalists will test it, break it, and computationalists will refine it #AllModelsAreWrongButSomeAreUseful - I think this one could be very useful😀

1

22

4

2

1K

DWhettam retweeted

Andrej Karpathy

@karpathy

about 1 year ago

Good post from @balajis on the "verification gap".

You could see it as there being two modes in creation. Borrowing GAN terminology: 1) generation and 2) discrimination. e.g. painting - you make a brush stroke (1) and then you look for a while to see if you improved the painting (2). these two stages are interspersed in pretty much all creative work.

Second point. Discrimination can be computationally very hard.
- images are by far the easiest. e.g. image generator teams can create giant grids of results to decide if one image is better than the other. thank you to the giant GPU in your brain built for processing images very fast.
- text is much harder. it is skimmable, but you have to read, it is semantic, discrete and precise so you also have to reason (esp in e.g. code).
- audio is maybe even harder still imo, because it forces a time axis so it's not even skimmable. you're forced to spend serial compute and can't parallelize it at all.

You could say that in coding LLMs have collapsed (1) to ~instant, but have done very little to address (2). A person still has to stare at the results and discriminate if they are good. This is my major criticism of LLM coding in that they casually spit out *way* too much code per query at arbitrary complexity, pretending there is no stage 2. Getting that much code is bad and scary. Instead, the LLM has to actively work with you to break down problems into little incremental steps, each more easily verifiable. It has to anticipate the computational work of (2) and reduce it as much as possible. It has to really care.

This leads me to probably the biggest misunderstanding non-coders have about coding. They think that coding is about writing the code (1). It's not. It's about staring at the code (2). Loading it all into your working memory. Pacing back and forth. Thinking through all the edge cases. If you catch me at a random point while I'm "programming", I'm probably just staring at the screen and, if interrupted, really mad because it is so computationally strenuous. If we only get much faster 1, but we don't also reduce 2 (which is most of the time!), then clearly the overall speed of coding won't improve (see Amdahl's law).

134

4K

537

3K

845K

Daniel Whettam @DWhettam

about 1 year ago

@ghostbestie057 @CollinRugg This was after the official races. He just ran down for fun at the end. Cheese wasn't even on the table

0

58

Daniel Whettam @DWhettam

about 1 year ago

@jxmnop Similar idea to The Platonic Representation Hypothesis? https://t.co/MTt30ixv3z

0

1

325

Daniel Whettam @DWhettam

about 1 year ago

@jxmnop Why do you say it's not self-supervised? There are many SSL tasks where the self-supervision is the task of interest. Language modelling is one of those IMO. Self supervision is supervised training where the supervision comes from the data (hence "self")

0

1

0

121

Daniel Whettam @DWhettam

almost 2 years ago

@SBRLabs Congrats!

0

1

0

27

Daniel Whettam @DWhettam

almost 2 years ago

@CianEastwood @valence_ai @RecursionPharma Congrats!

0

1

0

79

DWhettam retweeted

gavin leech (Non-Reasoning)

@gleech

over 2 years ago

New paper: a big 90-page intro to AI and its likely effects from ten perspectives, ten camps.

The whole gamut: ML, scientific applications, social applications, access, safety and alignment, economics, AI ethics, governance, and classical philosophy of life.

1/18 https://t.co/5Flz7lol8l

6

375

72

568

61K

DWhettam retweeted

Kipp Freud @kipp_freud

over 2 years ago

(1) Good news! I've had a paper accepted (with @cian_neuro, @nathanlepora, and Matt W. Jones), and I'll be giving a talk on it at @AAMASconf this year 🥳🥳🥳🧠🤖🐀🥳🥳🥳

1

10

3

2

1K

Daniel Whettam @DWhettam

over 2 years ago

@bristol_filming @josmith1975x Yes, that must be it! I believe I saw Naomi Scott being filmed outside Clifton Arcade.

0

4

0

231

Daniel Whettam @DWhettam

almost 3 years ago

@iamJoeWhettam Big first tweet

0

3

0

49

Daniel Whettam @DWhettam

almost 3 years ago

@vertinski This is the lottery ticket hypothesis: https://t.co/0PGI5tbNCo

0

1

0

1

226

Daniel Whettam @DWhettam

almost 3 years ago

@wafajohal @TheOfficialACM @ieeeras @ICCVConference has a policy against using ChatGPT for conference reviews

1

2

0

689

Daniel Whettam @DWhettam

almost 3 years ago

@gleech Most martial arts

0

2

0

57

DWhettam retweeted

Jacob Chalk @JacobChalkie

over 3 years ago

Very excited the EPIC-SOUNDS dataset is finally released! We’ve all worked incredibly hard on this and I can be proud to say that this is my first publication! Looking forward to what this dataset can bring to the deep learning community!

0

9

1

0

740

DWhettam retweeted

Dima Damen @CVPR @dimadamen

over 3 years ago

📢 Now Open For Submissions - all EPIC-KITCHENS Leaderboards for @CVPR #CVPR2023 Challenges. Winners announced at Joint @ego4_d and EPIC workshop: https://t.co/u81J84aPDD
**Nine** open challenges inc. 4 new ones (see 🧵)
Leaderboards close 1st of June 2023.
🧵 1/7 https://t.co/yDtInZ56PO

1

34

16

5

22K

DWhettam retweeted

Jacob Chalk @JacobChalkie

over 3 years ago

For anyone interested in audio-visual learning, this challenge will be of interest! EPIC-100 and EPIC-SOUNDS are a step towards multi-modal methods where one modality does not rely on the timestamps or label set of another! We look forward to seeing what this challenge brings!

0

11

1

1K
