Daniel Greenfeld @d_greenfeld - Twitter Profile

about 1 month ago

.@NitCal will be presenting "Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality" at ICML 2026 next month. (tl;dr: we show encoding is near-saturated on frontier LLMs, but models still struggle to recall encoded facts.)

One recurring piece of feedback we've gotten since posting the paper: "you show LLMs struggle with factual recall, but does that even matter when today's agents can use external retrieval?"

Here's how I currently think about this, and more broadly about the role of parametric knowledge in today's systems:

The theoretical argument for why knowledge matters (true in principle, but I don't know of work that measures this in practice): parametric knowledge is important for making efficient use of search and for knowing how to properly integrate retrieved information. Imagine finding some weird pizza recipe online: can you trust it without knowing a lot about cooking, chemistry, etc.? I think this is going to become a bigger issue moving forward, the "sloppier" the internet becomes.

The realistic case for why knowledge matters: today's agents are far from producing responses that are fully grounded in external evidence. Even when search triggers properly (which it often doesn't), only the "big" claims tend to be grounded, while models still volunteer a lot of extra information from their parametric knowledge.

Since models are still poor at "knowing what they know" (more on that in my next post, about our other ICML paper...), our best bet is making models actually more knowledgeable, and our paper reveals where the headroom for that lies.

2

31

5

13

4K

d_greenfeld retweeted

Uri Shalit @ShalitUri

over 4 years ago

Today at 11:30 EST / 16:30 GMT we'll be presenting our poster about our work “On Calibration and Out-of-domain Generalization” at #NeurIPS2021, come visit! https://t.co/vYIdjeRMHh @wald_yoav @amir_feder @d_greenfeld

0

33

8

2

0

d_greenfeld retweeted

Fermat's Library @fermatslibrary

about 7 years ago

Paper "What is it like to be a bat?" Thomas Nagel's thought experiment about consciousness is still as relevant today as it was when it was first published in 1974. https://t.co/abesYXI7n1

7

324

66

44

0

d_greenfeld retweeted

Aleksander Madry @aleks_madry

over 7 years ago

Took a while (don't ask) but here they are: Notes from "Science of Deep Learning" class co-taught with @KonstDaskalakis now available: https://t.co/0H1SKClPUf. More coming soon (promise!). Feedback very welcome! Thanks to @andrew_ilyas for heroic effort on doing final revisions.

10

450

112

84

0

d_greenfeld retweeted

Andrei Bursuc @CVPR @abursuc

over 7 years ago

A visual exploration of Gaussian Processes: beautiful interactive plots and a brief tutorial to make GPs more approachable https://t.co/5m4YqEyDAE

2

412

113

82

0

d_greenfeld retweeted

Google DeepMind @GoogleDeepMind

over 7 years ago

Today we are excited to release video recordings of lectures from "Advanced Deep Learning and Reinforcement Learning", a course on deep RL taught at @UCL earlier this year by DeepMind researchers: https://t.co/znsWtTxQcN Enjoy!

45

4K

2K

568

0

d_greenfeld retweeted

bijan @bijanstephen

over 7 years ago

i've never seen a clearer explanation of plato's cave

223

47K

13K

836

0

d_greenfeld retweeted

David Barrett @dgtbarrett

almost 8 years ago

Our latest work on ‘Measuring abstract reasoning in neural networks’ has just been published at #icml2018. As always, it was a privilege to collaborate with @santoroAI, Felix Hill, @arimorcos and Tim Lillicrap. Paper: https://t.co/lUZKpxvfd2 Blog post: https://t.co/LQaf5MpV04

1

32

12

2

0

d_greenfeld retweeted

Yascha Mounk @Yascha_Mounk

almost 8 years ago

In 1951, Bertrand Russell took to the @nytimes to argue that the best answer to fanaticism was a calm search for truth. His Ten Commandments of Liberal Inquiry could not be more relevant today. (Number 6 will blow your mind! ;) ) Thread.

53

4K

2K

302

0

d_greenfeld retweeted

Durk Kingma @dpkingma

almost 8 years ago

Check out https://t.co/cBigTutSGn, my work with @prafdhar on improving flow-based generative models with invertible 1x1 convolutions. https://t.co/znKj0LnCxm

6

627

203

26

0

d_greenfeld retweeted

Roger Grosse @RogerGrosse

almost 8 years ago

New paper analyzing sample-based metrics for evaluating generative models, from the Cornell group. Tests if they can detect things like overfitting and mode collapse. Should be required reading for everyone working on generative models. https://t.co/oPzFYkAZ8o

2

143

26

17

0

Daniel Greenfeld @d_greenfeld

about 8 years ago

@gstsdn @goodfellow_ian Great paper. Out of curiosity - why use spectral normalization on the generator and not Jacobian clamping like in the previous paper?

0

1

0

d_greenfeld retweeted

Google DeepMind @GoogleDeepMind

about 8 years ago

By learning to write programs that generate images our artificial agents can reason about how digits, characters and portraits are constructed. Read the blog: https://t.co/zNDRMAEdOW

16

897

374

35

0

d_greenfeld retweeted

Ian Goodfellow @goodfellow_ian

about 8 years ago

2nd thread on evaluating GAN papers (1st thread hit max thread length)

2

155

54

21

0

d_greenfeld retweeted

Zachary Lipton @zacharylipton

about 8 years ago

Simple GAN inversion experiments easily show that ***all*** real images (except for zero measure subset) have 0 probability of being generated by a GAN (off the manifold). What does this say about the promise (or lack thereof) of training models based on GAN-generated datasets?

11

65

20

3

0

d_greenfeld retweeted

Ian Goodfellow @goodfellow_ian

about 8 years ago

Check out Adversarial Logit Pairing, the new state of the art defense against adversarial examples on ImageNet, by @harinidkannan @alexey2004 and I: https://t.co/2JIT1t3ApO

3

508

176

24

0

d_greenfeld retweeted

Tom Rainforth @tom_rainforth

about 8 years ago

Nesting probabilistic programs allows us to model agents reasoning about other agents, but current inference engines typically give invalid estimates. Check out how to do things correctly in my new paper https://t.co/4EQqwxUdDt

0

60

24

5

0

d_greenfeld retweeted

Pablo Stanley @pablostanley

over 8 years ago

GESTALT PRINCIPLES THREAD! Gestalt is the idea that we see the whole of something before the individual parts. PROXIMITY (1/8) When objects are close to each other, they tend to be perceived together in a group. Use white space to separate groups. Reduce it to group elements.

36

3K

1K

371

0

d_greenfeld retweeted

Sanjeev Arora @prfsanjeevarora

over 8 years ago

Encoder-decoder GAN architectures still don't fix the theoretical problems in the GAN framework, such as mode collapse. Encoders may produce nonsense codes and the discriminator is none the wiser. Blog post https://t.co/oQBxIaVEri and ICLR'18 paper https://t.co/64nmaNsef3

3

61

14

6

0

d_greenfeld retweeted