Frans Zdyb @FZdyb - Twitter Profile

Pinned Tweet

Frans Zdyb @FZdyb

almost 4 years ago

Why AI needs to ease up on scaling and learn how to code: https://t.co/Ei2Dv5QyfM @GaryMarcus @fchollet @yudapearl

0

32

9

11

0

FZdyb retweeted

Θωμᾶς del Vasto

@Thomasdelvasto_

7 days ago

it’s insanely creepy how our entire modern society follows Sir Francis Bacon’s explicitly plan almost exactly

21

728

21

426

64K

Frans Zdyb @FZdyb

7 days ago

@hard_boiledbabe Are we sure we want to be taking his advice tho? Dude fumbled a 10 that was super into him because he got too into his head

0

4

0

1

372

Frans Zdyb @FZdyb

13 days ago

@ToKTeacher I think the claim is less "there's no thinker" and more "the model of yourself as an enduring and stable uncaused causer of thoughts is inaccurate"

0

5

0

62

Who to follow

Damian Konrad

@nightflight_dk

e/acc @ MSFT, PhD, eMBA

Maxim Khomiakov

@maximkhv

managing context windows

Andrea Dittadi

@andrea_dittadi

ML research @IsomorphicLabs · previously Helmholtz Munich, TUM, Microsoft, Amazon, @MPI_IS, @DTUTweet · into diffusion/flows & (causal) representation learning

Frans Zdyb @FZdyb

14 days ago

@SchopenhauerOn @srikosuri I got 99 conjectures...

0

43

Frans Zdyb @FZdyb

17 days ago

@gavinrbrown1 Right, I guess I meant without training, but this also serves to make the point. My intuition is that you could put bounds on capacity (or other properties) if you have a simple IT based model of an NN, but the empirical success of NNs requires a physics-style explanation.

0

35

Frans Zdyb @FZdyb

17 days ago

@gavinrbrown1 I don't know, was genuinely asking whether there's a way to describe the input distribution, information bottleneck in the architecture, and get the memorization capacity using some IT concept..!

1

0

57

Frans Zdyb @FZdyb

17 days ago

@gavinrbrown1 You mean how much info it can memorize, or..? Can't you almost calculate this if you know the architecture?

1

0

55

Frans Zdyb @FZdyb

17 days ago

@gavinrbrown1 I think they are less meaningful because the IT part is incidental, like calculus is for EM. People expect IT to somehow say stuff on its own, but it's just math. We need the "physics" of AI :)

1

0

61

Frans Zdyb @FZdyb

17 days ago

@gavinrbrown1 Eg maybe it turns out hierarchical architectures induce manifold structure s.t. large program subspaces are ordered wrt time-bound Kolmogorov complexity, when doing gradient descent

0

22

Frans Zdyb @FZdyb

17 days ago

@gavinrbrown1 They do - set up boundary conditions, run a solver and out pop very good predictions, all from an extremely simple description of two coupled vector fields. IT seems more comparable to calculus - IT would provide concepts, but the explanation itself is some hidden entity.

2

1

0

149

Frans Zdyb @FZdyb

19 days ago

@EmilevanKrieken @yoavgo @andrewgwils Similar to randomized time-bounded Kolmogorov complexity, the length of the shortest program that can output a given string with high probability using a randomized, time-bounded algorithm (so information theory people do already have a concept for this).

0

2

0

49

Frans Zdyb @FZdyb

22 days ago

@gleech @entirelyuseles Vision is an incredibly difficult inverse problem, and that's not even getting to the hard part of learning intuitive physics and concepts and categories.. language kinda gets these as input, the rest is grammar and style. Very handwavy argument, but I think the point stands.

0

38

Frans Zdyb @FZdyb

22 days ago

@gleech @entirelyuseles No doubt the higher info density of language requires LLMs to be more 'nonlinear'; vision models get a lot of bits right just by learning spatiotemporal smoothness, which is linear-ish. But if you control for info density and current capability, I'd guess vision is harder..

1

0

57

Frans Zdyb @FZdyb

22 days ago

@entirelyuseles @gleech models would degenerate after 30 seconds of trying to keep track of people and objects in a courtroom.

0

29

Frans Zdyb @FZdyb

22 days ago

@entirelyuseles @gleech Compare the kind of conversational ability you need to be a farmer, trader or lawyer, vs the kind of scene understanding you'd need for the same jobs. Even though a lawyer needs excellent language skills and little vision skill, LLMs could easily handle the language part, vision

1

0

42

Frans Zdyb @FZdyb

26 days ago

@francoisfleuret Gonna plug my ML x Epistemology thing here: https://t.co/LZW3t37WAQ

0

67

Frans Zdyb @FZdyb

27 days ago

@JesusFerna7026 @AVMiceliBarone @MahdiKahou But NNs are also different from standard piecewise affine functions; they can also adjust many slopes in many regions, by changing early layers.

0

1

0

85

Frans Zdyb @FZdyb

28 days ago

@gleech @tenobrus Cool! I'm reading your Massively Parallel RWS paper. I wonder if it could scale to models like neural PCFGs on videos.

0

3

0

59

Frans Zdyb @FZdyb

28 days ago

@gleech @tenobrus IMO discrete latent variables are symbols. Hidden Markov models learn a "symbolic" state representation. It gets interesting when you replace the Markov chain with a tree, and get a PCFG; then continue up the Chomsky hierarchy. PPL people have been doing this stuff for a while.

1

0

104

Frans Zdyb

@FZdyb

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users