Fabian Fuchs

@FabianFuchsML

Senior Research Scientist @ DeepMind. I work on LLM post-training, focusing on improving training data. Prior to that, I worked on AlphaFold. Typos are my own.

Oxford, England

Joined June 2018

314 Following

2.7K Followers

152 Posts

Pinned Tweet

Fabian Fuchs @FabianFuchsML

over 4 years ago

A year ago I asked: Is there more than Self-Attention and Deep Sets? - and got very insightful answers. 🙏 Now, Ed, Martin and I wrote up our own take on the various neural network architectures for sets. Have a look and tell us what you think! :) ➡️https://t.co/Z1aprTcLQV ☕️

Fabian Fuchs @FabianFuchsML

almost 6 years ago

Both Max-Pooling (e.g. DeepSets) and Self-Attention are permutation invariant/equivariant neural network architectures for set-based problems. I am aware of a couple of variations for both of these. Are there additional, fundamentally different architectures for sets? 🤔

15

111

14

40

0

2

318

74

116

0
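The two properties named in the question above can be checked numerically. The following is a minimal sketch, not code from any of the linked posts: the set size `n`, feature size `d`, the random weight matrices and the helper names `max_pool` and `self_attention` are all assumptions chosen for illustration. It shows that max-pooling over a set is permutation invariant, while single-head self-attention without positional encodings is permutation equivariant.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 5, 4                      # assumed set size and feature dimension
X = rng.normal(size=(n, d))      # one set of n elements, one row per element
perm = rng.permutation(n)        # a random reordering of the set elements

def max_pool(X):
    """Element-wise max over the set axis: one vector per set."""
    return X.max(axis=0)

# Hypothetical random projection matrices for a single attention head.
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))

def self_attention(X):
    """Plain single-head self-attention without positional encodings."""
    scores = (X @ Wq) @ (X @ Wk).T / np.sqrt(d)
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)        # row-wise softmax
    return weights @ (X @ Wv)

# Invariance: reordering the input leaves the pooled output unchanged.
invariant = np.allclose(max_pool(X[perm]), max_pool(X))

# Equivariance: reordering the input reorders the output rows the same way.
equivariant = np.allclose(self_attention(X[perm]), self_attention(X)[perm])

print(invariant, equivariant)
```

Pooling the attention output over the set axis would turn the equivariant map into an invariant one, which is one way the two families are combined in practice.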

Fabian Fuchs @FabianFuchsML

12 days ago

I wrote a blog post trying to understand TrackStar, a gradient-based method tracing LLM predictions to influential training examples. Mostly: take the main equation, read it from right to left, and poke at the pieces with a small MNIST toy example. 🙂 ☕️ https://t.co/JCWblqh956

0

82

10

62

8K

FabianFuchsML retweeted

Adam Golinski @adam_golinski

over 2 years ago

Our Apple ML Research team in Barcelona is looking for a PhD intern! 🎓 Curiosity-driven research 🧠 with the goal to publish 📝 Topics: Confidence/uncertainty quantification and reliability of LLMs 🤖 Apply here: https://t.co/ZiN3ecWGo7

4

271

48

147

47K

Fabian Fuchs @FabianFuchsML

about 3 years ago

Graphs, Sets, Universality: We put more work into this and are presenting it via the ICLR blogpost track (thanks to organisers and reviewers!). Have a read and let us know what you think: https://t.co/MtpPYYUfTm It reads better in light mode💡, dark mode🌙 messes with the LaTeX a bit

Petar Veličković

over 3 years ago

📢 New blog post! Realising an intricate connection between PNA (@GabriCorso @lukecavabarrett @dom_beaini @pl219_Cambridge) & the seminal work on set representations (Wagstaff @FabianFuchsML @martinengelcke @IngmarPosner @maosbot), Fabian and I join forces to attempt to explain!

1

56

11

14

0

0

43

8

9

9K

FabianFuchsML retweeted

Adam R. Kosiorek @arkosiorek

over 3 years ago

Text-to-image diffusion models seem to have a good idea of geometry. Can we extract that geometry? Or maybe we can nudge these models to create large 3D consistent environments? Here's a blog summarizing some ideas in this space :) https://t.co/INlMuslCdU

0

137

25

40

19K

Fabian Fuchs @FabianFuchsML

over 3 years ago

@GabriCorso That is fantastic to hear!

0

2

0

0

0

Fabian Fuchs @FabianFuchsML

over 3 years ago

@PetarV_93 @GabriCorso @lukecavabarrett @dom_beaini @pl219_Cambridge @martinengelcke @IngmarPosner @maosbot Ed's link is broken, here it is: @EdWagstaff :)

0

2

0

0

0

FabianFuchsML retweeted

Petar Veličković

over 3 years ago

📢 New blog post! Realising an intricate connection between PNA (@GabriCorso @lukecavabarrett @dom_beaini @pl219_Cambridge) & the seminal work on set representations (Wagstaff @FabianFuchsML @martinengelcke @IngmarPosner @maosbot), Fabian and I join forces to attempt to explain!

1

56

11

14

0

Fabian Fuchs @FabianFuchsML

over 3 years ago

I have recently had a range of very insightful conversations with @PetarV_93 about graph neural networks, networks on sets, universality and how ideas have spread in the two communities. This is our write up, feedback welcome as always! :) ➡️https://t.co/vbmCGHBHxd ☕️

2

202

44

72

0

FabianFuchsML retweeted

Sergey Ovchinnikov @sokrypton

almost 4 years ago

Anyone know of a department looking to hire faculty in the protein/genome+evolution+ML space? Also RNA biology (asking for a friend) 🙂🥼🧪

20

116

24

15

0

FabianFuchsML retweeted

Adam R. Kosiorek @arkosiorek

almost 4 years ago

New blog post! Find out:
- what reconstructing masked images and our brains have in common,
- why reconstructing masked images is a good idea for learning representations,
- what makes a good mask and how to learn one
https://t.co/bFrmQvITEz

2

96

15

30

0

Fabian Fuchs @FabianFuchsML

about 4 years ago

@Padarn Gradient descent :)

1

1

0

0

0

Fabian Fuchs @FabianFuchsML

about 4 years ago

Graph neural networks often have to globally aggregate over all nodes. How we do this can have a significant impact on performance 🎯. After we recently finished a project on this, I wrote a blog post on this topic. Let me know what you think! :) ➡️https://t.co/OGJJAF9w9C ☕️

6

473

80

150

0

Fabian Fuchs @FabianFuchsML

about 4 years ago

@jonkhler That's very true, nice observation! Let me know how it goes in case you do, I am curious :)

0

2

0

0

0

FabianFuchsML retweeted

Emiel Hoogeboom @emiel_hoogeboom

about 4 years ago

Molecule Generation in 3D with Equivariant Diffusion (https://t.co/4ZgiHdswER). Very happy to share this project (the last of my PhD woohoo 🥳) and a super nice collab with @vgsatorras @ClementVignac (equal contrib shared among three of us) and of course @wellingmax

6

400

64

85

0

Fabian Fuchs @FabianFuchsML

over 4 years ago

@newplatonism Depends, are you happy with treating cars as point masses in vacuum? :P More seriously: people do work on making the L/H-based NNs more general (like allowing for friction & external forces), but, to my understanding, it's still mostly constrained to physical particle systems

0

0

0

0

0

Fabian Fuchs @FabianFuchsML

over 4 years ago

Emmy Noether connected symmetries and conserved quantities in physics - how is this related to exploiting symmetries with neural networks? 🤔 I've tried to answer this question in a blog post (no background knowledge required!): ➡️https://t.co/VfxMu3fkAK ☕️

5

222

40

62

0

Fabian Fuchs @FabianFuchsML

over 4 years ago

@lzamparo The book was already linked but I now added a few more links for further reading, including the ICLR keynote. Thanks for the suggestion!

0

1

0

0

0

Fabian Fuchs @FabianFuchsML

over 4 years ago

@_onionesque @Blendenfleck Thank you!

0

1

0

0

0

Fabian Fuchs @FabianFuchsML

over 4 years ago

I should have said 'no physics background knowledge required' - the blog post does assume general machine learning background knowledge :)

0

7

0

0

0

Fabian Fuchs @FabianFuchsML

over 4 years ago

@william_woof @andrewwhite01 +1; also, this equivalence does not need to be obvious at all. In some cases, like with max(), it might even seem counterintuitive that the function >can< be written as a sum-decomposition (i.e. in a Deep Sets form)

0

1

0

0

0
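The remark above that max() can be written as a sum-decomposition, i.e. as rho(sum of phi(x)), may be easier to believe with a concrete case. The sketch below is only one simple construction and not necessarily the one discussed in the thread or the blog posts. It assumes the set elements are integers from a small finite domain `{0, ..., K-1}`; `K`, `phi`, `rho` and `max_via_sum` are hypothetical names introduced for illustration. Here `phi` one-hot encodes each element, the sum yields a histogram of the set, and `rho` reads off the largest occupied bin.

```python
import numpy as np

K = 10  # assumed size of the finite domain {0, ..., K-1}

def phi(x: int) -> np.ndarray:
    """Embed one element as a one-hot vector of length K."""
    v = np.zeros(K, dtype=np.int64)
    v[x] = 1
    return v

def rho(z: np.ndarray) -> int:
    """Decode the summed embedding: index of the largest non-empty bin."""
    return int(np.nonzero(z)[0].max())

def max_via_sum(xs) -> int:
    """Compute max(xs) in Deep Sets form rho(sum_i phi(x_i)) for a non-empty xs."""
    z = np.sum([phi(x) for x in xs], axis=0)  # order of xs does not matter
    return rho(z)

example = [3, 7, 2, 7]
print(max_via_sum(example), max(example))
```

The latent dimension here grows with the size of the domain, which hints at why the width of the summed representation matters for what such a model can express.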
