Andrew Fitzgibbon

17 days ago

Hey, I wrote a blog - I think it's fun, hope you do too!

17 days ago

A tiny implementation detail in low-precision arithmetic could be biasing your AI training 😲 This interactive deep dive from Graphcore Research's @Awfidius uncovers a subtle failure mode in stochastic rounding that only appears when randomness is limited, and how it can be addressed with one simple fix 🎲 Check it out in the link below! 👇

1

8

4

6

3K

0

13

2

4

3K

Awfidius retweeted

17 days ago

A tiny implementation detail in low-precision arithmetic could be biasing your AI training 😲 This interactive deep dive from Graphcore Research's @Awfidius uncovers a subtle failure mode in stochastic rounding that only appears when randomness is limited, and how it can be addressed with one simple fix 🎲 Check it out in the link below! 👇

1

8

4

6

3K

Awfidius retweeted

7 months ago

🚨 Graphcore is hiring AI Research Interns! 🚨 Join us to work at the intersection of hardware and AI and help shape the future of AI systems. Whether you're excited about efficient inference, large-scale training, or advancing frontier-model capabilities, we’ve got cutting-edge projects for you to dive into. Interested? Apply below 👇

1

3

976

Awfidius retweeted

3D vision fanatic. Professor @cornell_tech & Researcher @GoogleDeepmind. He or they. https://t.co/m7Rs5xUFfG

over 1 year ago

Our Papers of the Month for September is now live! We cover: - LLM self-correction via RL - Trillion-token FP8 training - SOAP (Shampoo + Adam) - Generative models for crystals All framed in terms of "proper conditioning" (🧵) https://t.co/X6Xllf0SdC

1

53

13

28

4K

Who to follow

Noah Snavely

@Jimantha

Angjoo Kanazawa

@akanazawa

Assistant Professor at @Berkeley_EECS, @berkeley_ai. KAIR, @nerfstudioteam. Amazon Scholar @ FAR. Previously advised @WonderDynamics and @LumaLabsAI. she/her.

Marc Pollefeys

@mapo1

Director of Science at @Microsoft @HoloLens, Professor of Computer Science at @ETH Zurich, working on #ComputerVision

Awfidius retweeted

Charlie Blake @thecharlieblake

over 2 years ago

Graphcore Research internships are now open 🎉 We're looking for PhD students for next summer We're interested in algorithms & tools for hardware-efficient ML, in areas like LLM training/inference, GNNs, knowledge graphs and frameworks Spread the word! https://t.co/3AsDzaSney

0

10

3

2

2K

Awfidius retweeted

over 1 year ago

Introducing `tandv` - a library for tracking and visualising the internal stats of your model. We hope this will help with low-precision, debugging and more. (link in 🧵)

1

16

3

6

2K

Awfidius retweeted

Josef Dean @JosefNDean

almost 2 years ago

Sure matplotlib is cool, but what if I want to load my loss curves into the 2006 hit Flash game LineRider?

50

6K

793

2K

438K

almost 2 years ago

Update: super helpful customer service got me the brass fixing kit, tap no longer wobbling.

0

6

0

245

almost 2 years ago

Same problem, and based on online searches, many others have it too. Terrible wasteful design, saves 10 seconds when it works, costs hours when it doesn't, and a load of extra useless plastic is left in place having saved those 10 seconds.

1

3

0

1K

Charlie Blake @thecharlieblake

almost 2 years ago

As further feedback to the team that thought this would be a great innovation that would attract customers: I bought this tap because of Grohe's reputation for quality. The term "quickfix" had zero impact on my choice. However, that term now means "low quality gimmick".

1

2

0

438

Awfidius retweeted

almost 2 years ago

Our u-µP paper hit arXiv this morning! I'm so proud of this one — and grateful for a wonderful team who put so much into it 🥰 We add lots of good things to µP. Better sweeping, transfer, simple FP8. Already @cloneofsimo has a great thread on it, which I highly recommend

0

45

8

6

3K

Awfidius retweeted

Paul B. @PaulBalanca

almost 2 years ago

Excited to present our work at @icmlconf WANT workshop: ⚖️ Scalify: scale propagation for efficient low-precision LLM training 🎉 https://t.co/7BnCQ1Vpqk

1

6

1

0

335

Awfidius retweeted

Simo Ryu

@cloneofsimo

almost 2 years ago

babe wake up, new muP paper dropped https://t.co/YKmDolNXWP And holy smokes does this look promising!

10

335

52

286

36K

Awfidius retweeted

almost 2 years ago

Our team's summaries & analysis of our favourite papers from the last month. We give our take on: Mamba-2, sparse-µP, contextual position encoding & matmul-free models 🧵 https://t.co/Ad7HRGLSaf

1

9

8

2K

Awfidius retweeted

about 2 years ago

Our latest edition of *Papers of the Month* is now available 📚 These are summaries of our team's favourite papers from March, including a new low-rank training procedure GaLore, and the supposed "Era of 1-bit LLMs" (really 1.58 bits) Mini-version in 🧵 https://t.co/8NzkK0CGyB

1

13

9

4

2K

over 2 years ago

@robin_kips @eccvconf Oops, will fix - this works in the meantime: https://t.co/trHEnHnZlp

1

0

113

Dominic Masters @dominic_masters

over 2 years ago

I'm moving from extremely rarely posting interesting content on twitter to doing the same on, wow I'm old, linkedin...

1

25

0

4K

Awfidius retweeted

almost 3 years ago

So pleased to share that our OGB-LSC winning model GPS++ has just been published in TMLR.

0

13

4

0

2K

almost 3 years ago

I love that I'm being offered these - I guess the ad targeting knows I'm going to have a sudden urge to take up jewellery making long before I do...

Awfidius's tweet photo. I love that I'm being offered these - I guess the ad targeting knows I'm going to have a sudden urge to take up jewellery making long before I do... https://t.co/pSHqqABe4g

0

1

0

753