Rick Izzo @TheRickIzzo - Twitter Profile

almost 2 years ago

Why do 16k GPU jobs fail? The Llama3 paper has many cool details -- but notably, has a huge infrastructure section that covers how we parallelize, keep things reliable, etc. We hit an overall 90% effective-training-time. https://t.co/5gngOZJHBO

[Photo attached to soumithchintala's tweet]

31

1K

202

705

381K

TheRickIzzo retweeted

Andrew Ng

@AndrewYNg

over 2 years ago

It is only rarely that, after reading a research paper, I feel like giving the authors a standing ovation. But I felt that way after finishing Direct Preference Optimization (DPO) by @rm_rafailov @archit_sharma97 @ericmitchellai @StefanoErmon @chrmanning and @chelseabfinn. This beautiful paper proposes a much simpler alternative to RLHF (reinforcement learning from human feedback) for aligning language models to human preferences.

RLHF has been a key technique for training LLMs. In brief, RLHF (i) Gets humans to specify their preferences by ranking LLM outputs, (ii) Trains a reward model (used to score LLM outputs) -- typically represented using a transformer network -- to be consistent with the human rankings, (iii) Uses reinforcement learning to tune an LLM, also represented as a transformer, to maximize rewards. This requires two transformer networks, and RLHF is also finicky to the choice of hyperparameters.

DPO simplifies the whole thing. Via clever mathematical insight, the authors show that given an LLM, there is a specific reward function for which that LLM is optimal. DPO then trains the LLM directly to make the reward function (that’s now implicitly defined by the LLM) consistent with the human rankings. So you no longer need to deal with a separately represented reward function, and you can train the LLM directly to optimize the same objective as RLHF.

Although it’s still too early to be sure, I am cautiously optimistic that DPO will have a huge impact on LLMs and beyond in the next few years.

You can read the paper here: https://t.co/m14qRYszVa I also write more about this in The Batch (linked to below). https://t.co/8h2ag2plIa
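The idea that the reward is implicitly defined by the LLM can be illustrated with a minimal sketch of the per-example DPO loss. The details below come from the DPO paper and are not stated in the tweet: the implicit reward is taken to be `beta` times the log-probability ratio between the model being trained and a frozen reference model, and the loss is the negative log-sigmoid of the reward margin between the preferred and the rejected response. The function names, argument names, and the default `beta=0.1` are illustrative choices, and the inputs are assumed to be summed token log-probabilities of each full response.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # illustrative default, an assumption
) -> float:
    """Per-example DPO loss for one (chosen, rejected) preference pair.

    Each argument is the log-probability a model assigns to a full
    response given the prompt. The "ref" values come from a frozen
    reference model (an assumption taken from the paper, not the tweet).
    """
    # Implicit rewards: how much more likely the policy makes each
    # response compared with the reference model, scaled by beta.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)

    # The loss shrinks as the chosen response's implicit reward
    # exceeds the rejected response's implicit reward.
    return -log_sigmoid(chosen_reward - rejected_reward)


# Hypothetical log-probabilities, for illustration only.
baseline = dpo_loss(-12.0, -15.0, -12.0, -15.0)  # policy equals reference
improved = dpo_loss(-10.0, -18.0, -12.0, -15.0)  # policy favors the chosen response
```

When the policy equals the reference model, the margin is zero and the loss is log 2. Raising the chosen response's log-probability relative to the rejected one lowers the loss, with no separately represented reward model involved.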

51

5K

749

4K

696K

Rick Izzo @TheRickIzzo

over 2 years ago

@mylifcc @LightningAI @mylifcc you're in! enjoy!

0

11

TheRickIzzo retweeted

Lightning AI ⚡️

@LightningAI

over 2 years ago

All-in-one. Zero setup. It’s finally here - Lightning AI Studios Launch a free Studio - https://t.co/IcYvVsd88b

2

224

19

105

35K

Rick Izzo @TheRickIzzo

over 2 years ago

@mylifcc @LightningAI Hey @mylifcc, Rick from @LightningAI here. Shoot me a DM with the email you used to sign up and I’ll admit you from the waitlist!

0

16

Rick Izzo @TheRickIzzo

over 2 years ago

@Mickey_English @LightningAI You’re on! Enjoy!

0

11

Rick Izzo @TheRickIzzo

over 2 years ago

@Mickey_English @LightningAI Hey @Mickey_English, Rick from @LightningAI here. Send me a DM with the email you used to sign up and I’ll give you access!

0

1

0

27

TheRickIzzo retweeted

Noah Hein

@TheNoahHein

about 3 years ago

Devs when github is down

13

361

70

3

48K

TheRickIzzo retweeted

Sebastian Raschka

@rasbt

over 3 years ago

You love using PyTorch for Deep Learning but want it a bit more organized, so it's easier to take advantage of more advanced features? Great news: Unit 5 is finally live! In Unit 5, I'll show you how to train PyTorch models with the Lightning Trainer! 🔗 https://t.co/MJB1pr3npi

[rasbt's tweet photo] https://t.co/h36snJtyt8

10

421

67

205

105K

Rick Izzo @TheRickIzzo

over 3 years ago

@LinuxSeb Linux Mint

0

3

0

Rick Izzo @TheRickIzzo

over 3 years ago

@nevrekaraishwa2 @savetobookmarks

1

0

Rick Izzo @TheRickIzzo

over 3 years ago

@nixcraft Pop!_OS. It kind of just works

0

1

0

TheRickIzzo retweeted

Presley Mullinax @PresleyMullinax

almost 4 years ago

Is @Google really down right now?!? Or is this just a me problem? #googledown

182

2K

225

17

0

TheRickIzzo retweeted

Kyunghyun Cho

@kchonyc

over 4 years ago

hmm.. what is this!? @PyTorchLightnin

4

49

3

0

TheRickIzzo retweeted

William Falcon ⚡️

@williamfalcon

over 5 years ago

1/3 As a researcher from a non-traditional background, I take attribution very seriously and embrace constructive scientific discussion that accelerates learning and advances progress. Flash was developed to serve the evolving needs of PyTorch Lightning users.

3

96

11

9

0

TheRickIzzo retweeted

Lightning AI ⚡️

@LightningAI

over 5 years ago

We've surpassed 1 million downloads of Lightning! 🥳 Huge thank you to our contributors and #community. #opensource #deeplearning #communitylove