Thomas Chaton @chaton_thomas - Twitter Profile

over 1 year ago

How can we use small LLMs to shift more AI workloads onto our laptops and phones? In our paper and open-source code, we pair on-device LLMs (@ollama) with frontier LLMs in the cloud (@openai, @together), to solve token-intensive workloads on your 💻 at 17.5% of the cloud cost while maintaining 97.9% of the accuracy. See Gru and the Minions in action below, 🔉on please (h/t @cartesia)!

41

635

169

495

193K

chaton_thomas retweeted

NVIDIA AI Developer

@NVIDIAAIDev

over 1 year ago

Introducing DeepSeek-R1 optimizations for Blackwell, delivering 25x more revenue at 20x lower cost per token, compared with NVIDIA H100 just four weeks ago. Fueled by TensorRT DeepSeek optimizations for our Blackwell architecture, including FP4 performance with state-of-the-art production accuracy, it scored 99.8% of FP8 on MMLU general intelligence benchmark. FP4-optimized DeepSeek checkpoint now available on @huggingface: https://t.co/NxLukbCESw

106

3K

416

646

501K

chaton_thomas retweeted

William Falcon ⚡️

@williamfalcon

over 1 year ago

Here I show you how to finetune and deploy DeepSeek R1 (8B) for < $1.00 in 8 minutes using the AI Hub from @LightningAI ⚡️⚡️

1

66

17

46

5K

chaton_thomas retweeted

DeepSeek

@deepseek_ai

over 1 year ago

🚀 Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference! Core components of NSA: • Dynamic hierarchical sparse strategy • Coarse-grained token compression • Fine-grained token selection 💡 With optimized design for modern hardware, NSA speeds up inference while reducing pre-training costs—without compromising performance. It matches or outperforms Full Attention models on general benchmarks, long-context tasks, and instruction-based reasoning. 📖 For more details, check out our paper here: https://t.co/HJiqzwnUV7

883

15K

2K

4K

3M

Thomas Chaton @chaton_thomas

almost 2 years ago

@ThomasScialom It would be fantastic if the data and pre/post-training code were open-sourced too.

0

104

Thomas Chaton @chaton_thomas

almost 2 years ago

@Thom_Wolf It would be fantastic if the data and pre/post-training code were open-sourced too.

1

0

1K

Thomas Chaton @chaton_thomas

almost 2 years ago

@bhimrazyadav @LightningAI Hey @bhimrazyadav, thanks a lot for the shout-out. Really appreciated. Lucky to have you as a user!

0

1

0

67

Thomas Chaton @chaton_thomas

about 2 years ago

@tomcocobrico @LightningAI That's great @tomcocobrico. Feel free to join our Discord if you need help with anything, and you can publish to the Templates Gallery too: https://t.co/gSmrHvhOx8 ;)

0

65

Thomas Chaton @chaton_thomas

about 2 years ago

@hertzfelt_io @LightningAI Looks cool!

0

88

chaton_thomas retweeted

Linus

@thesephist

about 2 years ago

A while ago I complained here about persistent storage in Google Colab.

Have been using @LightningAI Studios for a while now for:
- Full VSCode (incl. GH Copilot)
- Persisted files shared across notebooks
- Multi-GPU/node (!!)

It's been great. Feels like a remote ML workstation https://t.co/5axNYbQgJi

6

259

33

173

56K

Thomas Chaton @chaton_thomas

about 2 years ago

@DataChaz @GoogleColab @LightningAI @code @pycharm So true. Colab is so 2015.

0

1

0

27

Thomas Chaton @chaton_thomas

about 2 years ago

@bhimrazyadav @LightningAI Hey @bhimrazyadav. That's great! BTW, do you know about LitData (https://t.co/0JLt5J6xGr)? This is the library we built to make data processing on @LightningAI fast and scalable.

1

2

0

44

Thomas Chaton @chaton_thomas

over 2 years ago

@karpathy @karpathy, give Lightning Studio a try. You won't use your local computer ever again!

0

49

Thomas Chaton @chaton_thomas

over 2 years ago

@elitepax @LightningAI That's great to hear @elitepax. You can have a look at our published Studios: https://t.co/0l9MqUVTZn. There is a ton to learn from there, and everything is ready one click away.

0

1

0

18

Thomas Chaton @chaton_thomas

over 2 years ago

@AnindyadeepS @LightningAI @williamfalcon @lantiga Hey @AnindyadeepS, thanks! Great timing! I am working on the docs right now. They should be available in the coming weeks! Would you mind joining our Slack? You can reach out to me directly there. My username is tchaton.

1

0

57

Thomas Chaton @chaton_thomas

over 2 years ago

@AuroraNemoia Hey @AuroraNemoia, check this out: https://t.co/uHeGx0qTK1. With @LightningAI, you can easily prepare your dataset to train any model.

0

1

0

21

Thomas Chaton @chaton_thomas

over 2 years ago

You can duplicate the Studio and you will get everything: the dependencies, the data, the code, etc. Finally, a benchmark you can reproduce yourself with a click!

0

1

0

59

Thomas Chaton @chaton_thomas

over 2 years ago

We just finished benchmarking cloud data-loading libraries over Imagenet 1.2M:
- Lightning AI Streaming Dataset
- Webdataset
- MosaicML Streaming

Conclusion: Lightning AI is the fastest (up to 80%) 🚀 https://t.co/R4O7lH0c8i

1

0

88

Thomas Chaton @chaton_thomas

over 2 years ago

Prepare a 1 trillion token dataset to train LLMs from scratch in under 4 hours instead of days with @LightningAI Studio! Everything is included: the final datasets, the code, the dependencies, etc. Get started in seconds, as no setup is needed. https://t.co/uHeGx0qTK1

0

4

0

84
