Ionut-Vlad Modoranu @ionutmodo - Twitter Profile

4 months ago

We're releasing DASH (Distributed Accelerated Shampoo), an improved implementation of the Shampoo optimizer that achieves up to 4.83× faster optimizer steps, while matching or improving final model quality. [1/6]

DAlistarh's tweet photo. We're releasing DASH (Distributed Accelerated Shampoo), an improved implementation of the Shampoo optimizer that achieves up to 4.83× faster optimizer steps, while matching or improving final model quality.
[1/6] https://t.co/MO7FZ3wpCk

1

66

13

26

3K

Ionut-Vlad Modoranu @ionutmodo

7 months ago

@OncelTuzel Hi, @OncelTuzel! Thank you for posting this announcement! I have just applied to this job as it fits my PhD expertise.

0

1

0

789

Ionut-Vlad Modoranu @ionutmodo

7 months ago

@calcsam BOOK

0

3

Ionut-Vlad Modoranu @ionutmodo

8 months ago

@asmah2107 Hi! I'm currently a PhD student working in optimizers for deep learning. I'm focused on decreasing the memory usage and speeding up the runtime.

0

6

Who to follow

Mahdi Soltanolkotabi

@mahdisoltanol

foundations of AI, opt/prob/stats multimodal eval/reasoning, AI4math/science/med Prof & director @USC AIF4S center+Research Scientist @Google 🚲🏔️🏃🏊‍♂️🥾⛷️

Ionut-Vlad Modoranu @ionutmodo

8 months ago

@iamgrigorev This is a great tip for productivity! Please check out our GridSearcher project, it is completely written in Python and allows you to run jobs in parallel or sequentially using dictionaries for hyper-parameters by employing a basic scheduling: https://t.co/t4dUUuxs5g

0

1

0

64

ionutmodo retweeted

Dan Alistarh @DAlistarh

about 1 year ago

Our QuEST paper was selected for Oral Presentation at ICLR @sparseLLMs workshop! QuEST is the first algorithm with Pareto-optimal LLM training for 4bit weights/activations, and can even train accurate 1-bit LLMs. Paper: https://t.co/ulbX5D5LjD Code: https://t.co/OYfjs4jdDp

3

31

9

5

3K

ionutmodo retweeted

Dan Alistarh @DAlistarh

9 months ago

We're releasing the DASLab GGUF Quantization Toolkit! 🚀 First open-source toolkit bringing GPTQ + EvoPress to @ggerganov's GGUF format, enabling heterogeneous quantization based on importance. Result: Better models at the same file size. [1/5]

DAlistarh's tweet photo. We're releasing the DASLab GGUF Quantization Toolkit! 🚀
First open-source toolkit bringing GPTQ + EvoPress to @ggerganov's GGUF format, enabling heterogeneous quantization based on importance.
Result: Better models at the same file size.
[1/5] https://t.co/yfLhQYf74X

4

267

50

163

66K

ionutmodo retweeted

Dan Alistarh @DAlistarh

9 months ago

🚀 We are releasing state-of-the-art post-training quantization (PTQ) algorithms for Microscaling FP4, together with kernels: - First study focused on MXFP4/NVFP4 PTQ for LLMs - New Micro-Rotated (MR) format and GPTQ algorithm - QuTLASS GPU kernels with up to 3.6x speedups.

DAlistarh's tweet photo. 🚀 We are releasing state-of-the-art post-training quantization (PTQ) algorithms for Microscaling FP4, together with kernels:
- First study focused on MXFP4/NVFP4 PTQ for LLMs
- New Micro-Rotated (MR) format and GPTQ algorithm
- QuTLASS GPU kernels with up to 3.6x speedups. https://t.co/zJSU7Ufrxk

2

150

27

75

9K

Ionut-Vlad Modoranu @ionutmodo

9 months ago

@KwangjunA This seems to be similar to our recently shared work, where we use Discrete Cosine Transform (DCT) to perform a cheap low-rank projection of the momentum buffer, followed by Newton-Schulz orthogonalization. Check out our paper here: https://t.co/Sao5sBDDTu

1

0

62

ionutmodo retweeted

Dan Alistarh @DAlistarh

9 months ago

Paper: https://t.co/9jwxD1tU92 We release vLLM & HuggingFace integrations. Code: https://t.co/pvkXTyDlcO Kernels: https://t.co/KjO04HAHyq Credit goes to the team: @RobertoL_Castro, Vage, Denis, @black_samorez and @AshkboosSaleh, with support from @RedHat_AI and @thoefler

0

12

5

3

1K

Ionut-Vlad Modoranu @ionutmodo

9 months ago

@OptionsBuffett Trade

3

0

13

Ionut-Vlad Modoranu @ionutmodo

over 1 year ago

@deedydas DeepSeek gets it right:

0

14

ionutmodo retweeted

Dan Alistarh @DAlistarh

about 2 years ago

Introducing Panza, a personalized LLM email assistant, running entirely on-device! [1/6] * Panza adapts LLaMA-3-8B to match your unique writing style; * Can be fine-tuned and executed on a single GPU (free Colab version available!). Give it a try: https://t.co/JjyYT9Mrun

DAlistarh's tweet photo. Introducing Panza, a personalized LLM email assistant, running entirely on-device!
[1/6]
* Panza adapts LLaMA-3-8B to match your unique writing style;
* Can be fine-tuned and executed on a single GPU (free Colab version available!).
Give it a try:
https://t.co/JjyYT9Mrun https://t.co/qsRphMsG9W

5

139

30

164

39K

Ionut-Vlad Modoranu

@ionutmodo

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users