HPC-AI Lab

Victor.Kai Wang @VictorKaiWang1

8 months ago

Nice work of Ziming!

Ziming Liu @lzm_mlsys

8 months ago

🚀Serving MoE models made EASY and CHEAP!! We built EaaS — think of experts not as layers in a model, but as microservices you can spin up, replicate, or kill independently. No all-to-all. No static process groups. No system-wide crash when one GPU dies. Just: ⚙️ Clients (attention) ↔ Servers (experts) 🧠 Stateless → easy replication 📡 Asymmetric async P2P (no CPU involved!) 🧱 Fine-grained scaling without restarting and save real 💰！ Monolithic inference is over. Serving is becoming cloud-native. Preprint here → https://t.co/IRUxzoLmCF

lzm_mlsys's tweet photo. 🚀Serving MoE models made EASY and CHEAP!!

We built EaaS — think of experts not as layers in a model, but as microservices you can spin up, replicate, or kill independently.

No all-to-all. No static process groups. No system-wide crash when one GPU dies.

Just:

⚙️ Clients (attention) ↔ Servers (experts)
🧠 Stateless → easy replication
📡 Asymmetric async P2P (no CPU involved!)
🧱 Fine-grained scaling without restarting and save real 💰！
Monolithic inference is over. Serving is becoming cloud-native.
Preprint here → https://t.co/IRUxzoLmCF

1

4

0

2

3K

0

84

HPCAILab retweeted

about 1 year ago

Customizing Your LLMs in seconds using prompts🥳! Excited to share our latest work with @HPCAILab, @VITAGroupUT, @k_schuerholt, @YangYou1991, @mmbronstein, @damianborth : Drag-and-Drop LLMs(DnD). 2 features: tuning-free, comparable or even better than full-shot tuning.(🧵1/8)

5

113

75

61

18K

HPCAILab retweeted

Ziqiao Wang @ZiqiaoWang63428

about 1 year ago

Representation Alignment (REPA) is NOT ALWAYS helpful for diffusion training!🤷 Sharing latest work w/ @HPCAILab and @VITAGroupUT: "REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training". Acceleration up to 28x w/o performance drop.(🧵1/7)

ZiqiaoWang63428's tweet photo. Representation Alignment (REPA) is NOT ALWAYS helpful for diffusion training!🤷
Sharing latest work w/ @HPCAILab and @VITAGroupUT: "REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training".
Acceleration up to 28x w/o performance drop.(🧵1/7) https://t.co/0kNMA0e7xx

1

17

7

4K

Who to follow

Ning Ding

@stingning

Researcher of AI. Assistant Professor @Tsinghua_Uni. Working on scalable methods of language and physical models.

yi

@agihippo

I'm a nice and friendly hippoty.

Victor.Kai Wang

@VictorKaiWang1

Principal Researcher at Tencent HY, Prev. Ph.D. student (学渣) at NUS, focus on parameter generation.

Zekai Li @Richard91316073

over 1 year ago

Congrats to our lab member @lzm_mlsys !

Ziming Liu @lzm_mlsys

over 1 year ago

🚀Towards efficient Diffusion Transformers! 😆We are happy to introduce RAS, the first diffusion sampling strategy that allows for regional variability in sampling ratios, achieving up to 2x+ speedup! 🔌Training-free, plug and play! 💪Nice work with @MSFTResearch @YangYou1991 @Yif_Yang et al. 📜Paper: https://t.co/IHDk6ihhMz 📖Blog: https://t.co/KZ0eoje3YN ⌨️Code: https://t.co/LtzcJyHJiS (1/5)

6

186

40

103

18K

0

190

HPCAILab retweeted

over 1 year ago

🚀 We’re thrilled to introduce DD-Ranking: Rethinking the Evaluation of Dataset Distillation—a unified, user-friendly, and long-term maintained benchmark for dataset distillation (DD)! 📢 Together with 20 institutions worldwide, we’re releasing our Repo, API Documentation, and Leaderboard: 🔗 Repo: https://t.co/n2WT5ziaJJ 📖 Documentation: https://t.co/9nqogEM350 🏆 Leaderboard: https://t.co/tAHZKXOLH1 🧵1/7

Richard91316073's tweet photo. 🚀 We’re thrilled to introduce DD-Ranking: Rethinking the Evaluation of Dataset Distillation—a unified, user-friendly, and long-term maintained benchmark for dataset distillation (DD)!
📢 Together with 20 institutions worldwide, we’re releasing our Repo, API Documentation, and Leaderboard:
🔗 Repo: https://t.co/n2WT5ziaJJ
📖 Documentation: https://t.co/9nqogEM350
🏆 Leaderboard: https://t.co/tAHZKXOLH1 🧵1/7

1

20

16

7

5K

HPCAILab retweeted

Victor.Kai Wang @VictorKaiWang1

over 1 year ago

Generating ~200 million parameters in just minutes! 🥳 Excited to share our work with @MTDovent , @heisejiasuo96 , and @YangYou1991: 'Recurrent Diffusion for Large-Scale Parameter Generation' (RPG for short). Example: Obtain customized models using prompts (see below). (🧵1/8)

4

285

86

219

46K

HPCAILab retweeted

Yang Luo @YangL_7

over 1 year ago

Training-free Video Enhancement: Achieved 🎉 Nice work with @oahzxl @shaowenqi126301 @VictorKaiWang1 @VitaGroupUT @YangYou1991 et al. Non-trivial enhancement, training-free, and plug-and-play 🥳 Blog: https://t.co/8Cz78M0L7v (🧵1/6)

9

250

79

174

47K

HPCAILab retweeted

Zangwei Zheng @ZangweiZheng

over 2 years ago

Exciting News from Open-Sora! 🚀 They've just made the ENTIRE suite of their video-generation model open source! Dive into the world of cutting-edge AI with access to model weights, comprehensive training source code, and detailed architecture insights. Start building your dream video-generation model today! Check it out 👉 https://t.co/12Fl4ZysIG

15

609

149

392

246K

HPCAILab retweeted

AK

@_akhaliq

over 2 years ago

Neural Network Diffusion Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also generate high-performing neural network parameters. Our approach is simple, utilizing an autoencoder and a standard latent diffusion model. The autoencoder extracts latent representations of a subset of the trained network parameters. A diffusion model is then trained to synthesize these latent parameter representations from random noise. It then generates new representations that are passed through the autoencoder's decoder, whose outputs are ready to use as new subsets of network parameters. Across various architectures and datasets, our diffusion process consistently generates models of comparable or improved performance over trained networks, with minimal additional cost. Notably, we empirically find that the generated models perform differently with the trained networks. Our results encourage more exploration on the versatile use of diffusion models.

23

1K

233

753

475K

HPCAILab retweeted

over 2 years ago

InfoBatch is accepted as Oral to ICLR'24! 🔥 InfoBatch prunes data on the fly and speedups 20%~40% on img classification, semantic segmentation, MAE, Diffusion, LLM instruction tunning.🧵3 ArXiv: https://t.co/3RKELMonnd Blog: https://t.co/tbLZMP7y8W Code: https://t.co/C1tznhNMnb

ZangweiZheng's tweet photo. InfoBatch is accepted as Oral to ICLR'24! 🔥 InfoBatch prunes data on the fly and speedups 20%~40% on img classification, semantic segmentation, MAE, Diffusion, LLM instruction tunning.🧵3

ArXiv: https://t.co/3RKELMonnd
Blog: https://t.co/tbLZMP7y8W
Code: https://t.co/C1tznhNMnb https://t.co/Vyz2OHhQzH

4

53

6

19

6K

HPCAILab retweeted

over 2 years ago

I am happy to share that our paper has been accepted by ICLR as an ORAL paper (1.2% acceptance rate). InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning https://t.co/YoY5voUhyh InfoBatch randomly prunes a portion of less informative samples based on the loss distribution and rescales the gradients of the remaining samples to approximate the original gradient. As a plug-and-play and architecture-agnostic framework, InfoBatch consistently obtains lossless training results on classification, semantic segmentation, vision pertaining, and instruction fine-tuning tasks. On real-world applications, InfoBatch can losslessly save 40% overall cost.

YangYou1991's tweet photo. I am happy to share that our paper has been accepted by ICLR as an ORAL paper (1.2% acceptance rate).

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning

https://t.co/YoY5voUhyh

InfoBatch randomly prunes a portion of less informative samples based on the loss distribution and rescales the gradients of the remaining samples to approximate the original gradient. As a plug-and-play and architecture-agnostic framework, InfoBatch consistently obtains lossless training results on classification, semantic segmentation, vision pertaining, and instruction fine-tuning tasks.

On real-world applications, InfoBatch can losslessly save 40% overall cost.

0

210

37

100

26K

over 2 years ago

📢 Join us for the HPC-AI Lab Public Seminar! 🔗 Registration: https://t.co/CZKcFKeZul 🗓️ Date/Time: 29 Nov. 2023 (Wednesday), 1 PM to 2 PM 📍 Online via Zoom

HPCAILab's tweet photo. 📢 Join us for the HPC-AI Lab Public Seminar!

🔗 Registration: https://t.co/CZKcFKeZul
🗓️ Date/Time: 29 Nov. 2023 (Wednesday), 1 PM to 2 PM
📍 Online via Zoom https://t.co/cMeJBgeKHc

0

1

3

0

445

HPCAILab retweeted

over 2 years ago

Time flies! I got my PhD from Berkeley 1218 days ago. My first PhD student is graduating. That is my first achievement :-)

6

360

5

9

40K

HPCAILab retweeted

over 2 years ago

Excited to share our #ICCV2023 paper: Fine-tuning Vision-Language Models without Zero-Shot Transfer Degradation (ZSCL). ZSCL outperforms the pre-trained model on downstream tasks and maintains its zero-shot transferability to other tasks. paper: https://t.co/snJVOKDmwj blog: https://t.co/Pe5l4Jx5vw Poster: 10:30am to 12:30pm, 6 Oct. Room Foyer Sud 143 https://t.co/cqSafX1izn

YangYou1991's tweet photo. Excited to share our #ICCV2023 paper: Fine-tuning Vision-Language Models without Zero-Shot Transfer Degradation (ZSCL). ZSCL outperforms the pre-trained model on downstream tasks and maintains its zero-shot transferability to other tasks.

paper: https://t.co/snJVOKDmwj
blog: https://t.co/Pe5l4Jx5vw
Poster: 10:30am to 12:30pm, 6 Oct. Room Foyer Sud 143 https://t.co/cqSafX1izn

0

18

4

3K

HPCAILab retweeted

Victor.Kai Wang @VictorKaiWang1

over 2 years ago

Excited to introduce our #ICCV2023 paper Dataset Quantization (DQ). DQ achieves lossless training performances with 2% data keep ratio on language tasks and 60% data keep ratio on vision tasks. Just check out our paper and project: https://t.co/HNQDiRmdFf https://t.co/2ZrudNyhQw

YangYou1991's tweet photo. Excited to introduce our #ICCV2023 paper Dataset Quantization (DQ). DQ achieves lossless training performances with 2% data keep ratio on language tasks and 60% data keep ratio on vision tasks. Just check out our paper and project:
https://t.co/HNQDiRmdFf

https://t.co/2ZrudNyhQw https://t.co/kDd2itGc16

1

16

2

3

2K

HPCAILab retweeted

almost 3 years ago

Our work DREAM has been accepted by ICCV-2023 @ICCVConference. We are the first to explore the matching efficiency in dataset distillation and speed up the previous works more than 8 times without performance drop. Check out our DREAM in Github: https://t.co/bjtzr8BwN2

VictorKaiWang1's tweet photo. Our work DREAM has been accepted by ICCV-2023 @ICCVConference. We are the first to explore the matching efficiency in dataset distillation and speed up the previous works more than 8 times without performance drop.
Check out our DREAM in Github: https://t.co/bjtzr8BwN2 https://t.co/ZlGgWhFRZC

0

9

1

0

847

almost 3 years ago

🎉 Exciting news! Our Lab has two papers accepted at ACL 2023! 📚✨ We're thrilled to announce that our CAME optimizer has won the Outstanding Paper award! 🏆 Congratulations to the entire team for their remarkable achievement! 🥳 #ACL2023

HPCAILab's tweet photo. 🎉 Exciting news! Our Lab has two papers accepted at ACL 2023! 📚✨ We're thrilled to announce that our CAME optimizer has won the Outstanding Paper award! 🏆 Congratulations to the entire team for their remarkable achievement! 🥳 #ACL2023 https://t.co/GhVSajz0IC

0

12

3

1

952

HPCAILab retweeted

almost 3 years ago

I am happy to share that our paper won the Outstanding Paper Award of ACL. We propose CAME to simultaneously achieve two goals: fast convergence as in traditional adaptive methods, and low memory usage as in LLM training. https://t.co/kiu6jrJCSu

3

148

12

24

34K