Jun Liu @junliume - Twitter Profile

JunliuMe retweeted

23 days ago

After some mathematical rewrite, turns out all of transformer is a series of gemm + epilogue. Given a few optimized primitives, LLMs (and novice humans) can write speed-of-light kernels for all transformer ops!

17

1K

128

946

132K

JunliuMe retweeted

DeepSeek

@deepseek_ai

about 2 months ago

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: https://t.co/drlDrxkYtp 🤗 Open Weights: https://t.co/T13Y8i7SDM 1/n

deepseek_ai's tweet photo. 🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.

Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!

📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM

1/n

2K

46K

8K

10K

10M

Jun Liu @JunliuMe

6 months ago

#mcclellan02 Monitor HABIT restoration

0

1

0

23

Jun Liu @JunliuMe

8 months ago

@DHSgov Wait … seriously, this is NOT a parody account?

0

1

0

5

Who to follow

tjmurphy50

@tjmurphy501

An alter-ego letting opinions fly & paying homage to the greatest movie ever made! He who has the gold makes the rules.

Rj ❇️🔺

@RJinvestcoin

Crypto to Altseason super crazy bullrun with OGs, AI, Gaming and 2 memes.

Amir Ayupov

@disruptnhandlr

LLVM BOLT @ Meta, ex-CPU R&D (APX) @ Intel

Jun Liu @JunliuMe

8 months ago

Church and state, politicians and free market. Has there been any checks on the system the directly link a politician’s random thoughts on the huge influence of stock market?

0

3

0

44

Jun Liu @JunliuMe

8 months ago

@elonmusk The strength of diversity is exactly that none sense like “diversity is not your strength” is spoken of without fear.

0

1

0

16

Jun Liu @JunliuMe

9 months ago

@sama Oh well, for someone that’s getting a Pulse score every day … interesting

0

1

0

12

JunliuMe retweeted

CBP

@CBP

9 months ago

Let’s set the record straight: President Trump’s updated H-1B visa requirement applies only to new, prospective petitions that have not yet been filed. Petitions submitted prior to September 21, 2025 are not affected. Any reports claiming otherwise are flat-out wrong and should be ignored.

CBP's tweet photo. Let’s set the record straight: President Trump’s updated H-1B visa requirement applies only to new, prospective petitions that have not yet been filed. Petitions submitted prior to September 21, 2025 are not affected. Any reports claiming otherwise are flat-out wrong and should be ignored.

3K

9K

2K

2M

JunliuMe retweeted

Ryan Saavedra

@RyanSaavedra

9 months ago

This is actually a good statement from Bernie Sanders Worth the watch

4K

80K

11K

18K

6M

Jun Liu @JunliuMe

9 months ago

@elonmusk So … they know what they did and happened to locals last time, so they are “concerned” about what is going to happen this time?

0

47

JunliuMe retweeted

Qwen

@Alibaba_Qwen

11 months ago

🚀 Qwen3-30B-A3B Small Update: Smarter, faster, and local deployment-friendly. ✨ Key Enhancements: ✅ Enhanced reasoning, coding, and math skills ✅ Broader multilingual knowledge ✅ Improved long-context understanding (up to 256K tokens) ✅ Better alignment with user intent and open-ended tasks ✅ No more <think> blocks — now operating exclusively in non-thinking mode 🔧 With 3B activated parameters, it's approaching the performance of GPT-4o and Qwen3-235B-A22B Non-Thinking Qwen Chat: https://t.co/B9SNZr366l HF:https://t.co/OglPSNk8lz or https://t.co/oF4NS4yJg0 ModelScope: https://t.co/PwWUW02Pgd or https://t.co/T7aw4V5iVE

Alibaba_Qwen's tweet photo. 🚀 Qwen3-30B-A3B Small Update: Smarter, faster, and local deployment-friendly.

✨ Key Enhancements:
✅ Enhanced reasoning, coding, and math skills
✅ Broader multilingual knowledge
✅ Improved long-context understanding (up to 256K tokens)
✅ Better alignment with user intent and open-ended tasks
✅ No more <think> blocks — now operating exclusively in non-thinking mode

🔧 With 3B activated parameters, it's approaching the performance of GPT-4o and Qwen3-235B-A22B Non-Thinking

Qwen Chat: https://t.co/B9SNZr366l

HF:https://t.co/OglPSNk8lz or https://t.co/oF4NS4yJg0

ModelScope: https://t.co/PwWUW02Pgd or https://t.co/T7aw4V5iVE

79

2K

266

551

390K

JunliuMe retweeted

Zhihao Jia

@JiaZhihao

12 months ago

One of the best ways to reduce LLM latency is by fusing all computation and communication into a single GPU megakernel. But writing megakernels by hand is extremely hard. 🚀Introducing Mirage Persistent Kernel (MPK), a compiler that automatically transforms LLMs into optimized megakernel, reducing latency by 1.2-6.7x. 🔧Tool: https://t.co/mRJ8sSg7HX 📝Blog: https://t.co/97b0YRSrS6

JiaZhihao's tweet photo. One of the best ways to reduce LLM latency is by fusing all computation and communication into a single GPU megakernel. But writing megakernels by hand is extremely hard.

🚀Introducing Mirage Persistent Kernel (MPK), a compiler that automatically transforms LLMs into optimized megakernel, reducing latency by 1.2-6.7x.

🔧Tool: https://t.co/mRJ8sSg7HX
📝Blog: https://t.co/97b0YRSrS6

17

772

122

569

84K

Jun Liu @JunliuMe

over 1 year ago

@Jiankui_He I cannot disagree more. If one says “regulations” there might be rooms for a constructive discussion but here “Ethics” is non-negotiable in so many levels.

0

28

JunliuMe retweeted

Tri Dao

@tri_dao

over 1 year ago

I've been excited about this for a while: a simple architectural change to the residual connection that allows arbitrary overlapping of computation of one layer and the communication of another layer, leading to ~30% speedup in TP! More on MoE and expert parallel to come soon!

3

503

64

236

52K

Jun Liu @JunliuMe

over 1 year ago

@BillAckman Who would put a helicopter route right at the gliding path of commercial airways?!

0

207

Jun Liu @JunliuMe

over 1 year ago

New year, and a new chapter is about to start.

0

35

JunliuMe retweeted

Dylan Patel

@dylan522p

over 1 year ago · Georgia

Met with @LisaSu today for 1.5 hours as we went through everything She acknowledged the gaps in AMD software stack She took our specific recommendations seriously She asked her team and us a lot of questions Many changes are in flight already! Excited to see improvements coming

84

3K

193

622

597K

JunliuMe retweeted

Tri Dao

@tri_dao

over 1 year ago

A strong Mamba2 hybrid model, competitive with transformers trained on 7x more data. Next step: we’ll make Mamba inference really fly, especially for large batch and long context

5

257

44

50

22K

JunliuMe retweeted

CNBCOvertime

@CNBCOvertime

over 1 year ago

AMD CEO Lisa Su explains how ROCm fits into AMD's #AI performance and how partnerships may increase ROCm's impact. #chips #software $AMD @JonFortt $NVDA #CUDA

2

17

3

2

3K

JunliuMe retweeted

Lisa Su

@LisaSu

over 1 year ago

Big day at #SC24!! Excited to announce El Capitan, powered by @AMD is now the world's fastest supercomputer at 1.742 exaflops! We now power 5 of the top 10 and 21 of the top 50 supercomputers in the world. Thanks to @HPE, @Livermore_Lab, @ENERGY for their partnership!

LisaSu's tweet photo. Big day at #SC24!! Excited to announce El Capitan, powered by @AMD is now the world's fastest supercomputer at 1.742 exaflops! We now power 5 of the top 10 and 21 of the top 50 supercomputers in the world. Thanks to @HPE, @Livermore_Lab, @ENERGY for their partnership! https://t.co/VRZFK4Gnn1

73

1K

167

42

83K

Jun Liu

@JunliuMe

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users