Marq @dev_null321 - Twitter Profile

about 14 hours ago

@elliotarledge So you think he’s right? Your recommendation is to give up on keeping up, pick a thing, specialize in that thing, and ignore the rest?

0

1

0

601

Marq

@dev_null321

about 14 hours ago

Kissland 2.0

The Weeknd

@theweeknd

about 14 hours ago

Susumu Hirasawa 🪄

464

55K

8K

4K

2M

0

96

dev_null321 retweeted

Wise

@trikcode

1 day ago

You basically need to be unemployed to keep up with all this AI stuff.

427

8K

596

372

246K

dev_null321 retweeted

Phil Hoyeck

@PAHoyeck

about 16 hours ago

The Liar, Lunatic, or Lord Argument from C.S. Lewis' Mere Christianity (1952)

11

92

8

6

5K

Who to follow

Steve Burkhart

@juststeve84

41•Single•21+•NSFW•Allen, TX•MEN only!•#Divorced

ScorpVayne

@ScorpVayne

CEO | CIO | Cybersecurity // AI // Sustainable AI // Quantum 🔍 | 🐧 Never Stop Learning 💍@cirrus_traveler

petikvx

@petikvx

Malware Researcher Collecter - All my samples will be on https://t.co/ifIYiMAyVd

dev_null321 retweeted

Polymarket

@Polymarket

about 20 hours ago

JUST IN: Tesla’s unsupervised Robotaxi service is now available across the entire Austin geofence area.

76

2K

105

35

102K

dev_null321 retweeted

Muyu He

@HeMuyu0327

1 day ago

I am a big fan of Jianlin Su's blog because it always starts from first principles in mathematics, rather than "ML tricks", to approach a typical ML problem (eg. training-free MoE load balancing). Here is me trying to "reinvent" one such blog which provides an elegant alternative to compute Muon, by filling in all the derivations that the blog skips for a less math-savvy audience (besides being entirely in Mandarin). The goal of the blog is to find a way to compute a essential component of Muon, ie. the left and right singular value matrices U and V for the gradient G, **individually**. In the standard form, Muon really just needs their product UV^T, hence the standard way to compute it via computing a low-rank polynomial of G many times ("Newton-Schulz"). But there are more variants of Muon to control the properties of model updates if we can get both individually, hence the blog's proposal to revisit some fundamental linear algebra techniques for the computation. The methodological takeaway from the blog's thought process is that there are three components to breaking down a ML problem: (1) how to be able to compute something (power iteration), (2) how to compute it fast (cholesky decomposition), and (3) how to compute it accurately given finite floating points (repeated orthogonalization). The goal of reading inspiring blogs like this is, in Feynman's term, to be able to "reinvent" them at any time to grasp the fundamental approach of doing similar work. Original blog: https://t.co/5ksKPICpMW

HeMuyu0327's tweet photo. I am a big fan of Jianlin Su's blog because it always starts from first principles in mathematics, rather than "ML tricks", to approach a typical ML problem (eg. training-free MoE load balancing).

Here is me trying to "reinvent" one such blog which provides an elegant alternative to compute Muon, by filling in all the derivations that the blog skips for a less math-savvy audience (besides being entirely in Mandarin).

The goal of the blog is to find a way to compute a essential component of Muon, ie. the left and right singular value matrices U and V for the gradient G, **individually**. In the standard form, Muon really just needs their product UV^T, hence the standard way to compute it via computing a low-rank polynomial of G many times ("Newton-Schulz"). But there are more variants of Muon to control the properties of model updates if we can get both individually, hence the blog's proposal to revisit some fundamental linear algebra techniques for the computation.

The methodological takeaway from the blog's thought process is that there are three components to breaking down a ML problem: (1) how to be able to compute something (power iteration), (2) how to compute it fast (cholesky decomposition), and (3) how to compute it accurately given finite floating points (repeated orthogonalization). The goal of reading inspiring blogs like this is, in Feynman's term, to be able to "reinvent" them at any time to grasp the fundamental approach of doing similar work.

Original blog: https://t.co/5ksKPICpMW

10

2K

137

2K

71K

dev_null321 retweeted

Jay Stratton

@jaystratton

1 day ago

I’m excited to announce my memoir, Out of the Shadows, will be published by HarperCollins in North America on October 13, 2026. In the book, I break my silence to reveal everything I legally can about my investigations of UAP and non-human intelligent life on behalf of the U.S Government and the profound impact my work had on me and my family. We are at a turning point in human history and I am proud to play a role in opening the public’s eyes to the truth and bringing about long overdue disclosure.

jaystratton's tweet photo. I’m excited to announce my memoir, Out of the Shadows, will be published by HarperCollins in North America on October 13, 2026. In the book, I break my silence to reveal everything I legally can about my investigations of UAP and non-human intelligent life on behalf of the U.S Government and the profound impact my work had on me and my family. We are at a turning point in human history and I am proud to play a role in opening the public’s eyes to the truth and bringing about long overdue disclosure.

478

5K

849

1K

407K

Marq

@dev_null321

2 days ago

Done scamming people with your crypto huh?

tetsuo

@tetsuoai

2 days ago

I can’t sleep at night because my mind races with all the cool shit I could be building. AI has turned my workdays into 24 hour grind sessions. I code until I literally collapse from exhaustion 7 days a week.

tetsuoai's tweet photo. I can’t sleep at night because my mind races with all the cool shit I could be building. AI has turned my workdays into 24 hour grind sessions. I code until I literally collapse from exhaustion 7 days a week. https://t.co/PLtEXHTQTP

121

447

36

50

24K

0

27

Marq

@dev_null321

3 days ago

@sharno3 @VictorTaelin @m_aggan This is actually a good idea.

0

1

0

27

dev_null321 retweeted

Alex Drath @drathdmr

3 days ago

You will never feel ready because ready isn’t a feeling, it’s a decision.

0

6

2

1

428

dev_null321 retweeted

Phil Hoyeck

@PAHoyeck

3 days ago

The next discussion of Wittgenstein's Tractatus Logico-Philosophicus will take place this evening at 6:00 PM EDT. @mishapathy and I will be discussing 4.12 to 4.53. Hope to see some of you there! https://t.co/9zoHwB0I5s

3

28

4

7

2K

Marq

@dev_null321

3 days ago

@hamadwaqar9 @polydao Yes.

0

128

Marq

@dev_null321

3 days ago

As long as we are using the same rocketry we will not explore the stars. Even nuclear engines are better than what we use now.

𝚟𝚒𝚎 ⟢

@viemccoy

3 days ago

As long as you live relatively healthy and in proximity to San Francisco, the odds are much higher than expected that you are in fact *not* born too early to explore the stars. If we're lucky, you were actually born at the exact right moment. And I'm feeling pretty damn lucky

13

150

4

14

5K

0

32

dev_null321 retweeted

Anthropic

@AnthropicAI

3 days ago

Anthropic has confidentially submitted a draft S-1 registration statement to the Securities and Exchange Commission. Pending completion of SEC review, this gives us the option to pursue an initial public offering. Read more: https://t.co/onGZAhRLvD

977

22K

3K

20M

Marq

@dev_null321

4 days ago

Fucking terrible take.

𝚟𝚒𝚎 ⟢

@viemccoy

5 days ago

hot take: qualia isn't real and the soul is part of the body

33

148

4

13

8K

1

0

72

Marq

@dev_null321

4 days ago

@Hikari_07_jp Get the 512. Future proof yourself.

1

0

77

dev_null321 retweeted

Ruben Laukkonen

@RubenLaukkonen

5 days ago

Abstraction is insufficient for consciousness.

23

508

81

284

25K

dev_null321 retweeted

DailyPapers

@HuggingPapers

6 days ago

NVIDIA just released a quantized Qwen3.6 MoE model on Hugging Face 35B total, 3B active parameters NVFP4 shrinks memory ~3x with near-zero accuracy loss

HuggingPapers's tweet photo. NVIDIA just released a quantized Qwen3.6 MoE model on Hugging Face

35B total, 3B active parameters

NVFP4 shrinks memory ~3x with near-zero accuracy loss https://t.co/7UOF7rBz01

24

965

79

726

57K

Marq

@dev_null321

5 days ago

@usr_bin_roygbiv Love this.

0

1

0

963

dev_null321 retweeted

BlackwellBoy

@SlimTradeyBaby

5 days ago

As promised 🙏 This is what $billions in AI infra actually looks like on the floor not in a keynote, not in a brochure. NVIDIA DGX B300 racks. Compute as far as the eye can see. Most people never get within 500 metres of this stuff. This is what the future actually looks. 🔥

SlimTradeyBaby's tweet photo. As promised 🙏

This is what $billions in AI infra actually looks like on the floor not in a keynote, not in a brochure.

NVIDIA DGX B300 racks. Compute as far as the eye can see. Most people never get within 500 metres of this stuff.

This is what the future actually looks. 🔥 https://t.co/qqku5bVAgC

7

86

5

4

6K

Marq

@dev_null321

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users