Clive Chan @itsclivetime - Twitter Profile

New @nytimes op-ed by @BernieSanders calls for sovereign wealth fund tied to the stock of frontier labs. Whether or not you back this idea, his closing is a reminder that people support what they help build. Absent more meaningful mechanisms for people to share their views on AI and shape its development, the backlash will grow and “missed uses” will become the default (ie we will fail to realize the most beneficial uses of AI). “It must be decided by workers, parents, teachers, artists, scientists, communities and the American people. It’s our future. We must decide it.”

KevinTFrazier's tweet photo.

21

86

13

48

79K

8

33

1

9

10K

Clive Chan

@itsclivetime

2 days ago

most confidential filing in the history of filings, maybe ever

Anthropic

@AnthropicAI

2 days ago

Anthropic has confidentially submitted a draft S-1 registration statement to the Securities and Exchange Commission. Pending completion of SEC review, this gives us the option to pursue an initial public offering. Read more: https://t.co/onGZAhRLvD

971

22K

3K

20M

7

198

1

15

25K

Clive Chan

@itsclivetime

5 days ago

@tchirnhaus20039 a single call into the driver queues hundreds of kernels in a cuda graph, instead of just one in a normal kernel launch

0

44
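A toy model of the point in the reply above, counting host-side submission calls only. `ToyDriver`, `launch`, and `launch_graph` are hypothetical names made up for this sketch and are not a real driver API; in the actual CUDA runtime the graph-side call is `cudaGraphLaunch`. The kernel count of 300 is an assumed stand-in for "hundreds".

```python
class ToyDriver:
    """Hypothetical stand-in for a GPU driver that counts host-side calls."""

    def __init__(self):
        self.calls = 0      # number of times the host called into the driver
        self.queued = []    # kernels queued for execution, in order

    def launch(self, kernel):
        # Normal launch: one driver call queues exactly one kernel.
        self.calls += 1
        self.queued.append(kernel)

    def launch_graph(self, kernels):
        # Graph launch: one driver call queues every kernel in the graph.
        self.calls += 1
        self.queued.extend(kernels)


# Assumed workload: 300 kernels.
kernels = [f"kernel_{i}" for i in range(300)]

eager = ToyDriver()
for k in kernels:
    eager.launch(k)

graphed = ToyDriver()
graphed.launch_graph(kernels)

print(eager.calls, graphed.calls)  # 300 1
```

Both drivers end up with the same queued work; only the number of host calls differs, which is the per-launch CPU overhead a graph amortizes.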

6 days ago

Improving CPU speed by 10x should not affect training speed essentially at all. The CPU's main job is to kick off the real work on the GPU. If your kernels are sane (fused etc), the time to launch a kernel on the CPU is <<1% of the kernel runtime, even in Python.

Elon Musk

@elonmusk

7 days ago

SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible. The potential speed improvement vs JAX for large training runs is over an order of magnitude.

7K

98K

11K

7K

30M

38

531

20

172

129K
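A back-of-the-envelope check of the launch-overhead claim above, using Amdahl's law. The 1% figure is taken as an upper bound from "<<1%", and the sketch assumes, pessimistically, that launch time is fully serialized with GPU work rather than overlapped. `step_speedup` is a name chosen for this illustration.

```python
def step_speedup(cpu_fraction: float, cpu_speedup: float) -> float:
    """Amdahl's-law speedup when only the CPU launch portion gets faster."""
    return 1.0 / ((1.0 - cpu_fraction) + cpu_fraction / cpu_speedup)


# Assumed: launch overhead is 1% of step time and is not overlapped.
print(f"{step_speedup(0.01, 10.0):.4f}x")          # 1.0091x
# Even an infinitely fast CPU is capped at 1 / 0.99.
print(f"{step_speedup(0.01, float('inf')):.4f}x")  # 1.0101x
```

Under these assumptions a 10x faster CPU buys under 1% end to end, consistent with "essentially not at all". Because kernel launches are normally asynchronous, the real gain would likely be smaller still.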

Clive Chan

@itsclivetime

5 days ago

@FelixCLC_ @jonmasters @lauriewired @FritzchensFritz interesting!

0

1

0

55

Clive Chan

@itsclivetime

5 days ago

@jonmasters @lauriewired @FritzchensFritz what i've always wondered is... does scheduling stuff actually reveal materially interesting ip? why would intel ever care what amd's latencies are, they have their own pipelines to work on

feels like old company habits dying hard

2

5

0

351

Clive Chan

@itsclivetime

6 days ago

@xiaosun86 @uncledoomer underrated comment

0

1

0

128

Clive Chan

@itsclivetime

6 days ago

@tenderizzation 10 wide ooo cpus are literal witchcraft and i refuse to believe otherwise

2

27

0

2

2K

itsclivetime retweeted

Hieu Pham

@hyhieu226

6 days ago

https://t.co/y0tc2tyjgE 😂

6

76

5

13

35K

Clive Chan

@itsclivetime

6 days ago

@hyhieu226 LMAO

0

4

0

2K

Clive Chan

@itsclivetime

6 days ago

@bubbleboi yes, 100%

elon's post is just about replacing JAX entirely though. which is not the relevant part

1

12

0

1

2K

Clive Chan

@itsclivetime

6 days ago

@bubbleboi in the right places certainly yes! replacing JAX is not the right place, replacing NCCL is. this won't get you 10x

> how do you coordinate and schedule compute between nodes efficiently

this is handled by the user (writing code in JAX) for the most part. not something C related

2

21

2

5

6K

Clive Chan

@itsclivetime

6 days ago

@jedpolglase i mean these are all pretty straightforward to implement in python no? not sure why C comes into the picture

4

5

0

1

2K

Clive Chan

@itsclivetime

6 days ago

@sakurayukiai yep, definitely makes sense for low batch inference. but surely jax already can do cudagraphs, it's been around for many years

0

1

0

938

Clive Chan

@itsclivetime

6 days ago

>hundreds of parallel subagents

gonna hit your quota in like 3 seconds

Claude

@claudeai

6 days ago

Also new in Claude Code: dynamic workflows (research preview). For the hardest tasks, Claude makes a plan, runs hundreds of parallel subagents, and verifies its work before reporting back. Think a migration touching hundreds of files. Read more: https://t.co/7gt06kGkDN

96

3K

190

1K

849K

8

118

3

9

17K

Clive Chan

@itsclivetime

6 days ago

@LiuYunlong63318 yea pipelining is a really good usecase for a compiler. writing the code by hand, especially when supporting multiple pipeline schedules, is nightmare material

2

43

0

13

67K