New @nytimes op-ed by @BernieSanders calls for sovereign wealth fund tied to the stock of frontier labs.
Whether or not you back this idea, his closing is a reminder that people support what they help build. Absent more meaningful mechanisms for people to share their views on AI and shape its development, the backlash will grow and “missed uses” will become the default (ie we will fail to realize the most beneficial uses of AI).
“It must be decided by workers, parents, teachers, artists, scientists, communities and the American people. It’s our future. We must decide it.”
Anthropic has confidentially submitted a draft S-1 registration statement to the Securities and Exchange Commission.
Pending completion of SEC review, this gives us the option to pursue an initial public offering.
Read more: https://t.co/onGZAhRLvD
Improving CPU speed by 10x should not affect training speed essentially at all.
The CPU's main job is to kick off the real work on the GPU. If your kernels are sane (fused etc), the time to launch a kernel on the CPU is <<1% of the kernel runtime, even in Python.
SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible.
The potential speed improvement vs JAX for large training runs is over an order of magnitude.
@jonmasters@lauriewired@FritzchensFritz what i've always wondered is... does scheduling stuff actually reveal materially interesting ip? why would intel ever care what amd's latencies are, they have their own pipelines to work on
feels like old company habits dying hard
@bubbleboi in the right places certainly yes! replacing JAX is not the right place, replacing NCCL is. this won't get you 10x
> how do you coordinate and schedule compute between nodes efficiently
this is handled by the user (writing code in JAX) for the most part. not something C related
Also new in Claude Code: dynamic workflows (research preview).
For the hardest tasks, Claude makes a plan, runs hundreds of parallel subagents, and verifies its work before reporting back. Think a migration touching hundreds of files.
Read more: https://t.co/7gt06kGkDN
@LiuYunlong63318 yea pipelining is a really good usecase for a compiler. writing the code by hand, especially when supporting multiple pipeline schedules, is nightmare material