@giffmana@sshkhr16 Had no doubts about the author of big_vision :)
I heard the same from people that went GMD -> A\. Were missing tools (XManager FTW), but quickly built better versions.
At one point, Malachowsky and Priem wanted to call the company NVision, but that name was already taken by a manufacturer of toilet paper. Both Priem and Huang have taken credit for coming up with the name Nvidia, from "invidia", the Latin word for "envy".
Building momentum at Marin! Upgrading from Dense -> 129B parameter MoEs -> architecture improvements -> optimizer improvements gives our pretraining recipe an estimated 6x cumulative learning speedup, accounting for MFU. Includes community contributions. https://t.co/5dPB9uBiSp
My MLSys keynote on AI writing systems code got more interest than I expected. The recording will take a while, so in the finest tradition of AI labs sharing blog posts, we’re starting the Core Automation Blog with this one https://t.co/h4uSOyrglf
@marksaroufim "I was told that the recording wouldn’t be available for another few months" great potential for using AI for that :)
Thanks for the great article!