Tom St. John @tstjohn_hpcml - Twitter Profile

The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable & general AI model. Built to be natively multimodal, it can understand many types of info. Efficient & flexible, it comes in 3 sizes each best-in-class & optimized for different uses https://t.co/VUu1277bC2

demishassabis's tweet photo. The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable & general AI model. Built to be natively multimodal, it can understand many types of info. Efficient & flexible, it comes in 3 sizes each best-in-class & optimized for different uses https://t.co/VUu1277bC2 https://t.co/pKyBxXwdYw

380

11K

2K

1K

3M

tstjohn_hpcml retweeted

MLCommons @MLCommons

over 2 years ago

Introducing the AlgoPerf: Training Algorithms Benchmark! Compete for a share of the $50,000 prize pool by submitting more effective and efficient neural network training algorithms. Learn more https://t.co/csfXkUCKbN #Algorithms #MachineLearning #Competition

0

41

14

10

13K

Who to follow

Verónica Vergara (she/her/ella)

@verolero86

Kanjun 🐙

@kanjun

helping humans fight Moloch. CEO @imbue_ai. support founders @outsetcap.

MediaTek India

@MediaTekIndia

Welcome to the MediaTek India Twitter page. For global news, follow @MediaTek

tstjohn_hpcml retweeted

MLCommons @MLCommons

over 2 years ago

SC23 attendees join MLCommons BOF sessions to add your voice to the @MLCommons community. Wed, 11/15, 5:15pm in Rm 601-603 MLPerf: A Benchmark for Machine Learning, or in Rm 702 join the conversation around the Future of Benchmarks in Supercomputing. https://t.co/xjuOrQqXjW #SC23

MLCommons's tweet photo. SC23 attendees join MLCommons BOF sessions to add your voice to the @MLCommons community. Wed, 11/15, 5:15pm in Rm 601-603 MLPerf: A Benchmark for Machine Learning, or in Rm 702 join the conversation around the Future of Benchmarks in Supercomputing. https://t.co/xjuOrQqXjW #SC23 https://t.co/r2qapTggah

0

5

2

1

912

tstjohn_hpcml retweeted

Google AI

@GoogleAI

over 2 years ago

Today on the blog, learn how we’re supporting a new effort by the non-profit MLCommons Association that aims to bring together expert researchers across academia and industry to develop standard AI safety benchmarks that everyone can use and understand. ↓ https://t.co/stExJ386UD

6

168

47

14

55K

Tom St. John @tstjohn_hpcml

over 2 years ago

@ShriramKMurthi Based on my experience, the professor doesn't contribute much of anything to the papers written by their students.

0

154

tstjohn_hpcml retweeted

Sharon Zhou ✈️ ICML

@realSharonZhou

almost 3 years ago

Excited to announce a HUGE secret with @LisaSu: @LaminiAI has been building LLMs on @AMD GPUs *in production* for over the past year! We’ve made running LLMs on AMD super easy and a highly competitive option through our LLM Superstation, available now at ~10x lower cost than cloud. 👉🏻 https://t.co/wqx9d8mPOK Our enterprise customers have already built *thousands* of private LLMs on @LaminiAI LLM Superstations, e.g. @iFit leading at-home fitness with millions of users and @AMD itself: 🚀 Easy & fast: “It was simple to iterate and deploy with a few lines of code and amazingly fast with the AMD Instinct™ hardware.” ⭐️ LLMs are the new IP: “Using a public LLM wasn’t enough: we needed something that we could easily and quickly personalize to our customers’ data and constantly improve on new data, while keeping all of our data private.” ⚙️ Any infrastructure: “We’ve deployed Lamini in our internal Kubernetes cluster with AMD Instinct GPUs, and are using finetuning to create models trained on [our data].” We had a cameo quote from Joe Spisak at @MetaAI who leads the Llama efforts said: “…Llama 2 is becoming the foundation of some of the most innovative companies.” 🫱🏼‍🫲🏾 Join Fortune 500 enterprises, and get your own private LLM Superstations—hosted, VPC, or on-premise (just 2 questions): https://t.co/ECVOsux9ay More (technical) details here👉🏻 https://t.co/wqx9d8mPOK

33

752

101

278

399K

Tom St. John @tstjohn_hpcml

almost 3 years ago

@HPC_Guru @Tesla @nvidia @tomshardware So much for their claims that they can train their networks with Dojo

1

2

0

103

Tom St. John @tstjohn_hpcml

almost 3 years ago

If you're planning to attend @hotchipsorg, come check out our ML inference tutorial on Sunday. We've got a great line-up of speakers from @NVIDIAAI, @berkeley_ai, @Qualcomm, @MetaAI, and Moffett AI. #HotChips35 https://t.co/pMtdPbfOu8

tstjohn_hpcml's tweet photo. If you're planning to attend @hotchipsorg, come check out our ML inference tutorial on Sunday. We've got a great line-up of speakers from @NVIDIAAI, @berkeley_ai, @Qualcomm, @MetaAI, and Moffett AI.

#HotChips35
https://t.co/pMtdPbfOu8 https://t.co/dd9ssL0ywU

0

6

2

1

698

Tom St. John @tstjohn_hpcml

almost 3 years ago

@jabsNtriangles Lethal Weapon showed us what happens when someone knows jiu-jitsu (they hired Rorion Gracie to serve as a technical advisor) https://t.co/jUHJZnCheb

0

85

Tom St. John @tstjohn_hpcml

almost 3 years ago

@krismicinski My lab was able to do that for experimental architectures we were using (and there was a lot of red tape), but there were exclusive contracts in place keeping us from doing that for any of our clusters.

0

6

0

389

Tom St. John @tstjohn_hpcml

almost 3 years ago

@HPC_Guru @AmpereComputing @AMD @ServeTheHome Time will tell, but I think Ampere cutting their system level cache in half to accommodate the higher core count (compared to Altra) is going to hurt their performance in the long run.

1

0

222

tstjohn_hpcml retweeted

Abdulrahman Mahmoud @ARHmahmoud

over 3 years ago

Call for applications for the inaugural Machine Learning and Systems Rising Stars 2023 workshop! Website: https://t.co/kNbw8OmXMu

0

17

6

1

2K

Tom St. John @tstjohn_hpcml

over 3 years ago

@karpathy Congratulations

0

35

Tom St. John @tstjohn_hpcml

over 3 years ago

@Meng2Fu Congratulations!

1

0

83

tstjohn_hpcml retweeted

Horace He

@cHHillee

over 3 years ago

Let's talk about a detail that occurs during PyTorch 2.0's codegen - tiling. In many cases, tiling is needed to generate efficient kernels. Even for something as basic as torch.add(A, B), you might need tiling to be efficient! But what is tiling? And when is it needed? (1/13)

cHHillee's tweet photo. Let's talk about a detail that occurs during PyTorch 2.0's codegen - tiling.

In many cases, tiling is needed to generate efficient kernels. Even for something as basic as torch.add(A, B), you might need tiling to be efficient! But what is tiling? And when is it needed?

(1/13) https://t.co/99fPSv8Lx0

7

883

130

571

192K

tstjohn_hpcml retweeted

Vivek Natarajan

@vivnat

over 3 years ago

Delighted to share our new @GoogleHealth @GoogleAI @Deepmind paper at the intersection of LLMs + health. Our LLMs building on Flan-PaLM reach SOTA on multiple medical question answering datasets including 67.6% on MedQA USMLE (+17% over prior work). https://t.co/jZZuFDrxGw

vivnat's tweet photo. Delighted to share our new @GoogleHealth @GoogleAI @Deepmind paper at the intersection of LLMs + health.

Our LLMs building on Flan-PaLM reach SOTA on multiple medical question answering datasets including 67.6% on MedQA USMLE (+17% over prior work).

https://t.co/jZZuFDrxGw https://t.co/8k7qO6wTZw

34

2K

357

513

783K

tstjohn_hpcml retweeted

Xavier Bresson @xbresson

over 3 years ago

Our paper "Benchmarking Graph Neural Networks" has been accepted for publication at Journal of Machine Learning Research @JmlrOrg! https://t.co/OXq9uJwt9u (after rejection from NeurIPS, ICLR and ICML :)

xbresson's tweet photo. Our paper "Benchmarking Graph Neural Networks" has been accepted for publication at Journal of Machine Learning Research @JmlrOrg!
https://t.co/OXq9uJwt9u

(after rejection from NeurIPS, ICLR and ICML :) https://t.co/5jyYPrh2MV

12

931

128

261

166K

Tom St. John

@tstjohn_hpcml

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users