Atharv Bhat @ATHARVBHAT - Twitter Profile

about 2 months ago

These days I just look at my GPU power usage as a cheap heuristic to see if anything is wrong. It works for most cases Profile your code to get the rest of the way there

Sakura Yuki

@sakurayukiai

about 2 months ago

Just uninstalled `nvtop` and I feel so betrayed. `nvidia-smi` reports 100% utilization if even a single kernel is active on one SM! Swapped to Utilyze to sample actual hardware counters for true compute throughput. My 'fully saturated' 5070 Ti is literally sitting at 32% 🫠

6

133

5

78

10K

0

1

0

21

ATHARVBHAT retweeted

Patrick C Toulme

@PatrickToulme

about 2 months ago

Launching pyptx — a Python DSL for writing NVIDIA PTX kernels. One PTX instruction = one Python call. Write pure PTX in Python. Direct Hopper + Blackwell support: wgmma, TMA, tcgen05, mbarriers. JAX + PyTorch integration. Includes GEMM, grouped GEMM, RMSNorm, SwiGLU, and a PTX→Python transpiler pip install pyptx[torch] pip install pyptx[jax] https://t.co/PcISpsaeQ5

34

1K

135

814

181K

Atharv Bhat @ATHARVBHAT

4 months ago

People are finally seeing the light. Welcome to the dark side

rohan anil

@_arohan_

4 months ago

JAX just feels better for multi-accelerator programming - there is a human touch. I see a big difference is the mental model in JAX, you talk about parallelism in a model-agnostic way, while in torch I still often end up thinking through a mix of framework features, wrappers, and distributed strategies. Is there something I am missing here?

9

156

6

37

15K

0

1

0

46

Atharv Bhat @ATHARVBHAT

4 months ago

Exactly why I switched to Jax and will (probably) never look back

typedfemale

@typedfemale

4 months ago

everything i have learned about how fsdp2 and torch compile interact has been against my will

2

123

3

7

7K

0

1

0

46

Who to follow

Nikunj Gupta

@NikunjGupta97

CS PhD @uscviterbi | previously intern @Mila_Quebec | MS @nyutandon (thesis @Nyu_Courant) | RA @UAlberta | Interested in fundamental & applied multi-agent RL

math ∩ computing, Opinions are my own, RTs are not endorsements.

Atharv Bhat @ATHARVBHAT

4 months ago

Another reason to switch to Jax :)

difficultyang @difficultyang

4 months ago

Interview question I would have failed prior to today: why does PyTorch's DistributedDataParallel give incorrect gradients when the global objective function is computed as a sum over per-sample loss?

16

432

15

487

55K

0

1

0

47

Atharv Bhat @ATHARVBHAT

6 months ago

@timrudner @UofT @VectorInst Congratulations professor !

1

0

82

Atharv Bhat @ATHARVBHAT

8 months ago

@daniel_nguyenx I need this on a T-shirt

0

23

Atharv Bhat @ATHARVBHAT

9 months ago

This is a great article. I wish to see more of this kind of research. Idk how but @cHHillee keeps putting out absolute bangers.

Thinking Machines

@thinkymachines

9 months ago

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly. The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains. https://t.co/lrJioBmpbT

thinkymachines's tweet photo. Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly.

The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains.

https://t.co/lrJioBmpbT

230

8K

1K

5K

3M

0

65

Atharv Bhat @ATHARVBHAT

9 months ago

I'm primarily a Python/JAX/PyTorch developer, so diving into rust programming has been an incredible learning experience. If you think this is interesting and worth your time, please give it a try. I welcome contributions and feedback ! (6/6)

0

40

Atharv Bhat @ATHARVBHAT

9 months ago

I'm excited to share something I've been working on for the past few weeks: Otters �� - A minimal vector search library with powerful metadata filtering powered by an ergonomic Polars-like expressions API written in Rust! (1/6) https://t.co/mdoqngI0c0

1

2

0

74

Atharv Bhat @ATHARVBHAT

9 months ago

The library is in very early stages and there are tons of features that i want to add. - Python bindings, NumPy support, - Serialization and persistence, - Parquet / Arrow integration, - Vector quantization, etc. (5/6)

1

0

43