Satvik Garimella

6 days ago · Toronto

What a show @DonToliver . Genuinely such an insane performer

0

5

0

1

64

7 days ago

GOATED

7 days ago

@sakshambatraa and I have now implemented Softmax on our toy LPU!

after overhauling the VXM pipeline, we set our sights on implementing the Softmax module in hardware 🧵 https://t.co/i7to32dqby

5

22

7

8

1K

0

10

0

3

496

7 days ago

@michael_trbo @sakshambatraa AURAAAAA

0

2

0

51

11 days ago

@michael_trbo @sakshambatraa 🧑‍🍳

0

2

0

63

11 days ago

@michael_trbo @sakshambatraa My goats are cooking

0

2

0

67

@michael_trbo @sakshambatraa

11 days ago

0

2

0

70

11 days ago

Wake up the goat posted

11 days ago

another progress update on reinventing Groq's LPU with @sakshambatraa: we redesigned our vector execution module (VXM) to better support overlapping operations, and introduced compatibility to run self-attention!

7

18

8

6

3K

0

9

0

2

335

20 days ago

@ShamsCharania Toronto being snubbed again…. Scottie should be first team this is a joke

1

59

0

4K

22 days ago

@SpliftedNGifted @ShamsCharania Cedric Coward bro what

1

0

55

satvikgari retweeted

Jaival Patel

@patjaival

about 1 month ago

after 3 months of continuous crashing, i finally got rl to land a rocket by itself! yes, the complete 6dof dynamics: translation + rotation, variable mass, tvc, disturbances, all of it done by the rl itself.

the core issue is that landing is a constrained braking problem, not open-ended control. rl fails because the feasible solution manifold is extremely narrow.

once the search space was shaped properly, rl converged. i tried various rl policies and architectures to figure all this out.

full technical analysis here check it out!: https://t.co/IYrlPlmiPQ

2

20

1

12

1K

about 1 month ago

@TannerBennet1 @Bantonappp I am NOT GIVING UP CMB BRO WHAT

0

110

about 1 month ago

@C_HANN_ING This is the greed they talked about in the bible

0

127

about 1 month ago

@XanderChin Step 2

0

56

about 1 month ago

A few months ago, I saw Karpathy build NanoChat in PyTorch, and it made me want to understand how these models work underneath the abstractions.

So I decided to try building one myself, but in a different framework: JAX.

Here’s how I did it: 🧵 https://t.co/EzpZHoE3fR

5

25

7

9

2K

about 1 month ago

@sakshambatraa Thanks vro

0

1

0

74

about 1 month ago

Full stack:
→ Transformer implementation in raw JAX
→ Custom Optax training loop
→ Modal serverless GPUs
→ Alpaca fine-tuning
→ DuckDuckGo RAG pipeline
→ FastAPI + React chat interface

Building this gave me a much better understanding of transformer internals, inference optimization, and how modern LLM systems are actually put together.

GitHub: https://t.co/IksdAwW4un
Website: https://t.co/viGnjYlNYT

0

6

0

1

164