Hasith Vattikuti @hasith_v - Twitter Profile

hasith_v retweeted

19 days ago

AI compute and inference are increasingly $$$. How can we change the unit economics of AI to improve accessibility? It's been fun working with @prlnet to release the first model endpoint that simultaneously generates tokens **and** a digital asset that can subsidize inference! 🪙 Check it out, links below 🚀

3

57

2

15

6K

Hasith Vattikuti @hasith_v

about 1 month ago

Loved working on this project, super proud of it! Thank you @AwesomeBao, @bialjail, and @wgilpin0 for being such a great team that I learned so much from. Excited to share our work at ICML!

William Gilpin @wgilpin0

about 1 month ago

Our ICML spotlight paper discovers universal redundancies in time series foundation models: the middle layers of many models can be removed without sacrificing performance 1/

wgilpin0's tweet photo. Our ICML spotlight paper discovers universal redundancies in time series foundation models: the middle layers of many models can be removed without sacrificing performance 1/ https://t.co/hyswlfcXFg

2

186

13

111

19K

1

8

2

1

2K

hasith_v retweeted

Jackson Stokes

@jackson_stokes

about 2 months ago

We trained LoRA adapters of different ranks to understand training dynamics, finding that adapters for GSM8k live in a surprisingly vast, low-rank solution space. This hints that some model skills are easy to learn, and training is more forgiving than we think. @hasith_v 1/6 🧵

jackson_stokes's tweet photo. We trained LoRA adapters of different ranks to understand training dynamics, finding that adapters for GSM8k live in a surprisingly vast, low-rank solution space.

This hints that some model skills are easy to learn, and training is more forgiving than we think. @hasith_v 1/6 🧵 https://t.co/jmhXaBrmjT

5

252

26

221

23K

hasith_v retweeted

William Gilpin @wgilpin0

about 2 months ago

Congratulations to @BaigYasa on his well-deserved selection as a PD Soros fellow!

0

7

2

0

1K

hasith_v retweeted

Jackson Stokes

@jackson_stokes

about 2 months ago

We post-trained MedGemma to be SoTA in visual medicine ddx, outperforming Opus 4.6, Gemini 3.1 and GPT-5.4 while running at ~1/30th the cost. @getnolla Part 1 - improving visual reasoning 🧵1/6

jackson_stokes's tweet photo. We post-trained MedGemma to be SoTA in visual medicine ddx, outperforming Opus 4.6, Gemini 3.1 and GPT-5.4 while running at ~1/30th the cost. @getnolla Part 1 - improving visual reasoning 🧵1/6 https://t.co/ri6InzBeca

6

34

9

3

4K

Hasith Vattikuti @hasith_v

3 months ago

@jxmnop This is cool, I've always been slightly uncomfortable with treating *everything* unlabeled as a negative. I wonder if using LLMs to produce rankings (even a somewhat noisy one) would be better than a binary classification. Perhaps we can weight according to rank in softmax?

0

683

hasith_v retweeted

William Gilpin @wgilpin0

3 months ago

How do time series foundation models forecast unseen dynamical systems? In new experiments, we find that small transformers learn to approximate transfer operators in-context. (1/N) https://t.co/6YuLr8QuJD

3

378

78

293

29K

hasith_v retweeted

dhruva @dhruvakarkada

4 months ago

Soooo proud of this one! I'll make a post w more details shortly

1

10

1

0

320

Hasith Vattikuti @hasith_v

4 months ago

@jxmnop Will code be released? Interested in playing around with this

0

15

Hasith Vattikuti @hasith_v

4 months ago

@khoomeik @LEGO_Group ASML shut down all talks of a collab in fears of trade secrets being leaked in the build

1

4

0

165

hasith_v retweeted

Yasa Baig @BaigYasa

4 months ago

Great to see high quality software dev in comp bio. It still amazes me how much of computational biology is based on single-thread processing of large .txt files with minimal application-specific-optimization.

2

9

1

0

591

Hasith Vattikuti @hasith_v

5 months ago

@sidbing Share a list of your favorites!

0

25

Hasith Vattikuti @hasith_v

8 months ago

@mattarderne @karpathy See the section titled “The LLaDa Algorithm” in my blog post

0

1

0

38

Hasith Vattikuti @hasith_v

8 months ago

@karpathy https://t.co/DiZH86Wlnn

1

9

0

1

168

Hasith Vattikuti @hasith_v

8 months ago

@karpathy I actually hacked nanogpt sometime ago to become a diffusion llm. Results were pretty decent on shakespeare with character-level tokenization. Honestly was just surprised it even learned to spell words and pick up on basic grammar. Link in reply

hasith_v's tweet photo. @karpathy I actually hacked nanogpt sometime ago to become a diffusion llm. Results were pretty decent on shakespeare with character-level tokenization.

Honestly was just surprised it even learned to spell words and pick up on basic grammar.

Link in reply https://t.co/JzvM1rNgLN

3

46

2

10

2K

Hasith Vattikuti @hasith_v

8 months ago

@materzynska @AIatMeta Very interested in diffusion models and social AI. Would love to talk with you. You can see more about me on my blog: https://t.co/BUGkLLEquN

0

1

0

1

396

Hasith Vattikuti @hasith_v

8 months ago

@a16z @LiamFedus @LiamFedus what are yalls methods to verify what the LLMs are discovering? How do you make sure it’s ‘understanding’ current physics correctly? I have lots of thoughts on this as a physics student doing AI research if you want to chat

0

1

0

168

Hasith Vattikuti @hasith_v

8 months ago

@khoomeik @periodiclabs @LiamFedus Very excited to see where periodic will go next! Extremely bullish on trying to get tangible alpha from AI models in natural sciences--it really plays to my background of first doing physics research and then doing AI research

0

2

0

399

Hasith Vattikuti @hasith_v

9 months ago

@CFGeek Yes it is, happy to discuss and get feedback. All is welcome

1

2

0

33

Hasith Vattikuti @hasith_v

9 months ago

@CFGeek To be fair, I also think it will be hard to get it to work, and it might not even. But the negative result plus the rl env will leave us things to learn from. Cause I’m pretty confident that LLMs will be using internal reasoning techniques only a few years down the line.

1

0

52

Hasith Vattikuti

@hasith_v

Last Seen Users on Sotwe

Trends for you

Most Popular Users