Chetan Verma @chtnverma - Twitter Profile

1 day ago

Brilliant idea! Next up: Apple randomly reboots your Mac if you're building competing tech, Gmail silently edits your email if you mention rival platforms, and Tesla Autopilot swerves if it detects you're working on self-driving cars. All in the name of safety, of course. Because malicious actors controlling the world’s operating systems, inboxes and cars would be extremely dangerous!

102

6K

730

315K

chtnverma retweeted

Prateek Jain

@jainprateek_

8 months ago

We are hiring Research Scientists for our Frontiers-of-AI team at Google DeepMind Bangalore, Singapore, Mountain View. If you're passionate about cutting-edge AI research and building thinking, efficient, elastic, customized, and safe LLMs, we'd love to hear from you. We are looking for candidates with a PhD and a strong demonstrated record of ideating and executing deep research projects. If interested, please apply here: https://t.co/NSxao1nPYo

30

829

90

544

356K

Chetan Verma @chtnverma

11 months ago

cc: @tadityasrinivas @inderjit_ml @cho_jui_hsieh @jainprateek_

0

1

0

139

Chetan Verma @chtnverma

11 months ago

📢 Excited to present our paper at ACM KDD 2025 Conference Matryoshka Model Learning for Improved Elastic Student Models https://t.co/FE4uUNO6CX 🪆🙌↓

Aditya Timmaraju

@tadityasrinivas

about 1 year ago

The Matryoshka🪆wave strikes again! 🚀 Excited to share our latest work, accepted to KDD 2025: Matryoshka Model Learning for Improved Elastic Student Models! https://t.co/uWPU3WhP3K We introduce MatTA, a novel nested distillation framework which enables the extraction of multiple high quality student models from a single training run, enhancing adaptability in production ML systems. A thread. 🧵 (1/6) cc @ManishGuptaMG1 @jainprateek_

1

35

5

9

10K

1

16

3

2

2K

Who to follow

Bhavdeep Sethi

@BhavdeepSethi

@trybasis Ex @frecfinance Ex @Twitter via @TellApart, Ex @Flipkart via Mime360, MachineLearning @Columbia, Mumbaikar. 👀

cts.sf

@tianshu_c

Debugging this world, while tweeting about beauty in it.

Tayo Oviosu 🗽

@oviosu

Chetan Verma @chtnverma

11 months ago

Please come talk to me at #KDD2025 if you're interested in learning more :)

1

0

141

chtnverma retweeted

After Dinner

@AfterDinnerCo

about 1 year ago

@friedberg Emergency pod, but no Jason.

25

878

5

4

93K

chtnverma retweeted

Andrej Karpathy

@karpathy

over 1 year ago

Over the last ~2 hours I curated a new Podcast of 10 episodes called "Histories of Mysteries". Find it up on Spotify here: https://t.co/BH6FTglLIf 10 episodes of this season are: Ep 1: The Lost City of Atlantis Ep 2: Baghdad battery Ep 3: The Roanoke Colony Ep 4: The Antikythera Mechanism Ep 5: Voynich Manuscript Ep 6: Late Bronze Age collapse Ep 7: Wow! signal Ep 8: Mary Celeste Ep 9: Göbekli Tepe Ep 10: LUCA: Last Universal Common Ancestor Process: - I researched cool topics using ChatGPT, Claude, Google - I linked NotebookLM to the Wikipedia entry of each topic and generated the podcast audio - I used NotebookLM to also write the podcast/episode descriptions. - Ideogram to create all digital art for the episodes and the podcast itself - Spotify to upload and host the podcast I did this as an exploration of the space of possibility unlocked by generative AI, and the leverage afforded by the use of AI. The fact that I can, as a single person in 2 hours, curate (not create, but curate) a podcast is I think kind of incredible. I also completely understand and acknowledge the potential and immediate critique here, of AI generated slop taking over the internet. I guess - have a listen to the podcast when you go for walk/drive next time and see what you think.

karpathy's tweet photo. Over the last ~2 hours I curated a new Podcast of 10 episodes called "Histories of Mysteries". Find it up on Spotify here:
https://t.co/BH6FTglLIf

10 episodes of this season are:
Ep 1: The Lost City of Atlantis
Ep 2: Baghdad battery
Ep 3: The Roanoke Colony
Ep 4: The Antikythera Mechanism
Ep 5: Voynich Manuscript
Ep 6: Late Bronze Age collapse
Ep 7: Wow! signal
Ep 8: Mary Celeste
Ep 9: Göbekli Tepe
Ep 10: LUCA: Last Universal Common Ancestor

Process:
- I researched cool topics using ChatGPT, Claude, Google
- I linked NotebookLM to the Wikipedia entry of each topic and generated the podcast audio
- I used NotebookLM to also write the podcast/episode descriptions.
- Ideogram to create all digital art for the episodes and the podcast itself
- Spotify to upload and host the podcast

I did this as an exploration of the space of possibility unlocked by generative AI, and the leverage afforded by the use of AI. The fact that I can, as a single person in 2 hours, curate (not create, but curate) a podcast is I think kind of incredible. I also completely understand and acknowledge the potential and immediate critique here, of AI generated slop taking over the internet. I guess - have a listen to the podcast when you go for walk/drive next time and see what you think.

382

8K

778

6K

706K

chtnverma retweeted

Awni Hannun

@awnihannun

almost 2 years ago

The Transformer architecture has changed surprisingly little from the original paper in 2017 (over 7 years ago!). The diff: - The nonlinearity in the MLP has undergone some refinement. Almost every model uses some form of gated nonlinearity. A silu or gelu nonlinearity is common. - The placement of normalization layers. This tends to vary a little from architecture to architecture. Sometimes more normalization layers per Transformer block (e.g.Gemma 2). Sometimes keys and queries are normalized (e.g. Command+R). - The type of normalization layer. RMS norm is commonly used instead of Layer Norm. All of Llama 3, Phi 3 and Gemma 2 use RMS norm now. Seems like vanilla Layer Norm is becoming a little less common. - Group-query attention is now a staple as it really speeds up inference for larger KV cache's (e.g. longer prompts / generations). - And of course the positional encodings have changed from sinusoidal to rotary (aka RoPE). Not too much variation otherwise.

24

1K

140

1K

124K

Chetan Verma @chtnverma

about 2 years ago

grug be true. warning: stomach hurt laughing https://t.co/C3KeEKDd4b

0

5

0

98

chtnverma retweeted

Gaby Goldberg

@gaby_goldberg

over 2 years ago

Every tech groupchat rn

50

9K

539

189

664K

chtnverma retweeted

Mckay Wrigley

@mckaywrigley

over 2 years ago

You can give ChatGPT a picture of your team’s whiteboarding session and have it write the code for you. This is absolutely insane.

619

29K

5K

11K

11M

chtnverma retweeted

Brian Feroldi

@BrianFeroldi

almost 3 years ago

15 visuals every investor should memorize: 1: In the long run, stocks win:

579

18K

4K

20K

9M

Chetan Verma @chtnverma

over 3 years ago

friendships were forged

0

9

0

540

Chetan Verma @chtnverma

over 3 years ago

@sdachen Yeah if your LinkedIn doesn’t have “He …” then you still haven’t made it, Scott :)

0

2

0

112

Chetan Verma @chtnverma

over 3 years ago

have you really made it if your linkedin isn't written in 3rd person?

1

4

0

559

chtnverma retweeted

Adam Grant

@AdamMGrant

over 3 years ago

We pay too much attention to the most confident voices—and too little attention to the most thoughtful ones. Certainty is not a sign of credibility. Speaking assertively is not a substitute for thinking deeply. It's better to learn from complex thinkers than smooth talkers.

300

25K

7K

1K

0

Chetan Verma

@chtnverma

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users