Shradha Sehgal @shradhasgl - Twitter Profile

Shradha Sehgal

@shradhasgl

about 1 month ago

@rronak_ @MichaelElabd @QuantumArjun Congrats Ronak!

1

2

0

154

shradhasgl retweeted

Ahan Gupta

@AhanGupta13

2 months ago

1/🚨 A bit late to the party, but excited to share our paper at #ICLR2026: AutoSP — the first compiler-based solution to unlock long-context LLM training. ✅ Up to 2.7× longer contexts on NVIDIA (2.5x on AMD) ✅ Negligible runtime overhead ✅ Merged into DeepSpeed 🚀 🧵👇

1

11

4

3

1K

Shradha Sehgal

@shradhasgl

7 months ago

I’ll be at NeurIPS from Dec 1-8! 🌊🏖️ Excited to catch up with old and new friends! Let’s grab coffee ☕️ or have a quick chat if you’re attending! #NeurIPS2025

1

7

0

644

Shradha Sehgal

@shradhasgl

8 months ago

@krandiash Sonic! This is awesome! Congrats Karan and team.

0

1

0

48

Who to follow

Mohit Chandra

@mohit__30

Applied Scientist Intern @Microsoft | PhDing @GeorgiaTech | Past: @AmazonScience @MSFTResearch | NLP/Responsible AI for Mental Health | Opinions are personal

Anmol Goel

@anmgoel

AI Safety x Privacy @ELLISForEurope PhD @UKPLab, @TUDarmstadt and @UCPH_Research | Prev @iiit_hyderabad |

shizia

@deutranium

I like working on hard problems phd (dropout) @UZH_en fell in love with graphs @sn_ethz learnt all about NNs @iiit_hyderabad

shradhasgl retweeted

Nirav Diwan

@ocean_drifters

9 months ago

Excited to finally share our #NeurIPS2025 paper "🔮PurpCode: Reasoning for Safer Code Generation"! 🙌 👐 First post-training recipe for training safe code reasoning models 🚀 SOTA for cybersafety + utility, outperforming Sonnet 4, o4-mini, R1 🥇 Winner of 2025 Amazon Nova AI Challenge 📝 Paper: https://t.co/9aOoiL5zgJ 🧵👇 1/11

ocean_drifters's tweet photo. Excited to finally share our #NeurIPS2025 paper "🔮PurpCode: Reasoning for Safer Code Generation"! 🙌

👐 First post-training recipe for training safe code reasoning models
🚀 SOTA for cybersafety + utility, outperforming Sonnet 4, o4-mini, R1
🥇 Winner of 2025 Amazon Nova AI Challenge

📝 Paper: https://t.co/9aOoiL5zgJ

🧵👇 1/11

1

17

6

1

2K

shradhasgl retweeted

Pratik Sampat @pratikrsampat

9 months ago

Excited to finally share our new paper titled "CPU Autoscaling With a Kernel of Truth" at APSys 2025. Huge thanks to my amazing collaborators and advisors @tianyin_xu, @SaugataGhose Catch my virtual talk tonight if you're in Seoul! Paper: https://t.co/xqwwu6NWiF

pratikrsampat's tweet photo. Excited to finally share our new paper titled "CPU Autoscaling With a Kernel of Truth" at APSys 2025.

Huge thanks to my amazing collaborators and advisors @tianyin_xu, @SaugataGhose

Catch my virtual talk tonight if you're in Seoul!

Paper: https://t.co/xqwwu6NWiF https://t.co/sbrEbVB21H

4

123

20

52

10K

Shradha Sehgal

@shradhasgl

9 months ago

It’s wild how much chatGPT has supercharged learning… I use it regularly to understand topics across fields like biology, economics, politics, etc, which ordinarily would take me hours of research & browsing

1

5

1

0

398

shradhasgl retweeted

Ishika Agarwal @wonderingishika

10 months ago

I'm excited to announce NN-CIFT got into @NeurIPSConf 2025 (featuring a fancy, new title)💃💃🌴Can't wait to discuss it with everyone!! Thank you @dilekhakkanitur and @convai_uiuc 🎉🎉

wonderingishika's tweet photo. I'm excited to announce NN-CIFT got into @NeurIPSConf 2025 (featuring a fancy, new title)💃💃🌴Can't wait to discuss it with everyone!!

Thank you @dilekhakkanitur and @convai_uiuc 🎉🎉 https://t.co/Ga2owLjQkI

3

58

7

8

9K

Shradha Sehgal

@shradhasgl

12 months ago

@OfficialTanvi Congrats!!!! 🎊🍾🎈

0

72

shradhasgl retweeted

Revanth Gangi Reddy

@gangi_official

12 months ago

I've successfully defended my PhD thesis on automated information seeking! Extremely grateful to my advisor @hengjinlp, committee members and all collaborators. Next, I'll be joining @GoogleDeepMind as a research scientist! Link to defense slides: https://t.co/O4y30tnncC

gangi_official's tweet photo. I've successfully defended my PhD thesis on automated information seeking! Extremely grateful to my advisor @hengjinlp, committee members and all collaborators.

Next, I'll be joining @GoogleDeepMind as a research scientist!

Link to defense slides: https://t.co/O4y30tnncC https://t.co/F8a0uE1Orn

11

76

7

8

5K

Shradha Sehgal

@shradhasgl

12 months ago

@gangi_official @hengjinlp @GoogleDeepMind Congrats! 🎉

1

0

115

Shradha Sehgal

@shradhasgl

12 months ago

Amazing work! 😯

Minjia Zhang @_Minjia_Zhang_

about 1 year ago

🔬Interested in training AlphaFold3 faster, at scale, and beyond NVIDIA GPU? Now you can. AlphaFold3 is a major leap in biomolecular modeling, but behind the scenes, it introduces severe system bottlenecks: 🧠 2D EvoAttention spikes memory usage 📉 Retrieval-augmented training pipeline causes long GPU idle time ⛔ Frequent but memory-intensive ops slow everything down Today, I'm excited to announce MegaFold, a fully open-source system to make AlphaFold3 training fast, scalable, and cross-platform on both NVIDIA and AMD GPUs. MegaFold delivers: ⚡ Up to 1.73x / 1.62x faster training on NVIDIA H100 / AMD MI250 🧬 Up to 1.35× longer sequences compared to PyTorch baseline Key features: 🚀 Memory-Efficient EvoAttention via portable Triton kernels 💡 Ahead-of-Time Caching to eliminate GPU idle time in retrieval pipelines 🔗 DeepFusion for reducing overhead of small but frequent memory-intensive AF3 ops 📘 Project page: https://t.co/sSaxDtBT1O 📄 Paper: https://t.co/2sCArFoq82 💻 Code: https://t.co/gn7HR5kcQC 🤝 MegaFold is developed in collaboration between UIUC SSAIL Lab and researchers from University of Missouri and Lawrence Berkeley National Laboratory. Kudos to the brilliant team: Hoa La, Ahan Gupta, Alex Morehead, Jianlin Cheng #AlphaFold3 #AI #ProteinFolding #Bioinformatics #AMD #Triton #CrossPlatform #OpenSource

0

3

5

1

3K

0

2

0

581

shradhasgl retweeted

Minjia Zhang @_Minjia_Zhang_

about 1 year ago

🔬Interested in training AlphaFold3 faster, at scale, and beyond NVIDIA GPU? Now you can. AlphaFold3 is a major leap in biomolecular modeling, but behind the scenes, it introduces severe system bottlenecks: 🧠 2D EvoAttention spikes memory usage 📉 Retrieval-augmented training pipeline causes long GPU idle time ⛔ Frequent but memory-intensive ops slow everything down Today, I'm excited to announce MegaFold, a fully open-source system to make AlphaFold3 training fast, scalable, and cross-platform on both NVIDIA and AMD GPUs. MegaFold delivers: ⚡ Up to 1.73x / 1.62x faster training on NVIDIA H100 / AMD MI250 🧬 Up to 1.35× longer sequences compared to PyTorch baseline Key features: 🚀 Memory-Efficient EvoAttention via portable Triton kernels 💡 Ahead-of-Time Caching to eliminate GPU idle time in retrieval pipelines 🔗 DeepFusion for reducing overhead of small but frequent memory-intensive AF3 ops 📘 Project page: https://t.co/sSaxDtBT1O 📄 Paper: https://t.co/2sCArFoq82 💻 Code: https://t.co/gn7HR5kcQC 🤝 MegaFold is developed in collaboration between UIUC SSAIL Lab and researchers from University of Missouri and Lawrence Berkeley National Laboratory. Kudos to the brilliant team: Hoa La, Ahan Gupta, Alex Morehead, Jianlin Cheng #AlphaFold3 #AI #ProteinFolding #Bioinformatics #AMD #Triton #CrossPlatform #OpenSource

0

3

5

1

3K

Shradha Sehgal

@shradhasgl

about 1 year ago

@AkshayGoindani1 Thanks @AkshayGoindani1 looks like we might be quick to assume these gains since evils did not reproduce the baselines effectively https://t.co/0LEygcoxJl

Shashwat Goel @ ICML 🇰🇷

@ShashwatGoel7

about 1 year ago

Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were serverely underreported across papers. We compiled discrepancies in a blog below🧵👇

ShashwatGoel7's tweet photo. Confused about recent LLM RL results where models improve without any ground-truth signal? We were too. Until we looked at the reported numbers of the Pre-RL models and realized they were serverely underreported across papers. We compiled discrepancies in a blog below🧵👇 https://t.co/Hmn41grrrh

33

869

121

523

324K

1

0

187

Shradha Sehgal

@shradhasgl

about 1 year ago

Can someone pls give a tldr of what happened in RLVR this past week…

1

3

0

767

Shradha Sehgal

@shradhasgl

about 1 year ago

Such interesting insights!! 😮 Esp for cultural linguistic analysis of LLMs

Ishika Agarwal @wonderingishika

about 1 year ago

Would models know more about Indian food in Hindi and Turkey’s history in Turkish? Does the language of a question affect an LLM’s answer? ✨Yes!✨ @nbbozdag and I are excited to announce our newest preprint in which we explore “Language Specific Knowledge (LSK)”.