Mansi Sakarvadia

@Mansi__S

Bluesky: @mansisakarvadia.bsky.social
Computer Science/Machine Learning Ph.D. Student @UChicago & @globus. @doecsgf Computational Science Graduate Fellow.

Chicago, IL

Joined August 2015

161 Following

79 Followers

44 Posts

Mansi__S retweeted

FORTUNE

@FortuneMagazine

over 1 year ago

"Without long-term, foundational, and high-risk federal research investments, the seeds of innovation cannot take root," Rebecca Willett and Henry Hoffman write in a new commentary piece. https://t.co/n8lscVfZid

Mansi Sakarvadia @Mansi__S

over 1 year ago

Check out an interview in which I discuss the recent Nobel Prizes and share some thoughts on their impact on both the domain sciences and ML communities.

Science in Parallel @scienceinparall

over 1 year ago

New 🎙️episode: @Mansi__S of @UChicago and @JoshVermaas of @MSUDOEPlantLab discuss the #AI #Nobels and their impact on computing and research: https://t.co/PJOhOHC4oE Both are part of the @doecsgf community. #HPC

Mansi Sakarvadia @Mansi__S

over 1 year ago

Reflecting on my 2024 PhD journey: passed my qualifying exam, spent the summer at Berkeley, mentored undergrad students, and tackled the fast pace of AI/ML research. It’s been a year of milestones and growth! Read more here: https://t.co/8LiZaZd8Rn #PhDJourney #AIResearch

Mansi Sakarvadia @Mansi__S

over 1 year ago

Congrats to Jordan for winning 1st place at the SC24 student poster competition! It was super fun to mentor him this summer on his project "Mind Your Manners: Detoxifying Language Models via Attention Head Intervention".

Nathaniel Hudson @Nchudson95

over 1 year ago

I am super proud of Jordan Pettyjohn, an undergraduate student I had the privilege of working with this past summer, for winning the Student Research Competition at the @Supercomputing conference! 🏆 His work studied ablation strategies for toxicity in #LLMs.

Mansi__S retweeted

𝚐𝔪𝟾𝚡𝚡𝟾

@gm8xx8

over 1 year ago

Towards Interpreting Language Models: A Case Study in Multi-Hop Reasoning
Paper: https://t.co/03JhOvpZep
This method improves multi-hop reasoning in language models by injecting “memories” into key attention heads, increasing accuracy in complex tasks. An open-source tool, Attention Lens, interprets attention outputs, helping trace the model’s reasoning and pinpoint issues like bias or harmful content.

Mansi__S retweeted

Globus Labs @LabsGlobus

over 1 year ago

Excited to share our latest work: "SoK: On Finding Common Ground in Loss Landscapes Using Deep Model Merging Techniques"! 🧠 https://t.co/XAG79rS3ij By @arhamk1216, @toddknife, @Nchudson95, @Mansi__S, @dcgrzenda, @aswathy__ajith, Jordan Pettyjohn, @chard_kyle, and @ianfoster.

Mansi Sakarvadia @Mansi__S

over 1 year ago

7/ 🙏 Special Thanks: A huge shoutout to my incredible co-authors from multiple institutions for their contributions to this work: @aswathy__ajith, Arham Khan, @Nchudson95, @calebgeniesse, @nsfzyzz, @chard_kyle, @ianfoster, Michael Mahoney

Mansi Sakarvadia @Mansi__S

over 1 year ago

1/🧵New Research on Language Models! Language models (LMs) often "memorize" data, leading to privacy risks. This paper explores ways to reduce that! Paper: https://t.co/OBtYz9mJON Code: https://t.co/x0C5I77CG3 Blog: https://t.co/nA6AH5rnXV

Mansi Sakarvadia @Mansi__S

over 1 year ago

6/ 🌍 Scalable Impact: Our methods aren’t just for small models! We show that they scale effectively to larger LMs, providing robust memorization mitigation without compromising performance across different sizes of models. Exciting progress for real-world applications!

Mansi__S retweeted

Globus Labs @LabsGlobus

over 1 year ago

Language models can memorize sensitive data! 🔒 Our new research by the team (@Mansi__S, @Nchudson95, and others) with TinyMem shows unlearning methods like BalancedSubnet effectively mitigate memorization while keeping performance high. #AI #Privacy https://t.co/usBb1wLSID

Mansi__S retweeted

Globus Labs @LabsGlobus

almost 2 years ago

Jordan Pettyjohn, @Nchudson95, @Mansi__S, @aswathy__ajith, and @chard_kyle just published new work demonstrating detoxification strategies on language model outputs at @BlackboxNLP! "Mind Your Manners: Detoxifying Language Models via Attention Head Intervention". Congrats, all!

Mansi Sakarvadia @Mansi__S

about 2 years ago

@doctagert Thanks, Adam!

Mansi Sakarvadia @Mansi__S

about 2 years ago

🎉 I successfully defended my Master's dissertation in the area of interpretable Language Modeling! Check out my work's applications in better understanding multi-hop reasoning, bias localization, and malicious prompt detection in my talk: https://t.co/DGEvsfzawK

Mansi__S retweeted

Globus Labs @LabsGlobus

about 2 years ago

@Mansi__S presented her Master's thesis on "Memory Injections: Correcting Multi-Hop Reasoning Failures during Inference in Transformer-Based Language Models". Watch the recording here: https://t.co/4tgQUgFDTT

Mansi__S retweeted

Ben Blaiszik

@BenBlaiszik

about 2 years ago

Interested in understanding how #LLMs work, why they often fail to reason, and how to improve performance? One tool to boost multi-hop reasoning is targeted memory injection, which improves desired token probability by up to 424%! 🎥 Watch the talk by @Mansi__S now: https://t.co/ek1HIvDPXk

Mansi__S retweeted

Ben Blaiszik

@BenBlaiszik

over 2 years ago

✨Trillion Parameter Models in Science✨ We present an initial vision for a shared ecosystem to take the next step in large language models for scientific research: Trillion Parameter Models (TPMs). #LLMs are becoming more powerful by the day, but there is still work to be done to enable discovery of new therapeutics, materials, and physics with these tools. 🔬 📜 https://t.co/5Onyx4jTnD
