Akash Srivastava @variational_i - Twitter Profile

6 months ago

Come check out poster #5518 at the NeurIPS morning session today to learn how you can encourage diversity and prevent early pruning during inference-time scaling, and boost the performance of any model without additional training!

Akash Srivastava @variational_i

6 months ago

@annadgoldie @Azaliamirh @RicursiveAI Congrats @annadgoldie and @Azaliamirh — very exciting!

Akash Srivastava @variational_i

12 months ago

What does it take to scale AI beyond the lab? At #RedHatSummit, @ishapuri101 and I spoke with Red Hat CEO Matt Hicks & CTO Chris Wright on inference-time scaling, open infra (LLMD), and making AI affordable for enterprise. 🎧 https://t.co/HDtJcEFF28 #NoMathAI @RedHat_AI

Akash Srivastava @variational_i

about 1 year ago

🚀 How is generative AI transforming the way we design cars, planes, and entire systems? In Ep 2 of No Math AI, @ishapuri101 and I chat with Dr. @_faezahmed (@MIT DeCoDE Lab) about how AI boosts creativity, cuts design time, and works with engineers—not against them.

Red Hat AI

@RedHat_AI

about 1 year ago

How is generative AI reshaping engineering design? In Episode 2 of No Math AI, hosts Dr. Akash Srivastava (@variational_i) and MIT PhD student Isha Puri (@ishapuri101) sit down with Dr. Faez Ahmed (@_faezahmed) from MIT DeCoDE Lab to explore just that. 👇

Akash Srivastava @variational_i

about 1 year ago

SQuat: KV-Cache for making reasoning models go 🚀 📄paper: https://t.co/SCYVEz8YKL 💻 code: https://t.co/ZLVKIAEgmc From my awesome collaborators @RedHat_AI

Hao Wang @HW_HaoWang

about 1 year ago

[1/x] 🚀 We're excited to share our latest work on improving inference-time efficiency for LLMs through KV cache quantization, a key step toward making long-context reasoning more scalable and memory-efficient.

variational_i retweeted

Red Hat AI

@RedHat_AI

about 1 year ago

Excited to share our preliminary work on customizing reasoning models using Red Hat AI Innovation’s Synthetic Data Generation (SDG) package! 📄 Turn your documents into training data for LLMs. 🧵👇

variational_i retweeted

Isha Puri

@ishapuri101

about 1 year ago

had a great time giving a talk about probabilistic inference scaling and the power of small models at the IBM Research ML Seminar Series - the best talks end with tons of questions, and it was great to see everyone so engaged : ) https://t.co/zr09shHGT7

Akash Srivastava @variational_i

over 1 year ago

Come along and help us build reasoning in small LLMs

Kai Xu @xukai92

over 1 year ago

🚀 Exploring LLM reasoning—live! We, the @RedHat AI Innovation Team, are working on reproducing R1-like reasoning in small LLMs without distilling R1 or its derivatives. We’re documenting our journey in real-time: 🔗 Follow along: https://t.co/89HLcgrtVt

Akash Srivastava @variational_i

over 1 year ago

Excited to share our latest work with @ishapuri101 et al.! 🚀 We introduce a probabilistic inference approach for inference-time scaling of LLMs using particle-based Monte Carlo methods, achieving 4–16x better scaling on math reasoning tasks and o1-level performance on MATH500.

Isha Puri

@ishapuri101

over 1 year ago

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint @MIT_CSAIL / @RedHat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out https://t.co/Iz8zoVbZPn

variational_i retweeted

Seungwook Han

@seungwookh

over 1 year ago

🧩 Why do task vectors exist in pretrained LLMs? Our new research uncovers how transformers form internal abstractions and the mechanisms behind in-context learning (ICL).

variational_i retweeted

Cole Hurwitz

@cole_hurwitz

over 1 year ago

Neural activity is correlated among animals performing the same task and across sequential trials. Led by @zhang_yizi and @hl3616, we develop a reduced-rank model that exploits shared structure across animals to improve neural decoding. https://t.co/Ip7nO0q4yS

variational_i retweeted

Cole Hurwitz

@cole_hurwitz

almost 2 years ago

What will a foundation model for the brain look like? We argue that it must be able to solve a diverse set of tasks across multiple brain regions and animals. Check out our preprint where we introduce a multi-region, multi-animal, multi-task model (MtM): https://t.co/eaC4jyFsBN

variational_i retweeted

Seungwook Han

@seungwookh

about 2 years ago

🚀 Stronger, simpler, and better! 🚀 Introducing Value Augmented Sampling (VAS) - our new algorithm for LLM alignment and personalization that outperforms existing methods!

variational_i retweeted

Seungwook Han

@seungwookh

about 2 years ago

Excited to give a talk on our hottest, newest work “Value Augmented Sampling for Language Model Alignment and Personalization” at 2:30p in Halle A3 at the #ICLR2024 Reliable and Responsible Foundation Models Workshop 🥳🥳

Akash Srivastava @variational_i

about 2 years ago

Attending #ICLR2024, interested in continual learning, and like probabilistic modeling? Lazar from the @MITIBMLab will be presenting our latest work, which takes a probabilistic approach to modular continual learning, on Tuesday, 7 May, in Halle B #222 (https://t.co/dVYKhvtkM7).

Lazar Valkov @lazarvalkov

about 2 years ago

I’ll be presenting our #ICLR2024 paper on a probabilistic approach to scaling modular continual learning algorithms while achieving different types of knowledge transfer. (https://t.co/IbhoqPmjkI, in collaboration with @variational_i @swarat @RandomlyWalking). A tldr (1/8):

variational_i retweeted

Faez Ahmed @_faezahmed

about 2 years ago

Check out our work titled "From Automation to Augmentation: Redefining Engineering Design and Manufacturing in the Age of NextGen-AI", where we highlight the requirements for NextGenAI suitable for design, engineering, and manufacturing. https://t.co/vHY6l39nqO

variational_i retweeted

Mathieu

@miniapeur

about 2 years ago

Akash Srivastava @variational_i

about 2 years ago

@jeethu @Geronimo_AI Good point, model card has been updated with v2 results.

Akash Srivastava @variational_i

about 2 years ago

New work from @MITIBMLab researchers on large scale alignment of LLMs. Check out the models at HF https://t.co/AlujBOwM0O

David Cox

@neurobongo

about 2 years ago

Hey, we did a thing: "LAB: Large-scale Alignment for chatBots"—a new synthetic data-driven LLM alignment method that yields great results without using large-scale human or proprietary model data. https://t.co/QdrAzgD9Kr models: https://t.co/Q1eHsPOHUv, https://t.co/ipNIwpJPuf

Akash Srivastava @variational_i

about 2 years ago

New work on automated red-teaming in LLMs using curiosity-driven exploration! #iclr24

Zhang-Wei Hong

@ZhangWeiHong9

about 2 years ago

(1/4) 🎉 Excited to share our ICLR'24 paper on "Curiosity-driven Red-teaming for Large Language Models"! We bridge curiosity-driven exploration in reinforcement learning (RL) with red-teaming, introducing the Curiosity-driven Red-teaming (CRT) method. #ICLR24 #AI #LLMSecurity
