Debadeepta Dey @debadeepta - Twitter Profile

Pinned Tweet

about 1 year ago

1️⃣We are excited to open-source syftr: a powerful tool for automatically finding Pareto-optimal generative AI flows! syftr searches a large search space of agentic and non-agentic flows to surface optimal tradeoffs between accuracy, cost and latency. 🧵

https://t.co/UWT5z3HatT

1

41

5

21

7K

Debadeepta Dey @debadeepta

6 days ago

@PatrickToulme I see it sometimes on OAI models as well

0

21

Debadeepta Dey @debadeepta

about 2 months ago

@MainzOnX If the compiler does too much it takes a lot of effort to keep extracting perf from different generations of even same hardware (triton->gluon for example). If we keep enriching the DSL as hardware evolves we do less work in the compiler and let AI do more heavy lifting.

0

20

Debadeepta Dey @debadeepta

about 2 months ago

@MainzOnX Agreed with all the points above. My personal speculation is that rich DSLs coupled with a simple thin compiler which only implements the most obvious tricks that we don’t need LLMs to rediscover from scratch will win out. Humans still need to read code that AI generates.

1

0

16

debadeepta retweeted

Jaber

@Akashi203

2 months ago

we published autokernel on arxiv
inspired by @karpathy's autoresearch, we applied the same keep/revert agent loop to GPU kernel optimization
you give it any pytorch model, it profiles it, ranks bottlenecks by amdahl's law, writes triton or CUDA C++ replacements, and runs 300+ experiments overnight with no human in the loop

- 5.29x over pytorch eager on rmsnorm
- 2.82x on softmax
- beats torch.compile by 3.44x on softmax and 2.94x on cross entropy
- #1 on the vectorsum_v2 B200 leaderboard
- single prompt triton FP4 matmul that beats CUTLASS by up to 2.15x

every candidate passes a 5-stage correctness harness before any speedup counts, and the whole thing runs at ~40 experiments/hour so you wake up to a faster model

arxiv: https://t.co/FjXSIG7qp9
github: https://t.co/45z8Z7nP3N

18

660

80

636

89K

debadeepta retweeted

Satya Nadella

@satyanadella

2 months ago

Introducing Critique, a new multi-model deep research system in M365 Copilot. You can use multiple models together to generate optimal responses and reports.

424

4K

510

2K

1M

debadeepta retweeted

Elliot Arledge

@elliotarledge

3 months ago

https://t.co/eqhNRHzaMq

6

65

7

43

6K

Debadeepta Dey @debadeepta

3 months ago

Vibe-coding wars.

The New York Times

@nytimes

3 months ago

Breaking News: The U.S. was responsible for a missile strike on an Iranian school, an ongoing military investigation found. The inquiry said the strike — which Iranian officials said killed at least 175 people — was the result of a targeting mistake. https://t.co/88FIdIJOQi

4K

43K

18K

4K

9M

0

1

0

118

debadeepta retweeted

Leonardo de Moura @Leonard41111588

3 months ago

AI is writing a growing share of the world's software. No one is formally verifying any of it. New essay: "When AI Writes the World's Software, Who Verifies It?" https://t.co/8zjS9FkdA8

41

2K

246

2K

423K

debadeepta retweeted

Yuda Song @yus167

4 months ago

RL on LLMs inefficiently uses one scalar per rollout. But users regularly give much richer feedback: "make it formal," "step 3 is wrong."

Can we train LLMs on this human-AI interaction?

We introduce RL from Text Feedback, with 1) Self-Distillation; 2) Feedback Modeling (1/n) 🧵 https://t.co/i8ncPFKq70

14

598

102

495

107K

Debadeepta Dey @debadeepta

10 months ago

Of course, one can run syftr on top of the silver bullets to lift the Pareto-frontier up even more!

0

2

0

162

Debadeepta Dey @debadeepta

10 months ago

What if you could bring your task and have a system generate a set of AI workflows that work well out-of-the-box? No manual trial-and-error over which of the numerous agents (from single-agent to multi-agent workflows) to use. We built exactly that: https://t.co/qrsbAXaHFs

1

2

1

4

382

Debadeepta Dey @debadeepta

10 months ago

We took a leaf out of this literature and found that by cross-pollinating search across many different datasets (metatraining), one can find a set of flows we term "silver bullets" that do well (in the Pareto sense) *across* tasks *without* running syftr from scratch.

1

0

193

debadeepta retweeted

Wen Sun

@WenSun1

11 months ago

Does RL actually learn positively under random rewards when optimizing Qwen on MATH? Is Qwen really that magical such that even RLing on random rewards can make it reason better?

Following prior work on spurious rewards on RL, we ablated algorithms. It turns out that if you deploy algorithms like Reinforce and REBEL (a generalization of Natural Policy Gradient), RL does not learn under random rewards. These two simple algorithms simply behave as we would expect in this case.

GRPO and PPO indeed can behave strangely. They can learn positively or negatively, depending on different random seeds. The clipping heuristic introduces certain bias in the objective function, which causes such unexpected behaviors (this even happens in bandit which has nothing to do w/ LLM or reasoning).

Perhaps it is time to abandon the clipping heuristic...

0

101

12

53

14K

Debadeepta Dey @debadeepta

about 1 year ago

@allenainie Thank you for the kind words and for making Trace! We love how easy it is to weave into complicated workflows. Excited for what Trace is cooking next.

0

2

0

100

debadeepta retweeted

Shital Shah

@sytelus

about 1 year ago

A different and interesting work from my ex-colleague Dey: How do you generate Pareto frontier for the agentic workflow? Many practical applications must balance cost vs performance for agents and this pioneering work shows the way!

0

10

1

3

1K

debadeepta retweeted

roma 🦁 @roma_glushko

about 1 year ago

✨Meet syftr, a new OSS framework to find the best RAG workflows (both agentic and not) balancing cost/latency/accuracy using multi-objective Bayesian Optimization https://t.co/yvk2FKDr9d

1

4

3

343

Debadeepta Dey @debadeepta

about 1 year ago

6️⃣Want to get involved?
📖 Technical blog post and full paper (to appear at @automl_conf).
💻 Try syftr https://t.co/YiuVrvt0dd
🙌 Contribute via PRs

0

1

0

1

198

Debadeepta Dey @debadeepta

about 1 year ago

5️⃣syftr is made possible thanks to:
Ray for distributed search orchestration. @anyscalecompute
LlamaIndex for building advanced workflows. @llama_index
HuggingFace Datasets for fast dataset interfaces. @huggingface
Starting with question-answering and actively expanding tasks

1

0

203
