Herumb Shandilya 🦀

Verified account

@krypticmouse

Research Engineer @mixedbreadai | Building DSRs | MSCS, Research, ColBERT, DSPy @Stanford

Stanford

Joined December 2013

521 Following

2.5K Followers

1.8K Posts

Pinned Tweet

Herumb Shandilya 🦀

9 months ago

DSRs, @DSPyOSS for Rust is here🚀 Happy to finally announce the stable release of DSRs. Over the past few months, I’ve been building DSRs with incredible support and contributions from folks Maguire Papay, @tech_optimist, and @joshmo_dev. A big shout out to @lateinteraction and @ChenMoneyQ who were the first people to hear my frequent rants on this!! Couldn't have done this without all of them. DSRs originally started as a passion project to explore true compilation and as it progressed I saw it becoming more. I can’t wait to see what the community builds with it. DSRs is a 3 phase project: 1. API Stabilization. We are nearly done with this and it was mostly implementing the API design. We kept the DSPy style in mind and tried to keep it close to it so it's easier to onboard and while at it we tried to improve it and make it a bit more idiomatic and intuitive! 2. Performance Optimisation with benchmarking vs DSPy. We want to benchmark LLMs performance vs DSPy, with API design finalized we want to improve performance in every front. We'll improve the latency and improve the templates and optimizers in DSRs. 3. True Module Compilation. Why should you optimize signature when you can optimize and fuse much more? This is the idea of the final phase of DSRs. A true LLM workflow compiler. More on this after Phase 2. Really grateful for @PrimeIntellect offering compute to drive Phase 2 and 3 experimentation for this! Big shoutout to them and @johannes_hage for this!!! But what is DSRs? What does it offer? Let's see.

krypticmouse's tweet photo. DSRs, @DSPyOSS for Rust is here🚀

Happy to finally announce the stable release of DSRs. Over the past few months, I’ve been building DSRs with incredible support and contributions from folks Maguire Papay, @tech_optimist, and @joshmo_dev.

A big shout out to @lateinteraction and @ChenMoneyQ who were the first people to hear my frequent rants on this!! Couldn't have done this without all of them.

DSRs originally started as a passion project to explore true compilation and as it progressed I saw it becoming more. I can’t wait to see what the community builds with it.

DSRs is a 3 phase project:

1. API Stabilization. We are nearly done with this and it was mostly implementing the API design. We kept the DSPy style in mind and tried to keep it close to it so it's easier to onboard and while at it we tried to improve it and make it a bit more idiomatic and intuitive!

2. Performance Optimisation with benchmarking vs DSPy. We want to benchmark LLMs performance vs DSPy, with API design finalized we want to improve performance in every front. We'll improve the latency and improve the templates and optimizers in DSRs.

3. True Module Compilation. Why should you optimize signature when you can optimize and fuse much more? This is the idea of the final phase of DSRs. A true LLM workflow compiler. More on this after Phase 2.

Really grateful for @PrimeIntellect offering compute to drive Phase 2 and 3 experimentation for this! Big shoutout to them and @johannes_hage for this!!!

But what is DSRs? What does it offer? Let's see.

12

212

24

96

40K

krypticmouse retweeted

Mixedbread @mixedbreadai

4 days ago

By now, everyone knows that single-vector embedding models are hugely limiting for modern workflows. But they contain than you think: you can extract sparse Latent Terms from them. And it turns out that BM25 is all you need to turn this vocabulary into a strong retriever.

6

190

24

182

38K

Herumb Shandilya 🦀

5 days ago

@halcyonrayes @USCViterbi @USC Congratulations sir 🚀💪🏻!! Upwards and Onwards 🔥🔥🔥

1

2

0

0

127

krypticmouse retweeted

Jon Saad-Falcon

9 days ago

The dominant story in AI has been the growing cloud: bigger clusters, larger models, more gigawatts. We believe the future is in the opposite direction: on-device inference, smaller models, watts instead of gigawatts. Today we're releasing @OpenJarvisAI v1.0: a personal AI assistant that lives, learns, and works on your device.

49

599

91

565

145K

Who to follow

ishan.disbalanced

@loadDisbalanced

i am ishan || migrant technology worker in ncr

"Two there are who are never satisfied - the lover of the world and the lover of knowledge."

Verified account

AI at @Accel. Views personal. Ran https://t.co/sVSWudeYGd before.

Herumb Shandilya 🦀

10 days ago

@KShivendu_ @Stanford @mixedbreadai Thanks Shivedu!!

0

1

0

0

47

Herumb Shandilya 🦀

11 days ago

A pretty late update, but I've graduated from @Stanford and joined @mixedbreadai as a Research Engineer! Excited to help build and optimize the future of search 🚀🚀 It’s been such a fun place to work! Every day, I wake up excited to learn more and more!! Big thanks to @lateinteraction, @JonSaadFalcon and everyone who helped me throughout this journey!! As I always say my achievements are barely mine and much more of the people around me. Grateful to all of them🙏. This is just the beginning.

11

70

6

11

7K

krypticmouse retweeted

Mixedbread @mixedbreadai

10 days ago

New: grep for exact matching grep → keyword / regex matching search → fine-grained semantic retrieval Works across uploaded content, including text, PDFs (OCR) and audio/video (transcription). Give your agents both retrieval primitives to perform at their best.

mixedbreadai's tweet photo. New: grep for exact matching

grep → keyword / regex matching
search → fine-grained semantic retrieval

Works across uploaded content, including text, PDFs (OCR) and audio/video (transcription).

Give your agents both retrieval primitives to perform at their best.

2

65

5

37

5K

Herumb Shandilya 🦀

10 days ago

@RuiTheBaker @Stanford @mixedbreadai 🥖

0

0

0

0

54

Herumb Shandilya 🦀

10 days ago

@antoine_chaffin @Stanford @mixedbreadai Thank you Antoine!!!

0

1

0

0

82

Herumb Shandilya 🦀

10 days ago

@cataluna84 @Stanford @mixedbreadai Thank you!!

0

0

0

0

45

Herumb Shandilya 🦀

11 days ago

@JonSaadFalcon @Stanford @mixedbreadai Thank you Jon! And thank you so much for all the support 💪🏻🙏🏻

0

1

0

0

104

Herumb Shandilya 🦀

11 days ago

@SwishMoe @Stanford @mixedbreadai Thank you!!!

0

0

0

0

103

Herumb Shandilya 🦀

11 days ago

@nikilravi @Stanford @mixedbreadai Thanks Nikil!!

0

1

0

0

158

Herumb Shandilya 🦀

11 days ago

@swayaminsync @Stanford @mixedbreadai Thanks bro!!

0

0

0

0

104

Herumb Shandilya 🦀

11 days ago

@joshmo_dev @Stanford @mixedbreadai Thank you so much Josh!!

0

1

0

0

65

Herumb Shandilya 🦀

11 days ago

@CShorten30 @Stanford @mixedbreadai Thank you Connor!!

0

1

0

0

157

krypticmouse retweeted

Mixedbread @mixedbreadai

12 days ago

Feature: Native agentic search on Mixedbread Search with auto-planning, exploration, and multi-hop reasoning across documents. Built for: - evidence discovery - exhaustive search - cross-document reasoning → Topped MADQA @snowflake with 93.4% accuracy across 18,000 PDF pages.

mixedbreadai's tweet photo. Feature: Native agentic search on Mixedbread

Search with auto-planning, exploration, and multi-hop reasoning across documents.

Built for:
- evidence discovery
- exhaustive search
- cross-document reasoning

→ Topped MADQA @snowflake with 93.4% accuracy across 18,000 PDF pages.

1

81

13

47

9K

krypticmouse retweeted

Mixedbread @mixedbreadai

26 days ago

Introducing mxbai-rerank-v3-listwise: reranking that goes beyond binary relevance. It reads the whole candidate set, resolves conflicts, and ranks by directives like recency, source priority, and multi-step rules. +11% NDCG@10 on average across multiple domains, modalities, and languages in runs with Wholembed v3. Available today in preview in Mixedbread.

mixedbreadai's tweet photo. Introducing mxbai-rerank-v3-listwise: reranking that goes beyond binary relevance.

It reads the whole candidate set, resolves conflicts, and ranks by directives like recency, source priority, and multi-step rules.

+11% NDCG@10 on average across multiple domains, modalities, and languages in runs with Wholembed v3.

Available today in preview in Mixedbread.

5

136

18

72

25K

Herumb Shandilya 🦀

about 2 months ago

@lateinteraction The other half were busy chasing project/assignment deadlines

0

1

0

0

167

Herumb Shandilya 🦀

2 months ago

@elrbtclr Keyboard src?

1

1

0

0

53

Herumb Shandilya 🦀

2 months ago

0

1

0

0

55

Last Seen Users on Sotwe

Trends for you

Most Popular Users