Saiful Haq

@RetrieveRerank

On a career break. Prev. Director of AI and Staff Research Engineer @Hyperbots_Inc. IIT Bombay 5th Year CS PhD @cfiltnlp @iitbombay. Building in stealth 🚀

Bengaluru, India

Joined August 2023

136 Following

100 Followers

120 Posts

Pinned Tweet

Saiful Haq @RetrieveRerank

over 2 years ago

🇮🇳 Releasing resources for Multilingual Search in 11 Indian languages!
1⃣ INDIC-MARCO (Translated version of MSMARCO in 11 Indian Languages): https://t.co/R7K69S0cSR
2⃣ Indic-ColBERT (11 Multilingual ColBERT Models): https://t.co/XE4YoMZYOt
Paper: https://t.co/a9Rl3Jo87w

3

24

4

21

7K

RetrieveRerank retweeted

@lateinteraction

2 months ago

overwhelming evidence for late interaction / multi-vector models yet again :-) > even after finetuning, single-vector models lag far behind multi-vector embeddings, which achieve significant performance gains and exhibit greater robustness to catastrophic forgetting.

4

89

7

47

9K

RetrieveRerank retweeted

@lateinteraction

2 months ago

The “grep-is-all-you-need” nonsense arguments arise from the fact that too many people think neural search means single-vector IR, which does in fact suck. But we’ve known that since 2019. Quoting @aaxsh18, CEO of Mixedbread: > late interaction cant stop winning

9

225

15

165

25K

RetrieveRerank retweeted

Niyati Chhaya @niyatic

4 months ago

Hackathon alert: The Financial needle in the Haystack. Join us! https://t.co/hSiGFNSjp4 via @LumaHQ #bangalore #financeAI #documents #startups

0

2

1

1

150

RetrieveRerank retweeted

4 months ago

Makes me wonder if Omar sees the rest of the world with this much clarity or just ML/AI. Let’s hear some hot political takes haha

3

21

1

6

9K

RetrieveRerank retweeted

ukituki @ukituki

4 months ago

@gooby_esq Dspy.RLM("Given all his papers reverse engineer and extract author's mental models") 😅

0

3

1

0

205

RetrieveRerank retweeted

@lateinteraction

5 months ago

Of course not. In fact, my lab is simultaneously building RLMs as the next paradigm for LLMs *and* developing the next paradigm for retrieval (stay tuned!). Retrieval will not go anywhere: if you have a large corpus with, say, billions of tokens over which you issue many queries, you necessarily need to build some index data structures that enable fast sub-linear access. RLMs may internally choose to build such an index when it proves to be an effective tool, but fundamentally RLMs are about long one-off context. You wouldn’t typically put an RLM over a million documents and expect that to be the optimal system design. (Thank you for the question @jayitabhattac11 !)

21

209

17

71

16K

RetrieveRerank retweeted

5 months ago

For those interested in making OSS contributions to the RLM repo, I've added a bunch of random thoughts and TODOs of what to add in a *messy* Markdown file on the GH repo. Feel free to tackle any of them, or any other things you think are meaningful. I'll be pretty active here or on the repo. Once I finish some other related work, I might open up a Discord channel or something for people who want to make longer standing contributions to the repo / discuss the direction of where to take it. Cheers! https://t.co/EVK7g0vzf0

19

292

30

148

17K

Saiful Haq @RetrieveRerank

5 months ago

@a1zhang This is awesome!

0

1

0

0

92

RetrieveRerank retweeted

@lateinteraction

5 months ago

@a1zhang IMO, RLMs are as “language model”-y as modern “LLMs” or Reasoning Models are truly “statistical models of language”. All three are a bit of a stretch BUT in the same way. Pedantically, all three are language processing systems, eg recursive/reasoning language processing system.

0

20

1

6

1K

RetrieveRerank retweeted

6 months ago

Check out our text leaderboard at https://t.co/BglMcHqApl and our SVG leaderboard at https://t.co/iUETLPq1SR!

0

7

1

1

842

Saiful Haq @RetrieveRerank

6 months ago

@prithivida Yupp AI’s SVG leaderboard might be relevant.

1

1

0

0

80

Saiful Haq @RetrieveRerank

6 months ago

Awesome!

6 months ago

while everyone’s reading about the gpt-5.2 release, i’m still training gpt-oss-20b on a dataset generated with gpt-oss-120b!

13

188

12

54

17K

1

2

0

0

319

Saiful Haq @RetrieveRerank

6 months ago

@MaziyarPanahi Nothing beats a fine tuned model in niche domains!

1

1

0

0

28

RetrieveRerank retweeted

@lateinteraction

6 months ago

> You’ll implement ColBERT to understand multi-vector search [and] apply ColPali for patch-level image retrieval. So happy to see the great folks at @DeepLearningAI @AndrewYNg host a course on late interaction (ColBERT, ColPali et al) after their short course on DSPy :D

3

114

8

45

10K

RetrieveRerank retweeted

Swaroop Nath @swaroopnath6

6 months ago

Please consider applying to the program. Over two years, my research skills and my perspective on research have both been broadened and sharpened. This is an exceptional group, in the way they groom you and give you room to explore wild ideas. Pls reach out if you have questions!

0

6

1

0

416

RetrieveRerank retweeted

@lateinteraction

6 months ago

lateinteraction's tweet photo. https://t.co/z5lC1CPMTV

4

264

16

141

35K

RetrieveRerank retweeted

@lateinteraction

7 months ago

Martin @martin_casado and I had a fun hour-long chat about why we need an AI software layer, and why that's true even if AGI arrives. This is basically my take on why "the model" is definitely NOT "the product", though models are one way you may decide to implement some products

15

182

37

102

33K

RetrieveRerank retweeted

Ankur Gupta @getpy

7 months ago

Happy Friday Everyone, DSPyWeekly Issue #11 is live! 🚀
Highlights:
🔹 A cookbook for Self-Evolving Agents
🔹 Teaching local models tool-calling
🔹 New DSPy + Neo4j integration
🔹 A new "Events" section to track DSPy meetups!
Plus new projects like codex_dspy & AUTODSPy.
#DSPy #AI #LLMs #AgenticAI #Neo4j

7

77

9

45

11K

Saiful Haq @RetrieveRerank

7 months ago

@lateinteraction PEFT as an idea is clean and modular. LoRA is a bit of a hack that happens to work. Yet experimentally, it is the most effective PEFT.

0

2

0

0

529

RetrieveRerank retweeted

@lateinteraction

7 months ago

The labs don't want you to know this (jk) but they have no clue how to best prompt their own models either. To some approximation, you just pre-/post-train it on a lot of data, intervene on certain behaviors, and what comes out is what comes out.

2

2

1

0

1K
