Max Conti @mlpc123 - Twitter Profile

mlpc123 retweeted

16 days ago

if you write some really good code these days everyone thinks you did it with Claude Code. if you lose a lot of weight everyone thinks you did it with Ozempic. at least you can still be really good at dancing. they don't make ozempic or Claude Code for being good at dancing

40

1K

60

105

30K

mlpc123 retweeted

Bo @bo_wangbo

17 days ago

okay maybe it's a good time? We have a small colbert model trained at pplx, it is a continue-training of pplx-embed-0.6b, so native multilingual, just made it open and added a section how to use MaxSim kernel: https://t.co/iwa0PrTPm4

7

100

18

56

24K

Max Conti @mlpc123

20 days ago

Exciting publication alert!! Congrats to the whole team for the nice ideas and thorough execution 🎯 Hyped to take the time to deep-dive into every part :)

Manuel Faysse

@ManuelFaysse

20 days ago

🚨 Do LLMs need to store everything they read in memory? To reduce KV cache size and improve decoding speeds, we propose Self-Pruned KV attention, a mechanism where the model learns to decide which KVs to write in the persistent KV cache, discarding all the rest! @AIatMeta🧵

ManuelFaysse's tweet photo. 🚨 Do LLMs need to store everything they read in memory?
To reduce KV cache size and improve decoding speeds, we propose Self-Pruned KV attention, a mechanism where the model learns to decide which KVs to write in the persistent KV cache, discarding all the rest! @AIatMeta🧵 https://t.co/5UeHSpusGo

8

203

45

149

21K

0

2

0

58

mlpc123 retweeted

Raphaël Sourty

@raphaelsrty

about 1 month ago

We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each. LateOn (ColBERT, multi-vector): 57.22 NDCG@10 on BEIR. DenseOn (dense, single-vector): 56.20. Both beat models up to 4× larger We're open-sourcing the weights under Apache 2.0 🧵👇

raphaelsrty's tweet photo. We're releasing LateOn and DenseOn today. Two open retrieval models, 149M parameters each.

LateOn (ColBERT, multi-vector): 57.22 NDCG@10 on BEIR.

DenseOn (dense, single-vector): 56.20.

Both beat models up to 4× larger

We're open-sourcing the weights under Apache 2.0 🧵👇 https://t.co/Qq6KTcSEPS

2

167

27

92

10K

mlpc123 retweeted

Antoine Chaffin

@antoine_chaffin

about 1 month ago

The new generation of open state-of-the-art single and multi-vector retrieval models is here It's time, DenseOn with the LateOn 🎶 @LightOnIO releases models that leap past existing ones, and everything you need to do the same!

antoine_chaffin's tweet photo. The new generation of open state-of-the-art single and multi-vector retrieval models is here

It's time, DenseOn with the LateOn 🎶

@LightOnIO releases models that leap past existing ones, and everything you need to do the same! https://t.co/B96cNdqn7b

13

224

52

104

40K

mlpc123 retweeted

Antoine E. @antoine_edy

2 months ago

Update on the Late-Interaction Workshop @ ECIR2026: it was amazing! 🇳🇱 So cool to meet late-interactors IRL for the first time, I enjoyed every bit of that day 🤗 And congrats & huge thanks to the organizers again. Link to our work + poster below ⬇️

antoine_edy's tweet photo. Update on the Late-Interaction Workshop @ ECIR2026: it was amazing! 🇳🇱

So cool to meet late-interactors IRL for the first time, I enjoyed every bit of that day 🤗 And congrats & huge thanks to the organizers again.

Link to our work + poster below ⬇️ https://t.co/8GhyIW33yV

3

35

8

2

3K

Max Conti @mlpc123

2 months ago

Looking forward to meeting fellow late interactors! 🤓🇳🇱

Antoine E. @antoine_edy

2 months ago

@mlpc123, @MaceQuent1 and I will be at the Late Interaction Workshop at #ECIR2026 on Thursday 🤗 See you in Delft 🇳🇱

1

14

4

0

3K

0

1

0

53

mlpc123 retweeted

paul @pteiletche

3 months ago

Great work from @weaviate_io that compares the performances of text retrievers and multimodal ones. It appears that their errors are complementary, which makes their combination in hybrid search promising. Check their paper! https://t.co/vu9Nwcjpjs

1

9

4

2

1K

mlpc123 retweeted

paul @pteiletche

3 months ago

Love this from @weaviate_io!

1

13

5

2

2K

mlpc123 retweeted

Manuel Faysse

@ManuelFaysse

4 months ago

Most practicionners would agree that text embeddings should be "contextual" - ie. they should encode a passage w.r.t. the wider scope of the entire document the passage stems from; "They beat the British" could refer to football or french history without further context... In ConTEB (https://t.co/Qhezrjbmks), we highlight the standard failure modes of embedding models on retrieval tasks that require context to be properly embedded. We also propose a training strategy that extends standard "late chunking" to teach models to infuse embeddings with just the right amount of contextual knowledge to optimize retrieval. Super happy to see some new work by @perplexity_ai on contextual embedding models. They eval on ConTEB and use our in-sequence contrastive loss, along with a ton of cool techniques in multiple phases of training. Love the work @bo_wangbo and will read in details, but super happy to see one more stone towards contextual embedding models, in the path already traveled by @hxiao and @jxmnop ! Link to the paper: https://t.co/qlWRcn4QRx

2

37

6

22

2K

mlpc123 retweeted

Macé Quentin @MaceQuent1

5 months ago

Very proud of this paper, we lead many more experiments since the first release. I think we made a pretty complete analysis of what is currently possible with the benchmark ! Thanks again to everyone involved !

0

5

2

0

81

mlpc123 retweeted

Manuel Faysse

@ManuelFaysse

5 months ago

In our EMNLP 2025 Oral paper with @mlpc123, we propose an extension to Late Chunking and demonstrate how we can embed contextual information within passage embeddings... and why it's often very useful to improve document retrieval! (9/15) https://t.co/SrD61QkxAQ

1

3

2

0

132

mlpc123 retweeted

paul @pteiletche

7 months ago

And happy to see our dear ModernVBERT competing with models much larger on it!

1

4

2

0

85

Max Conti @mlpc123

7 months ago

So happy for you that this project finally sees the day, more than a simple extension of the previous versions! Congrats for the hard work @MaceQuent1 @antonio_loison 🥳

António Loison @antonio_loison

7 months ago

📢 ViDoRe V3, our new multimodal retrieval benchmark for enterprise use cases, is finally here! It focuses on real-world applied RAG scenarios using high-quality human-verified data. https://t.co/Fs6W2gNQJc 🧵(1/N)

antonio_loison's tweet photo. 📢 ViDoRe V3, our new multimodal retrieval benchmark for enterprise use cases, is finally here!
It focuses on real-world applied RAG scenarios using high-quality human-verified data. https://t.co/Fs6W2gNQJc
🧵(1/N) https://t.co/7T4WWbAnJd

5

75

19

50

13K

0

1

0

71

Max Conti @mlpc123

7 months ago

@ManuelFaysse Friday 7th 10h45am! Looking forward :))

0

25

Max Conti @mlpc123

7 months ago

@JinaAI_ Why did you organize the BoF at the same time as a Information Extraction and Retrieval Oral session 😢 Will be presenting our work that builds on top of Late Chunking there instead :)

0

1

0

57

Max Conti @mlpc123

7 months ago

I'll be in Suzhou next week to present this project as an Oral at #EMNLP2025! 🥳 Let me know if you're there and wanna get in touch, or if you know anyone who'd be interested :) Looking forward! 🙌🇨🇳 https://t.co/ckRv8yHHmQ

Max Conti @mlpc123

about 1 year ago

🕺Super happy to release our latest work with @ManuelFaysse: in our paper "Context Is Gold to Find the Gold Passage", we share all our findings on how to train embedding models to meaningfully include doc-wide context into chunks - leading to convincing results! 🧑‍🍳 🧵1/N

mlpc123's tweet photo. 🕺Super happy to release our latest work with @ManuelFaysse: in our paper "Context Is Gold to Find the Gold Passage", we share all our findings on how to train embedding models to meaningfully include doc-wide context into chunks - leading to convincing results! 🧑‍🍳 🧵1/N https://t.co/M5warsjGXe

1

26

5

17

2K

0

2

0

96

Max Conti @mlpc123

8 months ago

Looking forward to what @ManuelFaysse @pteiletche and @antonio_loison will be cooking next, and to the next one with @MaceQuent1 😉🤝

0

2

0

53

Max Conti @mlpc123

8 months ago

Super excited to finally release this 🕺 What a fun project that was with these guys, s/o to them for all the great work!! https://t.co/Lnh1d1DJI4

paul @pteiletche

8 months ago

Introducing ModernVBERT: a vision-language encoder that matches the performance of models 10× its size on visual document retrieval tasks! 👁️ Read more in the thread👇 (1/N)

pteiletche's tweet photo. Introducing ModernVBERT: a vision-language encoder that matches the performance of models 10× its size on visual document retrieval tasks! 👁️

Read more in the thread👇 (1/N) https://t.co/qHB35KGMny

7

210

34

141

89K

1

3

0

96

Max Conti @mlpc123

8 months ago

Besides our main results, it was also really interesting to look at some understudied training dynamics in more details Our findings suggest that we've been using visual encoders far below their potential, and I think we can expect a lot of improvements building on top of this!

1

0

47

Max Conti

@mlpc123

Last Seen Users on Sotwe

Trends for you

Most Popular Users