Stéphane Clinchant @sclincha - Twitter Profile

sclincha retweeted

Eiso Kant

@eisokant

12 days ago

https://t.co/13VRNrqv45

8

132

33

59

42K

sclincha retweeted

Sumit @_reachsumit

24 days ago

Inference-Free Multimodal Learned Sparse Retrieval for Production-Scale Visual Document Search Naver introduces a sparse retriever that directly indexes visual documents and serves text queries without neural query encoding. 📝 https://t.co/BNXWR5GH7F 👨🏽‍💻 https://t.co/DBC82Fwv2Q

0

26

4

10

1K

sclincha retweeted

Amélie Chatelain

@AmelieTabatta

about 1 month ago

I told you it would be a stacked line-up! @N1colAIs @thibault_formal and @antoine_chaffin talking tomorrow at the Search Meetup™️ (along with others whose twitter @ I don't have) Join us for nice presentations and chats ❤️

https://t.co/uHk8nRnfdc

1

18

5

1

641

Stéphane Clinchant @sclincha

about 1 month ago

@_reachsumit @santoshradha FYI, I used the q-log on the term frequencies in 2012: https://t.co/kmAmjsuZSz

0

1

0

1

50

sclincha retweeted

Databricks @databricks

about 1 month ago

Databricks is proud to be a Founding Gold Sponsor of @TheOfficialACM Conference on AI and Agentic Systems—the first ACM conference dedicated to compound AI and agentic systems, with our co-founder @matei_zaharia on the organizing committee. Join us May 26–29 in San Jose for the premier event for rigorous, reproducible research in compound AI architectures, optimization, and deployment. Register today: https://t.co/Y0b2NhQjWv

2

57

14

8

6K

sclincha retweeted

Simon Lupart @simon_lupart

3 months ago

Most code search systems rely on dense embeddings. In this work, we release SPLADE-Code, learned sparse retrieval models for code retrieval, with strong generalization, high interpretability, compatibility with inverted indexes, and working across 20+ programming languages.

3

47

7

23

7K

sclincha retweeted

Thibault Formal

@thibault_formal

3 months ago

New sparse retrieval model: introducing SPLARE, which extends SPLADE by replacing the vocabulary head with pretrained SAEs! paper: https://t.co/Un2zhX14KR (ICLR'26) also how we won the WSDM'26 Cup on multilingual retrieval: https://t.co/77QlgZsnls (model weights coming soon!)

1

46

8

26

4K

sclincha retweeted

Xin Eric Wang

@xwang_lk

about 1 year ago

𝘏𝘶𝘮𝘢𝘯𝘴 𝘵𝘩𝘪𝘯𝘬 𝘧𝘭𝘶𝘪𝘥𝘭𝘺—𝘯𝘢𝘷𝘪𝘨𝘢𝘵𝘪𝘯𝘨 𝘢𝘣𝘴𝘵𝘳𝘢𝘤𝘵 𝘤𝘰𝘯𝘤𝘦𝘱𝘵𝘴 𝘦𝘧𝘧𝘰𝘳𝘵𝘭𝘦𝘴𝘴𝘭𝘺, 𝘧𝘳𝘦𝘦 𝘧𝘳𝘰𝘮 𝘳𝘪𝘨𝘪𝘥 𝘭𝘪𝘯𝘨𝘶𝘪𝘴𝘵𝘪𝘤 𝘣𝘰𝘶𝘯𝘥𝘢𝘳𝘪𝘦𝘴. But current reasoning models remain constrained by discrete tokens, limiting their full potential. Introducing 𝐒𝐨𝐟𝐭 𝐓𝐡𝐢𝐧𝐤𝐢𝐧𝐠: a training-free method that mimics human-like “soft” reasoning by generating continuous, abstract concept tokens. These tokens smoothly blend multiple meanings through probability-weighted mixtures of embeddings, enabling richer representations and seamless exploration of diverse reasoning paths. 𝐓𝐡𝐞 𝐢𝐦𝐩𝐚𝐜𝐭? ✅ Improved accuracy on math & code benchmarks by up to 2.48% (pass@1). ✅ Reduced token usage by up to 22.4%, making reasoning models both smarter and more efficient.

26

911

137

1K

120K

sclincha retweeted

Nadia Chirkova @nadiinchi

about 1 year ago

Arrived in Singapore for #ICLR2025 and will be presenting PROVENCE on Friday, Poster session 3 at 10am, poster #255! Blogpost: https://t.co/Q7TRYP04ZV Will be happy to meet & chat about #LLMs, #RAG, #InformationRetrieval and #MultilingualNLP :) #NLProc @naverlabseurope

https://t.co/gRZgznKJpY

2

16

4

2

769

sclincha retweeted

Vaibhav (VB) Srivastav

@reach_vb

over 1 year ago

AllenAI COOKED, Llama 3.1 Tulu 405B beats DeepSeek V3 - all whilst being 40% SMALLER! 🔥 Fully open model weights, data and training pipeline 🤗

https://t.co/V9nqnjxAIm

17

419

55

153

46K

Stéphane Clinchant @sclincha

almost 2 years ago

We welcome contributors to add datasets, metrics, and other tasks to BERGEN. Join us! 🤝 #RAG #LLMs

0

1

0

102

Stéphane Clinchant @sclincha

almost 2 years ago

What’s a good baseline for RAG? 🤔 The literature shows consistent differences in experimental setups, retrievers, datasets, and metrics. So, we built the BERGEN library https://t.co/9srOoFQNQ5 to enhance reproducibility and identify strong baselines: 🧵 @naverlabseurope

1

21

7

5

1K

Stéphane Clinchant @sclincha

almost 2 years ago

Our recommendations are detailed in our first arXiv paper, with additional findings on multilingual RAG in our second paper. https://t.co/3MkkSraIcd https://t.co/qk6jp9GQvF

1

0

103

Stéphane Clinchant @sclincha

about 2 years ago

😀We're looking for a talented researcher to join our team at Naver Labs Europe (@naverlabseurope), working on LLMs and Retrieval!😃 Please apply here: https://t.co/0lea7ABHld

0

42

19

18

9K

Stéphane Clinchant @sclincha

about 2 years ago

@jerryjliu0 @rpradeep42 If efficiency matters, a simpler solution is to use a state-of-the-art reranker (cf. our study comparing LLMs and cross-encoders): https://t.co/SOpZOgQIFy

1

0

128

Stéphane Clinchant @sclincha

over 2 years ago

... especially when reviewers said ‘dense retrieval on its own has shown to surpass sparse retrieval considerably’ and that our ‘approach is quite incremental’ 2/2

0

180

Stéphane Clinchant @sclincha

over 2 years ago

It feels good when someone from a big company shares that they saw ‘pretty promising results in terms of quality and space savings [for SPLADE] compared to dense embedding models’ ... 1/2

1

7

0

394

Stéphane Clinchant @sclincha

over 2 years ago

@andysingal @thibault_formal @naverlabseurope https://t.co/woBcjVkDQn

0

1

0

32

sclincha retweeted

Laure Soulier @LaureSoulier

almost 3 years ago

What a great pleasure and honor to share this session about generative AI, ethics, bias, and politics with 3 passionate speakers @plimantour, Andrew Wyckoff, and Juha Heikkilä. Thanks @AI2S2Symposium for the invitation. See you in Geneva on Monday!

0

14

2

0

928
