Congratulations to our very own @antonio_mallia, @caesar_one_, and @JackPertschuk – as well as their co-authors – on their accepted #ECIR2025 research papers! 🎉 They continue to push the state-of-the-art forward on information retrieval, and we as an industry are better for it! 📚
📜 Sean MacAvaney, Antonio Mallia and Nicola Tonellotto: “Efficient Constant-Space Multi-Vector Retrieval", 2025
📜 Kaili Huang, Thejas Venkatesh, Uma Dingankar, Antonio Mallia, Daniel Campos, Jian Jiao, Christopher Potts, Matei Zaharia, Kwabena Boahen, Omar Khattab, Saarthak Sarup and Keshav Santhanam: “ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring”, 2025
📜 Cesare Campagnano, Antonio Mallia, Jack Pertschuk and Fabrizio Silvestri: “E2Rank: Efficient and Effective Layer-wise Reranking”, 2025
Don't miss @Andrea_Bacciu and @caesar_one_ ’s presentation on Thursday at @LrecColing .They’ll be sharing their paper "DanteLLM: Let’s Push Italian LLM Research Forward!”, coauthored with @fabreetseo and @GioTrappolini . Room London @ Lingotto Conference Centre, 3.30 pm CET.
This is actually huge:
- No SFT stage (e.g., Zephyr used 200k examples)
- Preference tuning with 7K examples only (other models trained with at least 60k samples)
I've put a lot of care & love building the DPO version of the amazing Capybara dataset from @ldjconfirmed so I'm really pleased to see these results.
Let's double down on useful open data for OSS AI developers and researchers
Redefining Retrieval in RAG
A nice comprehensive study that focuses on the components needed to improve the retrieval component of a RAG system.
Confirms that the position of relevant information should be placed near the query. The model will struggle to attend to the information if this is not the case.
Surprisingly, it finds that related documents don't necessarily lead to improved performance for the RAG system. Even more unexpectedly, irrelevant and noisy documents can actually help drive up accuracy if placed correctly.
We need more systematic studies around RAG. The hard part of a RAG system is typically the retriever component. Just dumping relevant docs into the context is not an effective approach but it's what a lot of LLM devs do.
I like that the Ragas library proposes the use of several metrics for assessing a RAG system at both the generation and retrieval stages, including an end-to-end evaluation. It's a good first step but we still need better ways to integrate external information that can be effectively leveraged by the generative component.
@karpathy Cool! 💪 Looks like we share the same vision. Feel free to have a look at the preprint of our paper "Prompt-to-OS" which has been accepted at the Vision track of the next IEEE CogMI conf. A joint work with @gtolomei, @fabreetseo and @GioTrappolini.
Link: https://t.co/DtBlRQP2VE
Still can't handle the indecisiveness between Barbie and Oppenheimer? 😫💥 Don't fret!
Come to the presentation of our new perspective paper, "Multimodal Neural Databases", where we lay out the vision for database-like queries on multimodal data.
Tomorrow @SIGIR2023, 1.30pm GMT+8
QLoRA: 4-bit finetuning of LLMs is here! With it comes Guanaco, a chatbot on a single GPU, achieving 99% ChatGPT performance on the Vicuna benchmark:
Paper: https://t.co/J3Xy195kDD
Code+Demo: https://t.co/SP2FsdXAn5
Samples: https://t.co/q2Nd9cxSrt
Colab: https://t.co/Q49m0IlJHD
MMS: Massively Multilingual Speech.
- Can do speech2text and text speech in 1100 languages.
- Can recognize 4000 spoken languages.
- Code and models available under the CC-BY-NC 4.0 license.
- half the word error rate of Whisper.
Code+Models: https://t.co/NIGfUZ8KZg
Paper: https://t.co/W15aEWHGIR
Blog: https://t.co/TFKXFtlPwc
Presentiamo il più grande LLM italiano realizzato dal gruppo di ricerca RSTLess della Sapienza Università di Roma.
Il team di ricerca dietro Fauno comprende @Andrea_Bacciu, @GioTrappolini, Prof @EmanueleRodola , @teelinsan e il Prof @fabreetseo .
https://t.co/hnX4VC7w6x
Three papers accepted at NeurIPS'22 (!!)
1) Efficiently training low-curvature neural networks (https://t.co/nL2FpxuNKh), w/ Kyle Matoba, @hima_lakkaraju, @francoisfleuret
We propose to build NNs that are "as linear as possible", and thus eliminate excess model curvature.
Hey #NLProc, I built this little tool to make working with @huggingface 🤗Transformers a bit easier. If you want to directly access whole-word embeddings hassle-free, give it a try!
👉GitHub: https://t.co/HHuR0KKsY9
This week @Google researchers announced Minerva, an internally developed project that can answer mathematical questions and tackle other complex topics such as physics.
1/5