Cesare Campagnano @caesar_one_ - Twitter Profile

Pinned Tweet

about 4 years ago

I’m thrilled to participate in such a prestigious conference with my first paper! See you in Dublin at #ACL2022 😎 #NLProc

SapienzaNLP @SapienzaNLP

about 4 years ago

#NLPaperAlert 📢 We bring together existing resources, revise them, and propose SRL4E, a unified evaluation on Semantic Role Labeling 4 Emotions! Read our #ACL2022 preprint: https://t.co/Wnqe4waMUD By @caesar_one_ @ConiaSimone @RNavigli + @ERC_Research @EuroLangTech #NLProc

SapienzaNLP's tweet photo. #NLPaperAlert 📢 We bring together existing resources, revise them, and propose SRL4E, a unified evaluation on Semantic Role Labeling 4 Emotions!

Read our #ACL2022 preprint: https://t.co/Wnqe4waMUD

By @caesar_one_ @ConiaSimone @RNavigli
+ @ERC_Research @EuroLangTech #NLProc https://t.co/WXzAzFJZa3

1

30

13

3

0

6

0

caesar_one_ retweeted

RSTLess group @RSTLessGroup

over 1 year ago

We are very excited to share that the work of @caesar_one_ , @antonio_mallia , @JackPertschuk and @fabreetseo has been accepted to #ECIR2025 as a #shortpaper. See you in #Lucca. @ecir2025 @pinecone #AI #Research #IR #industry

0

10

4

0

602

caesar_one_ retweeted

Pinecone

@pinecone

over 1 year ago

Congratulations to our very own @antonio_mallia, @caesar_one_, and @JackPertschuk – as well as their co-authors – on their accepted #ECIR2025 research papers! 🎉 They continue to push the state-of-the-art forward on information retrieval, and we as an industry are better for it! 📚 📜 Sean MacAvaney, Antonio Mallia and Nicola Tonellotto: “Efficient Constant-Space Multi-Vector Retrieval", 2025 📜 Kaili Huang, Thejas Venkatesh, Uma Dingankar, Antonio Mallia, Daniel Campos, Jian Jiao, Christopher Potts, Matei Zaharia, Kwabena Boahen, Omar Khattab, Saarthak Sarup and Keshav Santhanam: “ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring”, 2025 📜 Cesare Campagnano, Antonio Mallia, Jack Pertschuk and Fabrizio Silvestri: “E2Rank: Efficient and Effective Layer-wise Reranking”, 2025

pinecone's tweet photo. Congratulations to our very own @antonio_mallia, @caesar_one_, and @JackPertschuk – as well as their co-authors – on their accepted #ECIR2025 research papers! 🎉 They continue to push the state-of-the-art forward on information retrieval, and we as an industry are better for it! 📚

📜 Sean MacAvaney, Antonio Mallia and Nicola Tonellotto: “Efficient Constant-Space Multi-Vector Retrieval", 2025

📜 Kaili Huang, Thejas Venkatesh, Uma Dingankar, Antonio Mallia, Daniel Campos, Jian Jiao, Christopher Potts, Matei Zaharia, Kwabena Boahen, Omar Khattab, Saarthak Sarup and Keshav Santhanam: “ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring”, 2025

📜 Cesare Campagnano, Antonio Mallia, Jack Pertschuk and Fabrizio Silvestri: “E2Rank: Efficient and Effective Layer-wise Reranking”, 2025

0

11

2

2K

caesar_one_ retweeted

RSTLess group @RSTLessGroup

about 2 years ago

Congratulations to @caesar_one_ who defended his #PhD #thesis entitled "Foundational Advancements of Large Language Models: Current and Future Implications", advised by @gtolomei , co-advised by @fabreetseo . He will now start his #postdoc in our group. #LLM #NLP #Research

RSTLessGroup's tweet photo. Congratulations to @caesar_one_ who defended his #PhD #thesis entitled "Foundational Advancements of Large Language Models: Current and Future Implications", advised by @gtolomei , co-advised by @fabreetseo . He will now start his #postdoc in our group.
#LLM #NLP #Research https://t.co/3e4WGtG3u9

0

21

4

0

1K

Who to follow

Andrea Santilli

@teelinsan

Senior Research Engineer @NVIDIA | Prev: @Apple, @NousResearch, @GladiaLab, @BigscienceW, @picampusschool #NLProc

Agostina Calabrese 🦋

@agostina_cal

Member of Technical Staff at @cohere 👩🏻‍💻 | PhD in Natural Language Processing from the University of Edinburgh 🎓 | (she/her)

Pere-Lluís Huguet Cabot

@PereLluisHC

Posdoc at FAIR. Prev: Marie-Curie PhD at @SapienzaNLP. Working on Embeddings, Multilinguality. Projects: Omnilingual Sonar/NLLB, REBEL, Minerva, ...

caesar_one_ retweeted

RSTLess group @RSTLessGroup

about 2 years ago

Today @Andrea_Bacciu has presented the paper "DanteLLM: Let’s Push Italian LLM Research Forward!” @LrecColing , coauthored by @caesar_one_ @GioTrappolini and @fabreetseo . Here is the preprint https://t.co/7bppzufuqf #LLM #Research #LREC #PhD

RSTLessGroup's tweet photo. Today @Andrea_Bacciu has presented the paper "DanteLLM: Let’s Push Italian LLM Research Forward!” @LrecColing , coauthored by @caesar_one_ @GioTrappolini and @fabreetseo . Here is the preprint https://t.co/7bppzufuqf

#LLM #Research #LREC #PhD https://t.co/hiVAjjKKIB

0

18

6

0

608

caesar_one_ retweeted

RSTLess group @RSTLessGroup

about 2 years ago

Don't miss @Andrea_Bacciu and @caesar_one_ ’s presentation on Thursday at @LrecColing .They’ll be sharing their paper "DanteLLM: Let’s Push Italian LLM Research Forward!”, coauthored with @fabreetseo and @GioTrappolini . Room London @ Lingotto Conference Centre, 3.30 pm CET.

RSTLessGroup's tweet photo. Don't miss @Andrea_Bacciu and @caesar_one_ ’s presentation on Thursday at @LrecColing .They’ll be sharing their paper "DanteLLM: Let’s Push Italian LLM Research Forward!”, coauthored with @fabreetseo and @GioTrappolini . Room London @ Lingotto Conference Centre, 3.30 pm CET. https://t.co/7TYtphP5XP

1

16

5

0

599

caesar_one_ retweeted

Min Choi

@minchoi

about 2 years ago

Llama 3 is insanely moving fast. People are really pushing Llama 3 to its limits in incredible ways. 10 wild examples (and use cases)

minchoi's tweet photo. Llama 3 is insanely moving fast.

People are really pushing Llama 3 to its limits in incredible ways.

10 wild examples (and use cases) https://t.co/R2YejFZvsv

72

3K

442

4K

1M

caesar_one_ retweeted

Daniel Vila Suero @dvilasuero

about 2 years ago

This is actually huge: - No SFT stage (e.g., Zephyr used 200k examples) - Preference tuning with 7K examples only (other models trained with at least 60k samples) I've put a lot of care & love building the DPO version of the amazing Capybara dataset from @ldjconfirmed so I'm really pleased to see these results. Let's double down on useful open data for OSS AI developers and researchers

dvilasuero's tweet photo. This is actually huge:

- No SFT stage (e.g., Zephyr used 200k examples)
- Preference tuning with 7K examples only (other models trained with at least 60k samples)

I've put a lot of care & love building the DPO version of the amazing Capybara dataset from @ldjconfirmed so I'm really pleased to see these results.

Let's double down on useful open data for OSS AI developers and researchers

2

72

17

48

7K

caesar_one_ retweeted

elvis

@omarsar0

over 2 years ago

Redefining Retrieval in RAG A nice comprehensive study that focuses on the components needed to improve the retrieval component of a RAG system. Confirms that the position of relevant information should be placed near the query. The model will struggle to attend to the information if this is not the case. Surprisingly, it finds that related documents don't necessarily lead to improved performance for the RAG system. Even more unexpectedly, irrelevant and noisy documents can actually help drive up accuracy if placed correctly. We need more systematic studies around RAG. The hard part of a RAG system is typically the retriever component. Just dumping relevant docs into the context is not an effective approach but it's what a lot of LLM devs do. I like that the Ragas library proposes the use of several metrics for assessing a RAG system at both the generation and retrieval stages, including an end-to-end evaluation. It's a good first step but we still need better ways to integrate external information that can be effectively leveraged by the generative component.

omarsar0's tweet photo. Redefining Retrieval in RAG

A nice comprehensive study that focuses on the components needed to improve the retrieval component of a RAG system.

Confirms that the position of relevant information should be placed near the query. The model will struggle to attend to the information if this is not the case.

Surprisingly, it finds that related documents don't necessarily lead to improved performance for the RAG system. Even more unexpectedly, irrelevant and noisy documents can actually help drive up accuracy if placed correctly.

We need more systematic studies around RAG. The hard part of a RAG system is typically the retriever component. Just dumping relevant docs into the context is not an effective approach but it's what a lot of LLM devs do.

I like that the Ragas library proposes the use of several metrics for assessing a RAG system at both the generation and retrieval stages, including an end-to-end evaluation. It's a good first step but we still need better ways to integrate external information that can be effectively leveraged by the generative component.

10

884

180

903

89K

Cesare Campagnano @caesar_one_

over 2 years ago

@karpathy Cool! 💪 Looks like we share the same vision. Feel free to have a look at the preprint of our paper "Prompt-to-OS" which has been accepted at the Vision track of the next IEEE CogMI conf. A joint work with @gtolomei, @fabreetseo and @GioTrappolini. Link: https://t.co/DtBlRQP2VE

0

6

1

0

206

caesar_one_ retweeted

Giovanni Trappolini @GioTrappolini

almost 3 years ago

Still can't handle the indecisiveness between Barbie and Oppenheimer? 😫💥 Don't fret! Come to the presentation of our new perspective paper, "Multimodal Neural Databases", where we lay out the vision for database-like queries on multimodal data. Tomorrow @SIGIR2023, 1.30pm GMT+8

GioTrappolini's tweet photo. Still can't handle the indecisiveness between Barbie and Oppenheimer? 😫💥 Don't fret!
Come to the presentation of our new perspective paper, "Multimodal Neural Databases", where we lay out the vision for database-like queries on multimodal data.
Tomorrow @SIGIR2023, 1.30pm GMT+8 https://t.co/Gb8EY4uYZ0

1

21

5

0

3K

caesar_one_ retweeted

Tim Dettmers

@Tim_Dettmers

about 3 years ago

QLoRA: 4-bit finetuning of LLMs is here! With it comes Guanaco, a chatbot on a single GPU, achieving 99% ChatGPT performance on the Vicuna benchmark: Paper: https://t.co/J3Xy195kDD Code+Demo: https://t.co/SP2FsdXAn5 Samples: https://t.co/q2Nd9cxSrt Colab: https://t.co/Q49m0IlJHD

Tim_Dettmers's tweet photo. QLoRA: 4-bit finetuning of LLMs is here! With it comes Guanaco, a chatbot on a single GPU, achieving 99% ChatGPT performance on the Vicuna benchmark:

Paper: https://t.co/J3Xy195kDD
Code+Demo: https://t.co/SP2FsdXAn5
Samples: https://t.co/q2Nd9cxSrt
Colab: https://t.co/Q49m0IlJHD https://t.co/UJcowpfhpH

81

4K

903

2K

2M

caesar_one_ retweeted

Yann LeCun

@ylecun

about 3 years ago

LIMA : LLaMA 65B + 1000 supervised samples = {GPT4, Bard} level performance. From @MetaAI https://t.co/FIuIo6agXa

75

3K

430

921

629K

caesar_one_ retweeted

Yann LeCun

@ylecun

about 3 years ago

MMS: Massively Multilingual Speech. - Can do speech2text and text speech in 1100 languages. - Can recognize 4000 spoken languages. - Code and models available under the CC-BY-NC 4.0 license. - half the word error rate of Whisper. Code+Models: https://t.co/NIGfUZ8KZg Paper: https://t.co/W15aEWHGIR Blog: https://t.co/TFKXFtlPwc

160

5K

1K

3K

2M

caesar_one_ retweeted

Andrea Bacciu @Andrea_Bacciu

about 3 years ago

Presentiamo il più grande LLM italiano realizzato dal gruppo di ricerca RSTLess della Sapienza Università di Roma. Il team di ricerca dietro Fauno comprende @Andrea_Bacciu, @GioTrappolini, Prof @EmanueleRodola , @teelinsan e il Prof @fabreetseo . https://t.co/hnX4VC7w6x

1

43

15

12

9K

caesar_one_ retweeted

Suraj Srinivas

@Suuraj

over 3 years ago

Three papers accepted at NeurIPS'22 (!!) 1) Efficiently training low-curvature neural networks (https://t.co/nL2FpxuNKh), w/ Kyle Matoba, @hima_lakkaraju, @francoisfleuret We propose to build NNs that are "as linear as possible", and thus eliminate excess model curvature.

Suuraj's tweet photo. Three papers accepted at NeurIPS'22 (!!)

1) Efficiently training low-curvature neural networks (https://t.co/nL2FpxuNKh), w/ Kyle Matoba, @hima_lakkaraju, @francoisfleuret

We propose to build NNs that are "as linear as possible", and thus eliminate excess model curvature. https://t.co/mIJZKXn4Bd

5

218

34

71

0

caesar_one_ retweeted

Ben Meer

@SystemSunday

over 3 years ago

YouTube is free education. But 99% don’t know the best spots on its virtual campus. Here are the top channels to accelerate your learning:

2K

233K

48K

131K

0

caesar_one_ retweeted

Riccardo Orlando @RiccardoRicOrl

almost 4 years ago

Hey #NLProc, I built this little tool to make working with @huggingface 🤗Transformers a bit easier. If you want to directly access whole-word embeddings hassle-free, give it a try! 👉GitHub: https://t.co/HHuR0KKsY9

RiccardoRicOrl's tweet photo. Hey #NLProc, I built this little tool to make working with @huggingface 🤗Transformers a bit easier. If you want to directly access whole-word embeddings hassle-free, give it a try!

👉GitHub: https://t.co/HHuR0KKsY9 https://t.co/RxZt0Triuj

0

22

9

2

0

caesar_one_ retweeted

Bojan Tunguz

@tunguz

almost 4 years ago

This week @Google researchers announced Minerva, an internally developed project that can answer mathematical questions and tackle other complex topics such as physics. 1/5