Stefan Bejgu @SBejgu - Twitter Profile

SBejgu retweeted

over 1 year ago

Four of our industrial #PhD students, @SBejgu, @PereLluisHC, @alescire94 and @SimoneTedeschi_, were awarded their #PhD in #AI last Friday with the best grades (and two cum laude)! Congrats all! 👏 🎉 With @RNavigli, their advisor and Babelscape's scientific director, in the photo

babelscape's tweet photo. Four of our industrial #PhD students, @SBejgu, @PereLluisHC, @alescire94 and @SimoneTedeschi_, were awarded their #PhD in #AI last Friday with the best grades (and two cum laude)! Congrats all! 👏 🎉 With @RNavigli, their advisor and Babelscape's scientific director, in the photo https://t.co/jCN2Rmu016

0

12

5

0

535

Stefan Bejgu @SBejgu

over 1 year ago

@rohanpaul_ai Thank you for highlighting our work! 🙌 You can explore the training and evaluation datasets on Hugging Face here: https://t.co/cuitPI7SGl.

0

3

0

56

SBejgu retweeted

Rohan Paul

@rohanpaul_ai

over 1 year ago

Want to know if an AI is lying? LLM-OASIS helps detect factual accuracy in AI outputs with 81k training examples. LLM-OASIS introduces the largest dataset for training factuality evaluators, created by extracting and falsifying information from Wikipedia articles. This enables end-to-end verification of AI-generated text accuracy. ----- 🤔 Original Problem: LLMs still produce hallucinations in their outputs. Existing factuality evaluation resources are limited by being task-specific, small in size, or focused only on simple claim verification. ----- 🔧 Solution in this Paper: → LLM-OASIS extracts claims from Wikipedia passages using an LLM-based pipeline. → The system falsifies selected claims by introducing subtle but critical factual errors. → It generates pairs of factual and unfactual texts based on the original and modified claims. → The dataset covers 81k Wikipedia pages with 681k claims for training factuality evaluators. ----- 💡 Key Insights: → Task-agnostic factuality evaluation is possible with a large-scale synthetic dataset → Wikipedia provides reliable source material for generating factual/unfactual pairs → Human validation confirms high quality of automated data generation (90%+ accuracy) ----- 📊 Results: → GPT-4 achieves 60% accuracy on end-to-end factuality evaluation → 68% accuracy with Retrieval Augmented Generation → Human validation shows 96.78% accuracy for claim extraction → Dataset creation pipeline maintains 89-98% accuracy across all steps

rohanpaul_ai's tweet photo. Want to know if an AI is lying? LLM-OASIS helps detect factual accuracy in AI outputs with 81k training examples.

LLM-OASIS introduces the largest dataset for training factuality evaluators, created by extracting and falsifying information from Wikipedia articles. This enables end-to-end verification of AI-generated text accuracy.

-----

🤔 Original Problem:

LLMs still produce hallucinations in their outputs. Existing factuality evaluation resources are limited by being task-specific, small in size, or focused only on simple claim verification.

-----

🔧 Solution in this Paper:

→ LLM-OASIS extracts claims from Wikipedia passages using an LLM-based pipeline.

→ The system falsifies selected claims by introducing subtle but critical factual errors.

→ It generates pairs of factual and unfactual texts based on the original and modified claims.

→ The dataset covers 81k Wikipedia pages with 681k claims for training factuality evaluators.

-----

💡 Key Insights:

→ Task-agnostic factuality evaluation is possible with a large-scale synthetic dataset

→ Wikipedia provides reliable source material for generating factual/unfactual pairs

→ Human validation confirms high quality of automated data generation (90%+ accuracy)

-----

📊 Results:

→ GPT-4 achieves 60% accuracy on end-to-end factuality evaluation

→ 68% accuracy with Retrieval Augmented Generation

→ Human validation shows 96.78% accuracy for claim extraction

→ Dataset creation pipeline maintains 89-98% accuracy across all steps

3

15

4

7

2K

SBejgu retweeted

Valentino Maiorca @ValeMaiorca

over 1 year ago

✨ Meet #ResiDual, a novel perspective on the alignment of multimodal latent spaces! Think of it as a spectral "panning for gold" along the residual stream. It improves text-image alignment by simply amplifying task-related directions! 🌌🔍 https://t.co/UuXoYBBsT5 [1/6]

ValeMaiorca's tweet photo. ✨ Meet #ResiDual, a novel perspective on the alignment of multimodal latent spaces!

Think of it as a spectral "panning for gold" along the residual stream. It improves text-image alignment by simply amplifying task-related directions! 🌌🔍

https://t.co/UuXoYBBsT5

[1/6] https://t.co/z75Vd7iQYs

2

30

11

5

3K

Who to follow

Pere-Lluís Huguet Cabot

@PereLluisHC

Posdoc at FAIR. Prev: Marie-Curie PhD at @SapienzaNLP. Working on Embeddings, Multilinguality. Projects: Omnilingual Sonar/NLLB, REBEL, Minerva, ...

Agostina Calabrese 🦋

@agostina_cal

Member of Technical Staff at @cohere 👩🏻‍💻 | PhD in Natural Language Processing from the University of Edinburgh 🎓 | (she/her)

Valentino Maiorca

@ValeMaiorca

Postdoc @ISTAustria | @ELLISforEurope Ph.D. | prev @Apple MLR intern (Barcelona & Paris), NLP Engineer @babelscape

SBejgu retweeted

Babelscape @babelscape

over 1 year ago

🚀 Today marks the start of the @MakerFaireRome 🎉 We’re super excited to be part of it and introduce #Vera, our new LLM-powered fact-checking tool! 🤖🧠 Here’s a sneak peek of what you can expect at our booth! 👀✨ #MakerFaireRome #FactChecking #LLM #ArtificialIntelligence

1

7

4

0

617

SBejgu retweeted

Babelscape @babelscape

over 1 year ago

✨Tired of verifying #AI-generated info?😵 🔎Meet Vera, our #LLM-based fact-checker using trusted sources from the Web or your knowledge base. 💥Check out the live demo at Rome #MakerFaire2024 (Oct 25-27)! More info 👉: https://t.co/OxrZ63rIbK #FactChecking #Misinformation

babelscape's tweet photo. ✨Tired of verifying #AI-generated info?😵
🔎Meet Vera, our #LLM-based fact-checker using trusted sources from the Web or your knowledge base.
💥Check out the live demo at Rome #MakerFaire2024 (Oct 25-27)!
More info 👉: https://t.co/OxrZ63rIbK
#FactChecking #Misinformation https://t.co/Ah039LHTqv

0

8

2

1

457

SBejgu retweeted

UniReps @unireps

over 1 year ago

🔵🔴When do distinct learning processes learn similar representations? Detecting patterns and conditions for this to happen is an open direction: a thread🧵 Working on this topic? Submit at: https://t.co/TjAbJcpAFk DEADLINE: 20 Sept See you at @NeurIPSConf! 🔵🔴 [1/N]

unireps's tweet photo. 🔵🔴When do distinct learning processes learn similar representations?

Detecting patterns and conditions for this to happen is an open direction: a thread🧵

Working on this topic? Submit at: https://t.co/TjAbJcpAFk

DEADLINE: 20 Sept

See you at @NeurIPSConf! 🔵🔴

[1/N] https://t.co/9oTc1qdZuh

1

49

14

20

5K

SBejgu retweeted

SapienzaNLP @SapienzaNLP

almost 2 years ago

Post #ACL2024nlp dinner in #Bangkok with most of the presenting/attending band from our group + @Babelscape. Left to right: @SBejgu @LorenzoProiet13 @giumartinelli_ @19Stefano97 @RiccardoRicOrl @FMTucci @RNavigli @KarimAsh14 & Celebrating our outstanding paper award! #NLProc

SapienzaNLP's tweet photo. Post #ACL2024nlp dinner in #Bangkok with most of the presenting/attending band from our group + @Babelscape. Left to right: @SBejgu @LorenzoProiet13 @giumartinelli_ @19Stefano97 @RiccardoRicOrl @FMTucci @RNavigli @KarimAsh14 & Celebrating our outstanding paper award! #NLProc https://t.co/1xLKB2Hwhs

0

12

3

0

835

SBejgu retweeted

Francesco Maria Molfese @framolfese

about 2 years ago

Come to chat with us at the poster session C in #EACL2024, starting now in room Radisson! #NLProc

0

21

2

0

810

SBejgu retweeted

Alessandro Scirè @alescire94

about 2 years ago

Exciting strides in text summarization with LLMs 🚀but verifying their factual accuracy is still an open challenge 🤔 We introduce FENICE, a factuality-oriented metric for summarization with a strong focus on interpretability🔍https://t.co/jjEI6lbxzG #NLProc #LLMs #Factuality

2

20

10

2

1K

SBejgu retweeted

Francesco Maria Molfese @framolfese

over 2 years ago

📢Happy to share that "Neuralign: A Context-Aware, Cross-Lingual and Fully-Neural Sentence Alignment System for Long Texts" has been accepted to #EACL2024 (main) 🫂Huge thanks to my co-authors @SBejgu @SimoneTedeschi_ @ConiaSimone @RNavigli 📃More details coming soon! #NLProc

framolfese's tweet photo. 📢Happy to share that "Neuralign: A Context-Aware, Cross-Lingual and Fully-Neural Sentence Alignment System for Long Texts" has been accepted to #EACL2024 (main)

🫂Huge thanks to my co-authors @SBejgu @SimoneTedeschi_ @ConiaSimone @RNavigli

📃More details coming soon! #NLProc https://t.co/02q2Nkw9CP

0

14

6

1

819

SBejgu retweeted

Simone Tedeschi @SimoneTedeschi_

over 2 years ago

How to Mitigate Hallucinations in Large Language Models (#LLMs)?🤔 In this new @Medium article, I review the most recent research on mitigating hallucinations, and explain the main methods that are used to address this issue. 📑 https://t.co/R5c8JViYbg #AI #NLP #GPT4 #LLM

SimoneTedeschi_'s tweet photo. How to Mitigate Hallucinations in Large Language Models (#LLMs)?🤔

In this new @Medium article, I review the most recent research on mitigating hallucinations, and explain the main methods that are used to address this issue.

📑 https://t.co/R5c8JViYbg

#AI #NLP #GPT4 #LLM https://t.co/JGARPXv6O0

1

15

2

649

SBejgu retweeted

Babelscape @babelscape

over 2 years ago

Tomorow at 5pm @SBejgu will present our research work on word alignment in 14 language pairs! @CLiC_it_conf #CliCit2023, joint with @SapienzaNLP and many other partners! #NLProc #LLMs

babelscape's tweet photo. Tomorow at 5pm @SBejgu will present our research work on word alignment in 14 language pairs! @CLiC_it_conf #CliCit2023, joint with @SapienzaNLP and many other partners! #NLProc #LLMs https://t.co/Cu45Ndfkkf

0

10

6

1

744

SBejgu retweeted

Babelscape @babelscape

over 3 years ago

Excited about #ChatGPT for your business? Check out #Emotionary! The revolutionary #multilingual AI system that understands #emotions: #analyze customer reviews, #track feelings in #news, #socialmedia & #chatbot conversations! https://t.co/HMIF4MRwYv

0

26

12

1

2K

SBejgu retweeted

Valentino Maiorca @ValeMaiorca

over 3 years ago

📢 It looks like relative representations are here to stay! I'm beyond thrilled to announce that our work has been selected as one of the notable top 5% (oral) papers at #iclr23 ! 🥳 https://t.co/nlZBiaIMHZ [1/5]

3

266

37

112

55K

SBejgu retweeted

Roberto Navigli @RNavigli

almost 4 years ago

The Rome Workshop on 10 Years of #BabelNet & Multilingual Neurosymbolic Natural Language Understanding was a great success, with productive in-person discussions, amazing talks & >100 online participants! Thanks! @ERC_Research @Babelscape @SapienzaNLP @SapienzaRoma @WikiResearch

RNavigli's tweet photo. The Rome Workshop on 10 Years of #BabelNet & Multilingual Neurosymbolic Natural Language Understanding was a great success, with productive in-person discussions, amazing talks & >100 online participants! Thanks!
@ERC_Research @Babelscape @SapienzaNLP @SapienzaRoma @WikiResearch https://t.co/4TWMi3C2cC

0

37

15

0

SBejgu retweeted

SapienzaNLP @SapienzaNLP

about 4 years ago

Open & commercial Neural Machine Translation models heavily suffer from disambiguation biases! We present DiBiMT, our novel benchmark for lexical-semantic bias in MT at #ACL2022! By @Valahaar @FedeMartelli25 @FrancescoSaina @RNavigli @ELEXIS_EU #NLProc 📝:https://t.co/8A8TN1DABj

SapienzaNLP's tweet photo. Open & commercial Neural Machine Translation models heavily suffer from disambiguation biases! We present DiBiMT, our novel benchmark for lexical-semantic bias in MT at #ACL2022! By @Valahaar @FedeMartelli25 @FrancescoSaina @RNavigli
@ELEXIS_EU #NLProc

📝:https://t.co/8A8TN1DABj https://t.co/ewzxaQd8BW

0

21

12

1

0

SBejgu retweeted

Babelscape @babelscape

about 4 years ago

Empower your natural language applications with WordAtlas! #WordAtlas is the next-generation multilingual knowledge graph. What makes it special is its linkage between words and concepts in hundreds of languages. https://t.co/hIWZaCB6EP

babelscape's tweet photo. Empower your natural language applications with WordAtlas!
#WordAtlas is the next-generation multilingual knowledge graph. What makes it special is its linkage between words and concepts in hundreds of languages.
https://t.co/hIWZaCB6EP https://t.co/0iNjAvWwTv

0

15

10

0

SBejgu retweeted

Turing Post

@TheTuringPost

about 4 years ago

Classy is a @PyTorch-based library for the fast prototyping and sharing of deep neural network models. It wraps the best libraries like PyTorch Lightning, Transformers, @streamlit and offers them to users with a simple CLI interface. Try it here: https://t.co/BM6nXI1aUc

TheTuringPost's tweet photo. Classy is a @PyTorch-based library for the fast prototyping and sharing of deep neural network models.

It wraps the best libraries like PyTorch Lightning, Transformers, @streamlit and offers them to users with a simple CLI interface.

Try it here: https://t.co/BM6nXI1aUc https://t.co/6TuIeEaFvO

0

27

17

6

0

Stefan Bejgu

@SBejgu

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users