Roy Bar Haim @roybarhaim - Twitter Profile

about 1 month ago

@bezalelsm אף אחד לא הקים ממשלה עם חמאס. רע״מ היא לא חמאס. רע״מ היא לא ארגון טרור. מנסור עבאס היה ראש המפלגה הערבית הראשון שהכיר בישראל כמדינה יהודית. נתניהו (״שקרן בן שקרן״ לדבריך) דווקא מאוד רצה להקים ממשלה עם רע״מ. אתה אמרת שחמאס הוא נכס. בקרוב תעוף הביתה עם כל ממשלת הטבח. גזען שקרן

1

4

0

686

RoyBarHaim retweeted

Asaf Yehudai

@AsafYehudai

11 months ago

🚨 Benchmarks tell us which model is better — but not why it fails. For developers, this means tedious, manual error analysis. We're bridging that gap. Meet CLEAR: an open-source tool for actionable error analysis of LLMs. 🧵👇

AsafYehudai's tweet photo. 🚨 Benchmarks tell us which model is better — but not why it fails.

For developers, this means tedious, manual error analysis. We're bridging that gap.

Meet CLEAR: an open-source tool for actionable error analysis of LLMs.

🧵👇 https://t.co/VnE0crgPCN

1

44

14

10

2K

RoyBarHaim retweeted

elvis

@omarsar0

12 months ago

Evaluating LLM-based Agents This report has a comprehensive list of methods for evaluating AI Agents. Don't ignore evals. If done right, they are a game-changer. Highly recommend it to AI devs. (bookmark it)

omarsar0's tweet photo. Evaluating LLM-based Agents

This report has a comprehensive list of methods for evaluating AI Agents.

Don't ignore evals. If done right, they are a game-changer.

Highly recommend it to AI devs. (bookmark it) https://t.co/YiZatvmbBC

25

877

170

1K

97K

RoyBarHaim retweeted

Noy Sternlicht @NoySternlicht

about 1 year ago

🔔 New Paper! We propose a challenging new benchmark for LLM judges: Evaluating debate speeches. Are they comparable to humans? Well... it’s debatable. 🤔 https://t.co/u0sd8SrGjj 👇 Here are our findings:

3

50

14

6

4K

Who to follow

NLP at Tel-Aviv University and Google

RoyBarHaim retweeted

about 1 year ago

Interested in Agent Evaluation? 🤖 We’re excited to launch our new repo: “Evaluation of LLM-based Agents: A Reading List” 📚 Browse benchmarks, methods, and frameworks from our recent survey. 👉 Explore & Contribute: https://t.co/nGnL03xCXm #LLMAgents #AgentEvaluation

5

86

24

30

3K

RoyBarHaim retweeted

Asaf Yehudai

@AsafYehudai

about 1 year ago

Survey on Evaluation of LLM-based Agents 🤖 Our paper is the first to provide a comprehensive overview of LLM-based agent evaluation 📜 Paper: https://t.co/43ByXGXLkQ

AsafYehudai's tweet photo. Survey on Evaluation of LLM-based Agents 🤖

Our paper is the first to provide a comprehensive overview of LLM-based agent evaluation 📜

Paper: https://t.co/43ByXGXLkQ https://t.co/KiWfc8377J

3

330

82

264

35K

RoyBarHaim retweeted

Asaf Yehudai

@AsafYehudai

over 1 year ago

New preprint! ✨ Interested in LLM-as-a-Judge? Want to get the best judge for ranking your system? our new work is just for you: "JuStRank: Benchmarking LLM Judges for System Ranking" 🕺💃 https://t.co/7FPgj8FWKh

2

31

9

2

896

RoyBarHaim retweeted

Ariel Gera @ArielGera2

over 1 year ago

Say I want to compare system qualities - pick between 2 configurations, or rank a whole bunch of models. I'll use LLM-as-a-judge, right? 🧑🏻‍⚖️ But how do I know the LLM judge is up to the task? Who is a good judge for ranking systems? Enter our new paper!✨🧵 https://t.co/RJajBQtdUn

1

25

8

4

3K

RoyBarHaim retweeted

Argument Mining @ArgminingOrg

almost 2 years ago

ArgMining 2024 ended with a great photo of its wonderful community. Kudos to all of your great ideas, contributions, and help in organizing.

ArgminingOrg's tweet photo. ArgMining 2024 ended with a great photo of its wonderful community. Kudos to all of your great ideas, contributions, and help in organizing. https://t.co/5nO57ZLBMN

0

16

4

0

2K

RoyBarHaim retweeted

Argument Mining @ArgminingOrg

over 2 years ago

The call for papers for the 11th Workshop on Argument Mining #argminig_2024 is now out: https://t.co/DwjsNT5syK

0

5

2

0

463

Roy Bar Haim @RoyBarHaim

over 2 years ago

@AmitMandelAI https://t.co/nUBFIqHmQy

0

3

RoyBarHaim retweeted

Argument Mining @ArgminingOrg

over 2 years ago

We are excited to announce that the Argument Mining workshop will take place at #ACL2024 in Bangkok, Thailand. For more info see our website at https://t.co/etIRMVxu9h

0

4

2

0

307

RoyBarHaim retweeted

Argument Mining @ArgminingOrg

over 2 years ago

We are happy to announce two shared tasks for ArgMining 2024: 1) Perspective Argument Retrieval organized by Neele Falk and Andreas Waldis. 2) DialAM-2024 organized by Ramon Ruiz-Dolz, John Lawrence, Ella Schad, and Chris Reed.

0

10

3

1

1K

Roy Bar Haim @RoyBarHaim

almost 3 years ago

@GalitDistel אבי האומה // יורשע רק במרמה // והפרת אמונים // גאווה לנתינים

0

10

RoyBarHaim retweeted

Arie Cattan @ArieCattan

about 3 years ago

Curious to see how can we summarize opinions beyond plain text summaries? Check out our #ACL2023 paper: From Key Points to Key Point Hierarchy: Structured and Expressive Opinion Summarization with Lilach Eden, @yoavkantor @RoyBarHaim from @IBMResearch @IBM @biunlp >>

1

18

6

4

2K

RoyBarHaim retweeted

Eyal Shnarch @EyalShnarch

almost 4 years ago

Want to build a text classifier in a few hours? Even if you don’t have any: labeled data #machineLearning knowledge programing skills Label Sleuth https://t.co/ViNd3sQNkT a new open-source no-code system for annotations 🧵 @IBMResearch @NotreDame @StanfordHCI UT Dallas #NLProc

EyalShnarch's tweet photo. Want to build a text classifier in a few hours?

Even if you don’t have any:
labeled data
#machineLearning knowledge
programing skills

Label Sleuth https://t.co/ViNd3sQNkT a new open-source no-code system for annotations 🧵 @IBMResearch @NotreDame @StanfordHCI UT Dallas #NLProc https://t.co/nQ1tp8m7lZ

10

41

21

6

0

RoyBarHaim retweeted

Orith Toledo-Ronen @OrithToledo

almost 4 years ago

Interested in TARGETED #SentimentAnalysis beyond restaurant reviews? In #NAACL2022 we suggest a robust multi-domain model relying on self-training, with no extra annotation -- https://t.co/tRa8oab7pt @OrithToledo @MatanOrbach @YoavKatz73 @noamslonim #NLProc #IBMResearch (1/5)

OrithToledo's tweet photo. Interested in TARGETED #SentimentAnalysis beyond restaurant reviews? In #NAACL2022 we suggest a robust multi-domain model relying on self-training, with no extra annotation -- https://t.co/tRa8oab7pt
@OrithToledo @MatanOrbach @YoavKatz73 @noamslonim
#NLProc #IBMResearch
(1/5) https://t.co/fpzGMDQGWn

1

14

3

0

RoyBarHaim retweeted

Avi Sil @aviaviavi__

almost 4 years ago

Welcome PrimeQA at #NAACL2022! Replicate the state-of-the-art on multilingual open QA quickly! Here’s a new open-source repo in collab with with @stanfordnlp, @huggingface, @Uni_Stuttgart @NLPIllinois1. Link: https://t.co/R6GbTT3mKq Talk to me or read: https://t.co/GblOy35JGK 🧵

aviaviavi__'s tweet photo. Welcome PrimeQA at #NAACL2022! Replicate the state-of-the-art on multilingual open QA quickly! Here’s a new open-source repo in collab with with @stanfordnlp, @huggingface, @Uni_Stuttgart @NLPIllinois1. Link: https://t.co/R6GbTT3mKq Talk to me or read:
https://t.co/GblOy35JGK 🧵 https://t.co/rAjqcWzR8p

1

58

39

8

0

Roy Bar Haim @RoyBarHaim

almost 4 years ago

NAACL 2022 is starting on Sunday! Visit our website https://t.co/7pAzKxHmmZ to learn about the exciting NLP work from IBM Research that will be presented at this conference. @IBMResearch @naaclmeeting #NAACL2022

0

31

11

1

0

Roy Bar Haim @RoyBarHaim

about 5 years ago

NAACL'21 main conference is starting today! Meet our researchers and recruiting team at the @IBMResearch virtual booth: https://t.co/WL1P01vn4v, and learn more about IBM Research's presence at @NAACLHLT, careers and booth schedule at https://t.co/ZR8HW1yaZJ

0

2

0

Roy Bar Haim

@RoyBarHaim

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users