matan orbach @MatanOrbach - Twitter Profile

30 days ago

Need a fast AND accurate retrieval system? checkout this paper!

30 days ago

Zero-shot instruction-following, nuanced classification and reasoning? ➡️ LLMs! Real-world low-latency retrieval over a large-scale corpus? ➡️ Embedding models! But what if you need BOTH at once? That's what our new paper 💡 is all about... https://t.co/WmDM1JnZA1 🧵

1

11

6

1

623

0

1

0

1

44

matan orbach @MatanOrbach

11 months ago

A great tool for error analysis👇

Asaf Yehudai

@AsafYehudai

11 months ago

🚨 Benchmarks tell us which model is better — but not why it fails. For developers, this means tedious, manual error analysis. We're bridging that gap. Meet CLEAR: an open-source tool for actionable error analysis of LLMs. 🧵👇

AsafYehudai's tweet photo. 🚨 Benchmarks tell us which model is better — but not why it fails.

For developers, this means tedious, manual error analysis. We're bridging that gap.

Meet CLEAR: an open-source tool for actionable error analysis of LLMs.

🧵👇 https://t.co/VnE0crgPCN

1

44

14

10

2K

0

1

0

21

matan orbach @MatanOrbach

over 1 year ago

Unitxt has grown from a data preparation library to a cutting-edge evaluation platform in just one year. The best part? Its just getting started. 🚀 Read the Unitxt 2024 Year in Review to learn more: https://t.co/bqlPBtG3xp. #unitxt #llmevaluation

0

14

MatanOrbach retweeted

Yotam Perlitz 👾 @yotamperlitz

almost 2 years ago

HELM just got a great upgrade! We've integrated with Unitxt for: Easy dataset addition 2x the datasets Sharable & reproducible pipelines Check out the blogpost: https://t.co/UJXwfPKzGN And the unitxt repo https://t.co/GeqMCoQhjv @ElronBandel @YifanMai

1

3

2

0

108

Who to follow

Senior Technical Staff Member, NLP at IBM Research AI. Opinions are my own

Elron Bandel

@ElronBandel

Research Scientist | @IBMResearch | General Agent Evaluation Team

matan orbach @MatanOrbach

about 2 years ago

A great new dataset for RAG 👇

Sara Rosenthal @seirasto

about 2 years ago

Very excited to present 👏 ClapNQ our new benchmark dataset for RAG systems! Check out our GitHub: https://t.co/3vxMx09MRG and Paper: https://t.co/twLoU5u6YK and let me know what you think! #CLAPNQ #RAG #dataset #NaturalQuestions @aviaviavi__

seirasto's tweet photo. Very excited to present 👏 ClapNQ our new benchmark dataset for RAG systems! Check out our GitHub: https://t.co/3vxMx09MRG and Paper: https://t.co/twLoU5u6YK and let me know what you think! #CLAPNQ #RAG #dataset #NaturalQuestions @aviaviavi__ https://t.co/b43gQpOhA2

2

25

8

2

4K

0

1

0

49

matan orbach @MatanOrbach

about 2 years ago

Speed up your evaluation 👇

Yotam Perlitz 👾 @yotamperlitz

over 2 years ago

Save yourselves the hours (or days) inferring all 64K examples, when using HELM In https://t.co/O03C4z44yT we show that 160 examples 🤯🤯🤯 is enough to get a very good picture, #ComputeIsForTraining.

yotamperlitz's tweet photo. Save yourselves the hours (or days) inferring all 64K examples, when using HELM

In https://t.co/O03C4z44yT we show that 160 examples 🤯🤯🤯 is enough to get a very good picture, #ComputeIsForTraining. https://t.co/D8Lb5FuF89

0

9

4

2

2K

0

1

0

22

MatanOrbach retweeted

Elron Bandel @ElronBandel

about 2 years ago

We share code on @github We share datasets on @huggingface But where do we share our data processing? We each prompt, instruct, clean, and filter but on our own🥺 Unitxt🦄 A community-based preprocessing tool Let's make it great together https://t.co/Pt3BIwcisu @IBMResearch

1

29

13

6

2K

MatanOrbach retweeted

Ariel Gera @ArielGera2

over 2 years ago

Choosing between 2 generative models, by just a few human(\LM) comparisons. Can this be done? Would the choice be reliable? In our new work, led by @ShirAshuryTahan, we show the answer is yes✨ https://t.co/3H3JOpT9wG The trick to it (contrastive representations) in the 🧵

$ArielGera2's tweet photo. Choosing between 2 generative models, by just a few human(\LM) comparisons. Can this be done? Would the choice be reliable? In our new work, led by @ShirAshuryTahan, we show the answer is yes✨ https://t.co/3H3JOpT9wG The trick to it (contrastive representations) in the 🧵 https://t.co/augVQFGSbF$

2

25

12

9

4K

MatanOrbach retweeted

Asaf Yehudai

@AsafYehudai

over 2 years ago

Happy to share our paper: Genie🧞: Achieving Human Parity in Content-Grounded Datasets Generation was accepted to #ICLR24 From your content Genie creates content-grounded data of magical quality ✨ Rivaling human-based datasets! https://t.co/OJdVfrd9gr

AsafYehudai's tweet photo. Happy to share our paper:

Genie🧞: Achieving Human Parity
in Content-Grounded Datasets Generation

was accepted to #ICLR24

From your content
Genie creates content-grounded data
of magical quality ✨
Rivaling human-based datasets!

https://t.co/OJdVfrd9gr https://t.co/GHas2k3WRL

2

68

19

29

8K

MatanOrbach retweeted

AK

@_akhaliq

over 2 years ago

IBM presents Unitxt Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI paper page: https://t.co/DryLflbWME In the dynamic landscape of generative NLP, traditional text processing pipelines limit research flexibility and reproducibility, as they are tailored to specific dataset, task, and model combinations. The escalating complexity, involving system prompts, model-specific formats, instructions, and more, calls for a shift to a structured, modular, and customizable solution. Addressing this need, we present Unitxt, an innovative library for customizable textual data preparation and evaluation tailored to generative language models. Unitxt natively integrates with common libraries like HuggingFace and LM-eval-harness and deconstructs processing flows into modular components, enabling easy customization and sharing between practitioners. These components encompass model-specific formats, task prompts, and many other comprehensive dataset processing definitions. The Unitxt-Catalog centralizes these components, fostering collaboration and exploration in modern textual data workflows. Beyond being a tool, Unitxt is a community-driven platform, empowering users to build, share, and advance their pipelines collaboratively.

4

143

38

104

55K

MatanOrbach retweeted

Leshem (Legend) Choshen 🤖🤗 @LChoshen

almost 3 years ago

🦾Cohere beats Davinci on HELM 😵‍💫But only if you also test Cohere medium How reliable are our benchmarks really? A fascinating :thread:on HELM, Reliable benchmarks & saving X100 compute Are you up to it? 🧵 https://t.co/OQClOhnrWu

6

102

36

55

43K

MatanOrbach retweeted

Eyal Shnarch @EyalShnarch

almost 3 years ago

Label Sleuth, the no-code open source tool for labeling AND automatically building text classifiers now supports > 150 languages!!! 🌎🌍🌏 Come to see the demo: @IBMResearch booth at #ACL2023 (or https://t.co/JF1mMMh67m)

2

12

9

2

4K

MatanOrbach retweeted

Arie Cattan @ArieCattan

about 3 years ago

Curious to see how can we summarize opinions beyond plain text summaries? Check out our #ACL2023 paper: From Key Points to Key Point Hierarchy: Structured and Expressive Opinion Summarization with Lilach Eden, @yoavkantor @RoyBarHaim from @IBMResearch @IBM @biunlp >>

1

18

6

4

2K

matan orbach @MatanOrbach

about 3 years ago

A recommended read!

Ariel Gera @ArielGera2

about 3 years ago

Can you make an existing pretrained LM behave like a BIGGER pretrained LM? Judging by our new paper on Auto-Contrastive Decoding, in some ways you can! 🤯 https://t.co/PvzSZO9qyE @IBMResearch #acl2023 #NLProc

ArielGera2's tweet photo. Can you make an existing pretrained LM behave like a BIGGER pretrained LM?

Judging by our new paper on Auto-Contrastive Decoding, in some ways you can! 🤯

https://t.co/PvzSZO9qyE

@IBMResearch
#acl2023 #NLProc https://t.co/rVXJVygUuB

2

25

6

9

4K

1

2

0

45

matan orbach @MatanOrbach

about 3 years ago

Join me for a meetup on Targeted Sentiment Analysis! #IBMResearch #AspectBasedSentimentAnalysis #TargetedSentimentAnalysis #AI #NLProc

soniasingh @sonia_singh7

about 3 years ago

Meetup at 2 different time slots! Wed May 10 at 8:00-9:00 am ET: https://t.co/tDzXkeCFK6 Thu May 11 at 3:30-4:30 pm ET: https://t.co/OTQdNS3TXO #TSA #AI #DataScience #DS #ML #UnstructuredData #SentimentAnalysis #IBMWatson #IBM #meetup #technical #technology

0

2

0

151

0

5

1

0

134

MatanOrbach retweeted

Eyal Shnarch @EyalShnarch

about 3 years ago

Label Sleuth is now multi-lingual: English, Arabic, Hebrew, Italian, Romanian. With thousands of downloads, many joyfully label texts and build classifiers with this no-code, open-source system 🤖 https://t.co/gkOfXJabgB @stefanoscotta @radubengulescu #NLProc #NLP #ML

5

27

6

7

2K

MatanOrbach retweeted

Or Carmi • אור כרמי

@orcarmi

about 3 years ago

מה בעצם קרה הערב? לפני ארבעה ימים בלבד, ניתן פסק דין בעתירה שהגישה התנועה לאיכות השלטון בנוגע להסדר ניגוד העניינים של רה"מ נתניהו. בתשובה לעתירה, הודיעה היועמ"ש שהסדר ניגוד העניינים שערך היועמ"ש הקודם מנדלבליט – בעינו עומד. אבל, היא הוסיפה אמירה בנוגע ל-"רפורמה המשפטית". >>

79

1K

141

19

139K

MatanOrbach retweeted

Asaf Yehudai

@AsafYehudai

about 3 years ago

Looking for a new SOTA few-shot classifier? We present QAID! QAID uses batch contrastive learning with BERTScore & classifies by retrieving the intent name. SOTA on few-shot Intent Detection! 🎉 https://t.co/bMxNHwO0gc Accepted to @iclr_conf 2023 🥳 #NLProc #NLP #ML

AsafYehudai's tweet photo. Looking for a new SOTA few-shot classifier?

We present QAID!
QAID uses batch contrastive learning with BERTScore &
classifies by retrieving the intent name.

SOTA on few-shot Intent Detection! 🎉
https://t.co/bMxNHwO0gc

Accepted to @iclr_conf 2023 🥳
#NLProc #NLP #ML https://t.co/Q0FOdEesim

2

84

37

17

7K

MatanOrbach retweeted

Yotam Perlitz 👾 @yotamperlitz

over 3 years ago

What happens when you combine Zero-shot Text Classification and Self-training? You get: Our new EMNLP paper https://t.co/QTJdGuRzb8, Open sourced code https://t.co/1xJObdQEfO Some great results ✨✨✨, and a thread 🧵

yotamperlitz's tweet photo. What happens when you combine Zero-shot Text Classification and Self-training?

You get:

Our new EMNLP paper https://t.co/QTJdGuRzb8,

Open sourced code https://t.co/1xJObdQEfO

Some great results ✨✨✨, and a thread 🧵 https://t.co/TCFssWpCYf

2

10

3

1

0

MatanOrbach retweeted

Leshem (Legend) Choshen 🤖🤗 @LChoshen

over 3 years ago

We want to pretrain🤞 Instead we finetune🚮😔 Could we collaborate?🤗 ColD Fusion: 🔄Recycle finetuning to multitask ➡️evolve pretrained models forever On 35 datasets +2% improvement over RoBERTa +7% in few shot settings 🧵 #NLProc #MachinLearning #NLP #ML #modelRecyclying

6

129

25

48

0

matan orbach

@MatanOrbach

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users