Alon Talmor @AlonTalmor - Twitter Profile

Pinned Tweet

about 6 years ago

Can we instantly correct pre-trained models by showing them natural language rules and facts? Can they systematically reason over implicit knowledge while doing so? https://t.co/fdLNLfPZ4B New joint work with Oyvind Tafjord, Peter Clark, @yoavgo and @JonathanBerant suggests yes!

AlonTalmor's tweet photo. Can we instantly correct pre-trained models by showing them natural language rules and facts? Can they systematically reason over implicit knowledge while doing so? https://t.co/fdLNLfPZ4B New joint work with Oyvind Tafjord, Peter Clark, @yoavgo and @JonathanBerant suggests yes! https://t.co/ZdG9KmSHoR

4

213

37

56

0

AlonTalmor retweeted

AK

@_akhaliq

over 2 years ago

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking paper page: https://t.co/piWtpFKPLs Reward models play a key role in aligning language model applications towards human preferences. However, this setup creates an incentive for the language model to exploit errors in the reward model to achieve high estimated reward, a phenomenon often termed reward hacking. A natural mitigation is to train an ensemble of reward models, aggregating over model outputs to obtain a more robust reward estimate. We explore the application of reward ensembles to alignment at both training time (through reinforcement learning) and inference time (through reranking). First, we show that reward models are underspecified: reward models that perform similarly in-distribution can yield very different rewards when used in alignment, due to distribution shift. Second, underspecification results in overoptimization, where alignment to one reward model does not improve reward as measured by another reward model trained on the same data. Third, overoptimization is mitigated by the use of reward ensembles, and ensembles that vary by their pretraining seeds lead to better generalization than ensembles that differ only by their fine-tuning seeds, with both outperforming individual reward models. However, even pretrain reward ensembles do not eliminate reward hacking: we show several qualitative reward hacking phenomena that are not mitigated by ensembling because all reward models in the ensemble exhibit similar error patterns.

_akhaliq's tweet photo. Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

paper page: https://t.co/piWtpFKPLs

Reward models play a key role in aligning language model applications towards human preferences. However, this setup creates an incentive for the language model to exploit errors in the reward model to achieve high estimated reward, a phenomenon often termed reward hacking. A natural mitigation is to train an ensemble of reward models, aggregating over model outputs to obtain a more robust reward estimate. We explore the application of reward ensembles to alignment at both training time (through reinforcement learning) and inference time (through reranking). First, we show that reward models are underspecified: reward models that perform similarly in-distribution can yield very different rewards when used in alignment, due to distribution shift. Second, underspecification results in overoptimization, where alignment to one reward model does not improve reward as measured by another reward model trained on the same data. Third, overoptimization is mitigated by the use of reward ensembles, and ensembles that vary by their pretraining seeds lead to better generalization than ensembles that differ only by their fine-tuning seeds, with both outperforming individual reward models. However, even pretrain reward ensembles do not eliminate reward hacking: we show several qualitative reward hacking phenomena that are not mitigated by ensembling because all reward models in the ensemble exhibit similar error patterns.

1

67

28

33

21K

AlonTalmor retweeted

The Information

@theinformation

over 2 years ago

Two startups are rejecting investments from Web Summit’s venture arm, a sign that fallout from the summit co-founder’s remarks on Israel isn't over. https://t.co/vRgRNSudNp By @nmasc_

0

5

4

0

3K

AlonTalmor retweeted

Mor Geva

@megamor2

about 3 years ago

LMs capture many factual associations, but how do they recall them internally during inference? In a new preprint, we find that LMs build attribute-rich subject representations, from which attention heads extract the predicted attribute. @jasmijnbastings @fajtak @amirgloberson 🧵

megamor2's tweet photo. LMs capture many factual associations, but how do they recall them internally during inference?
In a new preprint, we find that LMs build attribute-rich subject representations, from which attention heads extract the predicted attribute.
@jasmijnbastings @fajtak @amirgloberson 🧵 https://t.co/jqfFl6yLRx

3

436

89

223

61K

Who to follow

Roy Schwartz

@royschwartzNLP

Senior Lecturer at @CseHuji. #NLPROC

UW NLP

@uwnlp

The NLP group at the University of Washington.

Hanna Hajishirzi

@HannaHajishirzi

VP@Microsoft-AI; past: Olmo, Tulu

AlonTalmor retweeted

Ori Yoran @OriYoran

about 3 years ago

A single chain-of-thought (CoT) may not be enough to answer complex questions, can an LLM meta-reason over multiple CoTs? In our new preprint, we show that meta-reasoning boosts performance on multi-hop QA datasets! Paper: https://t.co/UnfZ59Unkt 🧵1/4

2

191

54

62

27K

AlonTalmor retweeted

Or Azulay

@orslimy

about 3 years ago

שרשוראור מוצרי - איך כורים זהב מתמיכה? אחת מהטעויות ששמתי לב אליהן שמנהלי מוצר בתחילת דרכם עושים - הם לא יודעים איך לייצר ערוץ פידבק איכותי מפניות של לקוחות לתמיכה. חשוב לזכור שלקוחות יפנו אליכם (ישאירו ביקורת, או ישלחו הודעה) רק במקרי קיצון, כי זה דורש מהם זמן ואנרגיה, לדוג׳:

orslimy's tweet photo. שרשוראור מוצרי - איך כורים זהב מתמיכה?

אחת מהטעויות ששמתי לב אליהן שמנהלי מוצר בתחילת דרכם עושים - הם לא יודעים איך לייצר ערוץ פידבק איכותי מפניות של לקוחות לתמיכה.

חשוב לזכור שלקוחות יפנו אליכם (ישאירו ביקורת, או ישלחו הודעה) רק במקרי קיצון, כי זה דורש מהם זמן ואנרגיה, לדוג׳: https://t.co/trHbrzjKaw

1

4

1

3

2K

AlonTalmor retweeted

ilan huberman @ilanhub

over 3 years ago

Next week at #sap #dcom Israel What a treat! @shaiyallin @DarkModeIL @leshemco @AlonTalmor

0

4

3

0

1K

AlonTalmor retweeted

Ask-AI @ask_ai_tech

over 3 years ago

Casual night out at @ask_ai_tech 🔥#AI #NLProc

1

5

2

0

821

AlonTalmor retweeted

סטארטאפיסטים מצטלמים @TechPhotoshoots

over 3 years ago

@urieli17 https://t.co/04npRhYqJD

1

19

2

1

3K

AlonTalmor retweeted

ourspace @ourspace_teams

over 3 years ago

Still buzzing from getting to open #WebSummit2022 as one of the breakout startups 🤩 alongside @lokalise, @amity_hq, @ask_ai_tech, @choose_tactic & @gaiascope Our CEO, @mlmurphy818 had the challenge of squeezing all our passion and vision into just 2.5 min! Video coming...

ourspace_teams's tweet photo. Still buzzing from getting to open #WebSummit2022 as one of the breakout startups 🤩 alongside @lokalise, @amity_hq, @ask_ai_tech, @choose_tactic & @gaiascope

Our CEO, @mlmurphy818 had the challenge of squeezing all our passion and vision into just 2.5 min!

Video coming... https://t.co/ezGljvuqbO

1

13

2

0

AlonTalmor retweeted

Mark Peter Davis

@mpd

over 3 years ago

On this week’s pod ep we’re combining a guest interview w/ the new partner meeting format. We start with the partner meeting segments then I chat w/ a guest. Hopefully a fun twist for everyone. This week's guest is @AlonTalmor, Founder & CEO of Ask-AI. https://t.co/6v1SnmHH22

2

13

8

0

AlonTalmor retweeted

Innovation Theory

@Web3nnovators

over 3 years ago

Good morning, Lisbon! Let's start today with another session of “Breakout startups'' will be held today. The experts are @saadhrizvi, @mlmurphy818, @luke_mackey, @alontalmor, @simonrohrbach, @AbbieElsieM, @OneTuomoLaine⚡️ 10:30 AM (CET) / Centre📍 #WebSummit

Web3nnovators's tweet photo. Good morning, Lisbon!

Let's start today with another session of “Breakout startups'' will be held today. The experts are @saadhrizvi, @mlmurphy818, @luke_mackey, @alontalmor, @simonrohrbach, @AbbieElsieM, @OneTuomoLaine⚡️

10:30 AM (CET) / Centre📍

#WebSummit https://t.co/McY1RA19kG

0

12

4

0

AlonTalmor retweeted

Andrew Coy @andrewcoy

over 3 years ago

--> @AlonTalmor of #AskAI -- "aggregates text-heavy company knowledge & customer communications to reveal pinpointed answers and actionable insights." https://t.co/QUfq6raXcm #websummit #opening #mainstage

1

0

AlonTalmor retweeted

Jonathan Berant @JonathanBerant

over 3 years ago

Full house at the Yandex distinguished lecture series as @YejinChoinka is starting her talk!

1

87

8

1

0

Alon Talmor @AlonTalmor

over 3 years ago

Excited to be speaking at #websummit2022 #breakout sessions 1st of Nov. 17:30PM and, 3rd of Nov. 9:45AM, as well as the panel "The socio-economic impacts of AI" at DeepTech stage, at 14:55 Nov 3rd. Come say hello!

AlonTalmor's tweet photo. Excited to be speaking at #websummit2022 #breakout sessions 1st of Nov. 17:30PM and, 3rd of Nov. 9:45AM, as well as the panel "The socio-economic impacts of AI" at DeepTech stage, at 14:55 Nov 3rd. Come say hello! https://t.co/CXvwKM5AmM

0

6

3

1

0

AlonTalmor retweeted

Or Hiltch

@_orcaman

over 3 years ago

אני בד"כ לא נוהג להשקיע ישירות, אבל כשאני כן, זה באנשים כמו @AlonTalmor והצוות שלו ב-Ask! https://t.co/yHDeSBthyn

2

37

2

0

AlonTalmor retweeted

Eynat Guez

@EynatGuez

almost 4 years ago

מחשבות של לילה - הדבר הכי קשה בסטארטאפ בצמיחה זה להבין מה חווית הלקוח ברגע נתון, איפה בתהליכים הוא חווה תסכול או שירות לא טוב. תמיד צריך להשאר מחוברים למקום הזה, זה קשה כי בסוף יש עוד הרבה אנשים בדרך.הטריק שלי זה לקרוא מיילים של לקוחות שמגיעים לתמיכה ולנתח מהם את החוויה לאחור.

29

186

3

0

Alon Talmor @AlonTalmor

almost 4 years ago

@EynatGuez כחברה שמתמחה במתן תשובות וניתוח תקשורות לקוח ממקורות מרובים, הבעיה שתארת היא במרכז הפתרון שבנינו. בעזרת NLP אנחנו מנתחים טקסט ממקורות כמו טיקטים, CSATים, שיחות עם לקוחות כדי להציף בעיות שחוזרות עצמן לא כאוסף טגיות אלא כמשפט שלם בשפה טבעית. https://t.co/3vJnbKF5eh (נדב נאמן מכיר)

0

7

1

0

AlonTalmor retweeted

Shirly Grynberg @ShirlyGrynberg

about 4 years ago · Chicago

Come to say hello and hear about our work with BRAF inhibitors on Ameloblastoma of the mandible. 100% RR 😱 Poster board #140 #ASCO22 https://t.co/KWjN315Fcn

ShirlyGrynberg's tweet photo. Come to say hello and hear about our work with BRAF inhibitors on Ameloblastoma of the mandible. 100% RR 😱 Poster board #140 #ASCO22 https://t.co/KWjN315Fcn https://t.co/XUziFxzFzf

2

22

6

0

AlonTalmor retweeted

Ben Bogin @ben_bogin

about 4 years ago

📢📢📢 We release COVR-10: a set of 10 challenging compositional generalization splits (with some intriguing results on GPT-3’s compositional skills). @JonathanBerant @shivanshug11 https://t.co/wzaXoFtt2A 1/4

ben_bogin's tweet photo. 📢📢📢 We release COVR-10: a set of 10 challenging compositional generalization splits (with some intriguing results on GPT-3’s compositional skills). @JonathanBerant @shivanshug11

https://t.co/wzaXoFtt2A

1/4 https://t.co/jEekw6ej7I

1

57

16

6

0

AlonTalmor retweeted

Dan Peer @peer_lab

about 4 years ago

I am so proud of these four exceptional TAU researchers, my colleagues @MichalFeldman9 @jonathanberant, Leo Corry and Roy Tzohar, who today won the Kadar Family Award for Outstanding #research for 2022 at #TAUbog22. @TelAvivUni @AFTAU #excellence #Israel

peer_lab's tweet photo. I am so proud of these four exceptional TAU researchers, my colleagues @MichalFeldman9 @jonathanberant, Leo Corry and Roy Tzohar, who today won the Kadar Family Award for Outstanding #research for 2022 at #TAUbog22. @TelAvivUni @AFTAU #excellence #Israel https://t.co/J1E9baducp

5

38

5

0

Alon Talmor

@AlonTalmor

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users