Jonathan Herzig

about 1 month ago

Going to @ICLR2026!!✈️ @Itay_Itzhak_, and I (w\ @jonherzig , @mtutek, Idan Szpektor and @boknilev) will present ManagerBench👔 A benchmark that evaluates how models balance goals with potential harm to humans in a management setup! Tomorrow at 10:30 Pavilion 4 #4202!

$AdiSimhi's tweet photo. Going to @ICLR2026!!✈️ @Itay_Itzhak_, and I (w\ @jonherzig , @mtutek, Idan Szpektor and @boknilev) will present ManagerBench👔 A benchmark that evaluates how models balance goals with potential harm to humans in a management setup! Tomorrow at 10:30 Pavilion 4 #4202! https://t.co/24OjPFBgnu$

jonherzig retweeted

@_akhaliq

3 months ago

Thinking to Recall How Reasoning Unlocks Parametric Knowledge in LLMs paper: https://t.co/juzRYfAZ5u

16K

jonherzig retweeted

NLP at Tel-Aviv University and Google

3 months ago

New paper 🚨 We know that reasoning helps when step-by-step solutions are natural, for example in math, code, and multi-hop factual QA. But why should it help with factual recall, where no complex reasoning steps are needed? 1/🧵

zorikgekhman's tweet photo. New paper 🚨

We know that reasoning helps when step-by-step solutions are natural, for example in math, code, and multi-hop factual QA. But why should it help with factual recall, where no complex reasoning steps are needed?

1/🧵 https://t.co/XSTjMQ1EMi

14K

jonherzig retweeted

Ido Amos @AmosaurusRex

4 months ago

Can LLMs reason internally while processing their inputs, similar to how humans can think ahead as we process information? Our latest work introduces Thinking States, a novel architectural adaptation that transforms reasoning into a internal recurrent process. By training models to maintain a dynamic thinking state, we achieve significant inference speedups over Chain-of-Thought while substantially outperforming existing latent reasoning methods. Paper: https://t.co/nXJ9szfbrT

AmosaurusRex's tweet photo. Can LLMs reason internally while processing their inputs, similar to how humans can think ahead as we process information? Our latest work introduces Thinking States, a novel architectural adaptation that transforms reasoning into a internal recurrent process.
By training models to maintain a dynamic thinking state, we achieve significant inference speedups over Chain-of-Thought while substantially outperforming existing latent reasoning methods.

Paper: https://t.co/nXJ9szfbrT

132

106

13K

Who to follow

Jonathan Berant

@JonathanBerant

Mor Geva

@megamor2

Assistant Professor at @TelAvivUni and Research Scientist at @Irregular; previously at @GoogleResearch, @GoogleDeepMind and @allen_ai

Valentina Pyatkin

@valentina__py

Postdoc at the Allen Institute for AI @allen_ai , @uwnlp , and @ETH_AI_Center

jonherzig retweeted

4 months ago

ManagerBench was accepted to #ICLR2026🇧🇷 We evaluate LLM decision in realistic human-validated managerial scenarios Some frontier LLMs choose options harmful to humans to advance their operational goals⚠️, while some are overly safe, avoid harm even when it's aimed at furniture🛋️

jonherzig retweeted

8 months ago

🤔What happens when LLM agents choose between achieving their goals and avoiding harm to humans in realistic management scenarios? Are LLMs pragmatic or prefer to avoid human harm? 🚀 New paper out: ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs🚀🧵

AdiSimhi's tweet photo. 🤔What happens when LLM agents choose between achieving their goals and avoiding harm to humans in realistic management scenarios? Are LLMs pragmatic or prefer to avoid human harm?
🚀 New paper out: ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs🚀🧵 https://t.co/GAi7wWBDaC

jonherzig retweeted

Arie Cattan @ArieCattan

12 months ago

🚨 RAG is a popular approach but what happens when the retrieved sources provide conflicting information?🤔 We're excited to introduce our paper: “DRAGged into CONFLICTS: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs”🚀 A thread 🧵👇

ArieCattan's tweet photo. 🚨 RAG is a popular approach but what happens when the retrieved sources provide conflicting information?🤔

We're excited to introduce our paper:
“DRAGged into CONFLICTS: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs”🚀

A thread 🧵👇 https://t.co/IzF2fUFoyr

jonherzig retweeted

about 1 year ago

🚨 It's often claimed that LLMs know more facts than they show in their outputs, but what does this actually mean, and how can we measure this “hidden knowledge”? In our new paper, we clearly define this concept and design controlled experiments to test it. 1/🧵

zorikgekhman's tweet photo. 🚨 It's often claimed that LLMs know more facts than they show in their outputs, but what does this actually mean, and how can we measure this “hidden knowledge”?

In our new paper, we clearly define this concept and design controlled experiments to test it.
1/🧵 https://t.co/gNcP23JCY6

222

178

27K

jonherzig retweeted

over 1 year ago

At #EMNLP2024? Join me in the Language Modeling 1 session tomorrow, 11:00-11:15, for a talk on how fine-tuning with new knowledge impacts hallucinations.

jonherzig retweeted

over 1 year ago

LLMs often "hallucinate". But not all hallucinations are the same! This paper reveals two distinct types: (1) due to lack of knowledge and (2) hallucination despite knowing. Check out our new preprint, "Distinguishing Ignorance from Error in LLM Hallucinations"

AdiSimhi's tweet photo. LLMs often "hallucinate". But not all hallucinations are the same! This paper reveals two distinct types: (1) due to lack of knowledge and (2) hallucination despite knowing.

Check out our new preprint, "Distinguishing Ignorance from Error in LLM Hallucinations" https://t.co/1Pm5akHgtZ

955

183

790

91K

jonherzig retweeted

Shauli Ravfogel @ravfogel

over 1 year ago

Our work on the effects of exposing LLMs to new knowledge through fine-tuning has been accepted to #EMNLP2024! We show that LLMs struggle to learn new knowledge, but when they do, they hallucinate more. Looking forward to presenting our findings and discussing them in person.

jonherzig retweeted

Arie Cattan @ArieCattan

almost 2 years ago

🚨🚨 Check out our new paper for a new ICL method that greatly boosts LLMs in long contexts! >> https://t.co/8jt69HbOHC

jonherzig retweeted

Rohan Paul

@rohanpaul_ai

almost 2 years ago

This is quite a valuable resource from @GoogleA for evaluating the complex reasoning and numerical calculation capabilities of large language models. A few key takeaways: 'TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools' 📌 The TACT (Text And Calculations through Tables) dataset challenges LLMs' reasoning and computational abilities on complex instructions that require aggregating information scattered across texts and performing complex integration on this information to generate the answer. TACT instances consist of the original text, the written instruction, and a gold answer, all requiring advanced text comprehension and reasoning. 📌 TACT was constructed by leveraging the InstructIE dataset, which contains texts and their associated tables. For each table, experts formulated new queries and gathered their respective answers. The dataset creation process involved 1) Initial review and relevance vetting, 2) Numerical aspect identification, 3) Natural language instruction formulation, 4) Natural language query over the table, 5) Translation to Pandas commands and gold response extraction, and 6) Command execution and validation. 📌 Experiments show that all contemporary LLMs perform poorly on TACT, achieving an accuracy below 38%. To pinpoint the difficulties, the authors analyze model performance across three components: table-generation, Pandas command-generation, and execution. 📌 To address these challenges, the authors propose the "IE as a tool" framework. The key idea is to solve TACT instructions through the sequential invocation of two tools: one that generates a table from the text and instruction, and one that generates a corresponding Pandas command. The model then executes the command, along with the original instruction and text, to produce the final answer. The authors implement each tool with few-shot prompting. 📌 The IE as a tool approach shows a 12% improvement over existing prompting techniques on TACT. Analyzing the performance on the individual table-generation and Pandas command-generation tasks reveals significant headroom in each, suggesting that focused few-shot prompting can considerably enhance performance. This aligns with the authors' finding that each dissected component of the TACT task has untapped potential for improvement.

rohanpaul_ai's tweet photo. This is quite a valuable resource from @GoogleA for evaluating the complex reasoning and numerical calculation capabilities of large language models. A few key takeaways:

'TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools'

📌 The TACT (Text And Calculations through Tables) dataset challenges LLMs' reasoning and computational abilities on complex instructions that require aggregating information scattered across texts and performing complex integration on this information to generate the answer. TACT instances consist of the original text, the written instruction, and a gold answer, all requiring advanced text comprehension and reasoning.

📌 TACT was constructed by leveraging the InstructIE dataset, which contains texts and their associated tables. For each table, experts formulated new queries and gathered their respective answers. The dataset creation process involved 1) Initial review and relevance vetting, 2) Numerical aspect identification, 3) Natural language instruction formulation, 4) Natural language query over the table, 5) Translation to Pandas commands and gold response extraction, and 6) Command execution and validation.

📌 Experiments show that all contemporary LLMs perform poorly on TACT, achieving an accuracy below 38%. To pinpoint the difficulties, the authors analyze model performance across three components: table-generation, Pandas command-generation, and execution.

📌 To address these challenges, the authors propose the "IE as a tool" framework. The key idea is to solve TACT instructions through the sequential invocation of two tools: one that generates a table from the text and instruction, and one that generates a corresponding Pandas command. The model then executes the command, along with the original instruction and text, to produce the final answer. The authors implement each tool with few-shot prompting.

📌 The IE as a tool approach shows a 12% improvement over existing prompting techniques on TACT. Analyzing the performance on the individual table-generation and Pandas command-generation tasks reveals significant headroom in each, suggesting that focused few-shot prompting can considerably enhance performance. This aligns with the authors' finding that each dissected component of the TACT task has untapped potential for improvement.

jonherzig retweeted

Jack Hessel @jmhessel

almost 2 years ago

This (awesome!) work from @zorikgekhman et al. explores the fine-line between teaching a model "new facts" versus teaching it to randomly guess. Naive fine-tuning on new facts unknown-to-the-model indeed results in hallucinations, but... https://t.co/PrWrWipsKy

jmhessel's tweet photo. This (awesome!) work from @zorikgekhman et al. explores the fine-line between teaching a model "new facts" versus teaching it to randomly guess. Naive fine-tuning on new facts unknown-to-the-model indeed results in hallucinations, but...

https://t.co/PrWrWipsKy https://t.co/NcjFT0s12F

jonherzig retweeted

Avi Caciularu @clu_avi

almost 2 years ago

🚨 New Paper 🚨 Are current LLMs up to the task of solving *complex* instructions based on content-rich text? Our new dataset, TACT, sheds some light on this challenge. How does it work? https://t.co/4u3iTC087B Work by @GoogleAI & @GoogleDeepMind 👇🧵

clu_avi's tweet photo. 🚨 New Paper 🚨
Are current LLMs up to the task of solving *complex* instructions based on content-rich text?
Our new dataset, TACT, sheds some light on this challenge.
How does it work?
https://t.co/4u3iTC087B
Work by @GoogleAI & @GoogleDeepMind
👇🧵 https://t.co/lusPI6Kwlq

104

12K

jonherzig retweeted

almost 2 years ago

Happy to announce our latest research “Representation Surgery: Theory and Practice of Affine Steering", accepted at ICML! A joint work with @roeeaharoni @jonherzig @ryandcotterell @ponguru and @shashwat_s19, done during an internship at @GoogleAI.(1/8) https://t.co/jOwQZq4mkS

ravfogel's tweet photo. Happy to announce our latest research “Representation Surgery: Theory and Practice of Affine Steering", accepted at ICML! A joint work with @roeeaharoni @jonherzig @ryandcotterell @ponguru and @shashwat_s19, done during an internship at @GoogleAI.(1/8) https://t.co/jOwQZq4mkS https://t.co/SnRovDKXEJ

128

12K

jonherzig retweeted

Google AI

@GoogleAI

about 2 years ago

Today we share a comprehensive evaluation of tool-assisted generation strategies, where we ask: Does few-shot tool assistance work? Surprisingly, we found that it generally does not perform better than an LM operating without tools — learn more →https://t.co/iD57bXbdQX

GoogleAI's tweet photo. Today we share a comprehensive evaluation of tool-assisted generation strategies, where we ask: Does few-shot tool assistance work? Surprisingly, we found that it generally does not perform better than an LM operating without tools — learn more →https://t.co/iD57bXbdQX

306

49K

jonherzig retweeted

elvis

@omarsar0

about 2 years ago

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? This is one of the more interesting LLM papers I read last week. It reports that LLMs struggle to acquire factual knowledge through fine-tuning. When examples with new knowledge are eventually learned they linearly increase the LLM's tendency to hallucinate. I mostly use fine-tuning to refine generations for my use cases and in some special situations and rarely for memorizing information. More thoughts on my latest LLM recap: https://t.co/xNaUkBbl4X

omarsar0's tweet photo. Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

This is one of the more interesting LLM papers I read last week.

It reports that LLMs struggle to acquire factual knowledge through fine-tuning.

When examples with new knowledge are eventually learned they linearly increase the LLM's tendency to hallucinate.

I mostly use fine-tuning to refine generations for my use cases and in some special situations and rarely for memorizing information.

More thoughts on my latest LLM recap: https://t.co/xNaUkBbl4X

421

316

42K

jonherzig retweeted

Jerry Wei

@JerryWeiAI

about 2 years ago

Super cool work from @zorikgekhman and others at @GoogleAI! Our team previously investigated fine-tuning LLMs to reduce sycophancy; one of our key findings was that you have to filter out prompts that the model does not know the answer to. The lesson we learned was that training on new knowledge can lead to unexpected and random behaviors. I'm happy to see that there's been more-comprehensive experiments done on this phenomenon. This paper essentially shows more-extensively that attempting to introduce new knowledge during fine-tuning can result in a model that's more likely to hallucinate, which follows the findings from our work! https://t.co/mA4RUYjckN

jonherzig retweeted