Guy Bar-Shalom @GuyBarSh - Twitter Profile

Pinned Tweet

about 2 months ago

New blogpost out 📃 "Detecting LLM Misbehaviors from the Inside Out with Deep Learning on Structured Data" (https://t.co/tdcTr3waPZ) [1/8]

1

10

6

3

2K

GuyBarSh retweeted

Yam Eitan @ytn_ym

12 days ago

1/ How much can you compress an LLM’s KV cache? tl;dr it depends on how you train your model. Many strong context compaction methods, such as Cartridges and attention matching, operate post-hoc: given a fixed model and a context, they try to compress the resulting KV cache. @yoav_gelberg and I ask the complementary question: can we train the model to produce KV representations that are easier to compress? In other words: keep the compression method fixed, and change the representations it sees.

6

99

18

79

29K

Guy Bar-Shalom @GuyBarSh

about 2 months ago

- "Beyond Next Token Probabilities: Learnable, Fast Detection of Hallucinations and Data Contamination on LLM Output Distributions", AAAI 2026 (https://t.co/ixdfiyqNqO) [8/8]

0

50

Guy Bar-Shalom @GuyBarSh

about 2 months ago

- "Neural Message-Passing on Attention Graphs for Hallucination Detection", ICLR 2026 (https://t.co/NotDP8R8bg) [7/8]

1

0

78

Guy Bar-Shalom @GuyBarSh

4 months ago

Check out our new ICLR 2026 paper - we explore hallucination detection through graph learning. Take a look!

Fabrizio Frasca @ffabffrasca

4 months ago

🧵 "Neural Message Passing on Attention Graphs for Hallucination Detection" at #ICLR2026! 🕸️ We apply GNNs on the structured data LLMs produce as they generate text (e.g. attentions) to predict their errors. 📄 https://t.co/IQEyA7zaht 🤝 @GuyBarSh (co-1st) @YftahZ @HaggaiMaron

1

75

15

46

9K

0

17

5

6

1K

Guy Bar-Shalom @GuyBarSh

4 months ago

These works were joint efforts with a group of amazing collaborators: @ffabffrasca @HaggaiMaron @YftahZ @ytn_ym @yaniv_galron @yoav_gelberg @itayevron @mayabechlerspei @ido_guy Ami Tavory Moshe Eliasof Ran Elbaz

2

5

0

346

Guy Bar-Shalom @GuyBarSh

4 months ago

Happy to share my new #ICLR2026 papers!

3

16

2

1

3K

Guy Bar-Shalom @GuyBarSh

4 months ago

📌 [4/4] "On the Expressive Power of GNN Derivatives": We study how using gradients of GNNs can increase their expressive power, providing a principled way to go beyond standard message passing. https://t.co/XwyjvdhVNw

1

4

1

0

306

GuyBarSh retweeted

Haggai Maron @HaggaiMaron

6 months ago

📄 "Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT" w/ @GuyBarSh, @ffabffrasca, Yaniv Galron, @YftahZ https://t.co/D9M59DcbzV

0

9

2

0

667

GuyBarSh retweeted

Fabrizio Frasca @ffabffrasca

6 months ago

@GuyBarSh and I will be presenting the poster today, stop by 🤗 📍 Fri, Dec 5 • 4:30–7:30 PM PST • Exhibit Hall C,D,E # 4000

0

7

3

2

899

GuyBarSh retweeted

Omer Belhasin @omerbelhasin

6 months ago

🤔 Can discrete diffusion models actually outperform standard classifiers? We show that they can! 📄 https://t.co/TwQx7iP17o 💻 https://t.co/TjPeGWcIED 🌐 https://t.co/ga3YOazPog

1

17

4

16

7K

Guy Bar-Shalom @GuyBarSh

8 months ago

[7/7] Code: https://t.co/pLAWJRwWfL

0

2

1

295

Guy Bar-Shalom @GuyBarSh

8 months ago

[1/7] New paper: "Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT" #NeurIPS2025 (https://t.co/8aD1n8ejyp) Joint work with: @ffabffrasca (co-first), @yaniv_galron, @YftahZ, @HaggaiMaron

1

16

3

6

4K

Guy Bar-Shalom @GuyBarSh

8 months ago

[6/7] Results (over 15 LLM/dataset combinations):
• Consistently outperforms classic probes
• Zero-shot generalization to new datasets
• Fast adaptation to unseen LLMs by tuning only their new corresponding adapter

1

0

84
