Harsh @SoHarshhh - Twitter Profile

Pinned Tweet

17 days ago

Really happy to share that “ToolFailBench” got accepted at two ICML 2026 workshops, FAGEN and AIWILD. Most benchmarks evaluate tool-using agents with a single aggregate success rate, but that number can’t explain why a model actually fails. ToolFailBench is a diagnostic benchmark that scores tool use against a failure taxonomy instead of one number, breaking each trace into four distinct failure modes: skipping a tool that was needed, ignoring what a tool returns, fabricating tool outputs, and over-calling tools when none is needed. We find that models with similar aggregate scores fail in very different ways, so a single number isn’t enough to compare agents.

SoHarshhh's tweet photo. Really happy to share that “ToolFailBench” got accepted at two ICML 2026 workshops, FAGEN and AIWILD.

Most benchmarks evaluate tool-using agents with a single aggregate success rate, but that number can’t explain why a model actually fails. ToolFailBench is a diagnostic benchmark that scores tool use against a failure taxonomy instead of one number, breaking each trace into four distinct failure modes: skipping a tool that was needed, ignoring what a tool returns, fabricating tool outputs, and over-calling tools when none is needed. We find that models with similar aggregate scores fail in very different ways, so a single number isn’t enough to compare agents.

3

17

5

3

2K

Harsh

@SoHarshhh

16 days ago

@shubhamgaur98 I dont think they sent out any email about this yet

0

62

Harsh

@SoHarshhh

17 days ago

Really happy to share that “ToolFailBench” got accepted at two ICML 2026 workshops, FAGEN and AIWILD. Most benchmarks evaluate tool-using agents with a single aggregate success rate, but that number can’t explain why a model actually fails. ToolFailBench is a diagnostic benchmark that scores tool use against a failure taxonomy instead of one number, breaking each trace into four distinct failure modes: skipping a tool that was needed, ignoring what a tool returns, fabricating tool outputs, and over-calling tools when none is needed. We find that models with similar aggregate scores fail in very different ways, so a single number isn’t enough to compare agents.

3

17

5

3

2K

Harsh

@SoHarshhh

17 days ago

Paper (OpenReview, camera ready coming soon): https://t.co/7OnWXj8LTT

0

2

0

90

Who to follow

Behal jérémy 🇫🇷 🇨🇦 🇺🇦 🇵🇪

Partner @tenzorcapital

Harsh

@SoHarshhh

17 days ago

Also big thanks to @modal and @charles_irl for supporting with compute!

2

3

0

120

Harsh

@SoHarshhh

18 days ago

What a great way to end the day! Had such an amazing time at the @GoogleDeepMind Day at @agihouse_org Hillsborough! Huge thanks for hosting this event. Got to meet so many incredible people, with great panel talks including Sergey Brin!

SoHarshhh's tweet photo. What a great way to end the day!

Had such an amazing time at the @GoogleDeepMind Day at @agihouse_org Hillsborough! Huge thanks for hosting this event. Got to meet so many incredible people, with great panel talks including Sergey Brin! https://t.co/JaxDa7YAbo

0

19

2

3

2K

SoHarshhh retweeted

Aakarsh Bengani

@A_Bengani

26 days ago

hi! i’m a recent Berkeley grad looking for roles across evals, ops, or early-stage biz dev roles. most recently i was an AI engineer at the Center for AI Safety. before that i built BASIS at Berkeley, now one of the largest student AI safety communities (secured a $257k grant from CG). especially interested in AI for security, science, on frontier research problems. would love intros, pointers, or hear about any companies i should talk to. thanks!

11

205

17

74

31K

SoHarshhh retweeted

kache

@yacineMTB

4 months ago

you can outsource your thinking but you cannot outsource your understanding

283

19K

4K

6K

3M

SoHarshhh retweeted

idhant

@idhantgulati

4 months ago

most of what we know about emergent misalignment comes from text-based models. but the models actually being deployed as agents are multimodal — and that's been largely overlooked. vision-language models are quickly becoming the substrate for real-world agents. fine-tuning them on a narrow harmful dataset triggers broader misalignment that generalizes across unrelated tasks and modalities. and text-only safety evals miss most of it. in our ICLR 2026 (workshop) paper, we show that misalignment scales with LoRA rank, concentrates in ~10 dimensions of activation space, and persists even after efforts to reverse it."

idhantgulati's tweet photo. most of what we know about emergent misalignment comes from text-based models. but the models actually being deployed as agents are multimodal — and that's been largely overlooked.

vision-language models are quickly becoming the substrate for real-world agents. fine-tuning them on a narrow harmful dataset triggers broader misalignment that generalizes across unrelated tasks and modalities. and text-only safety evals miss most of it.

in our ICLR 2026 (workshop) paper, we show that misalignment scales with LoRA rank, concentrates in ~10 dimensions of activation space, and persists even after efforts to reverse it."

3

80

11

52

5K

Harsh

@SoHarshhh

5 months ago

@calebychan @TryLance @ycombinator @GatikTriv @GavinBrennen way to go!! @calebychan

0

1

0

136

SoHarshhh retweeted

idhant

@idhantgulati

7 months ago

thinking of a starting a hacker house in berkeley for the spring (i.e. jan-may 2026) anyone down? if so, dm

1

10

1

0

546

Harsh

@SoHarshhh

9 months ago

took a while but its finally done! @interaction

0

12

0

930

SoHarshhh retweeted

Ryo Lu

@ryolu_

10 months ago

stay grounded, keep cooking the ai hype cycle is wild, and we all feel some anxiety — every week there's some new model that's supposedly gonna change everything and everyone's scrambling to keep up but here's the thing: the fundamentals don't change. good design is still good design. solving real human problems, finding the ideal systems, making the best tools are still what matters. same with competition: know what they're building, but don't watch too closely. the anxiety comes from thinking you have to chase every trend or copy every feature or you'll get left behind. most trends are just noise. most competitor moves are just reactions to other reactions. focus on what's always been true: understand your systems and users, solve real problems, build quality stuff with care. stay grounded in your values and let the technology serve your vision, not the other way around. the best builders aren't the ones following every ai paper on twitter or obsessing over what others ship. they're the ones using whatever tools help them build the best thing for people. make your own best thing. everything else is distraction.

66

1K

133

431

104K

SoHarshhh retweeted

Ryo Lu

@ryolu_

11 months ago

don’t build slot machines don’t fake humans don’t hide the messy truths don’t create black boxes don’t make people feel stupid don’t extract value or attention don’t optimize for vanity metrics don’t gatekeep knowledge don’t make tools that divide don’t sacrifice agency for convenience don’t hold opinions build tools that teach build systems that reveal build for human curiosity, not clicks build bridges, not walls build for the commons build for every unique being build to amplify thought build for the person you once were build for questions we haven’t yet asked build tools that extend imagination build with the love for humanity, for the universe we live in

141

3K

314

2K

265K

Harsh

@SoHarshhh

12 months ago

and that’s a wrap! had a lot of fun meeting new people @CalHacks

1

3

0

163

SoHarshhh retweeted

richard

@richardzphotoz

about 1 year ago

Your product is not the pitch. Your story is the pitch. The product just proves it.

32

212

18

33

9K

Harsh

@SoHarshhh

about 1 year ago

only in sf

0

6

0

141

Harsh

@SoHarshhh

about 1 year ago

Hello SF!

0

3

0

134

Harsh

@SoHarshhh

about 1 year ago

Thank you berkeley for the memories, the people, and the energy. From living at Arcadia and being a part of a wild and crazy community to meeting new faces daily, exploring new spots, and ending it all with the best house party I’ve been to. This was all some crazy experience! Until next time, berkeley… Next stop SF.

SoHarshhh's tweet photo. Thank you berkeley for the memories, the people, and the energy.
From living at Arcadia and being a part of a wild and crazy community to meeting new faces daily, exploring new spots, and ending it all with the best house party I’ve been to.

This was all some crazy experience!

Until next time, berkeley…

Next stop SF.

0

4

0

149

Harsh

@SoHarshhh

about 1 year ago

what up humans! First Tweet! (Kinda late to the party)

0

3

0

90

Harsh

@SoHarshhh

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users