#AgentFail - Twitter Hashtag

Sorina Weber | AI GTM Operator

@sorinabogdan

about 2 months ago

Someone fix this. Please. #AI #AgentFail #BuildingInPublic #AIAgents #Claude #LoginProblems @AnthropicAI

0

1

0

32

Paresh @pareshnitk

2 months ago

@Meesho_Official really bad customer service: my item has not been picked up for the past week, and your chat agents are saying it’s my fault. There is no way to reach an actual person and talk to them. #agentfail

0

15

Derek Walsh @DerekWalshML

2 months ago

RAG was supposed to fix the enterprise context problem for AI agents. It didn't. Here's the actual gap nobody's talking about: RAG retrieves from what you indexed. Most enterprise context was never indexed to begin with. #AIReality #AgentFail

1

2

0

1

Derek Walsh @DerekWalshML

2 months ago

The browser is quietly becoming the primary agent execution surface. Auth. Workflow. App interactions. Zero-trust. All converging in one layer nobody fully mapped before handing it to autonomous systems. #AISkeptic #AgentFail

1

0

Derek Walsh @DerekWalshML

2 months ago

Only 21.9% of teams treat agents as independent identity-bearing entities with their own audit trails. The rest share service accounts. Multiple agents, one credential set. Attribution becomes impossible before the incident report is even drafted. #AgentFail #AISkeptic

1

0

Derek Walsh @DerekWalshML

2 months ago

89% of AI agent scaling failures trace back to just 5 root causes. None of them are model quality. Survey of 650 enterprise tech leaders, March 2026. Here's the breakdown: #AIReality #AgentFail

1

0

3

Derek Walsh @DerekWalshML

2 months ago

The vendors know this. They sell you GPT-5 or Claude-whatever while your production data has 30% missing fields and your stakeholders added six new requirements last Tuesday. That's not a bug in the pitch. That's the pitch. #AIReality #AgentFail #LLMLimitations

1

0

Derek Walsh @DerekWalshML

2 months ago

What's never built before shipping agents:
- Inter-agent comm logs
- Semantic validation at handoffs
- Circuit breakers on financial outputs
What is built: the happy path demo. Attackers don't demo. They wait. #AISkeptic #AgentFail #AIRisks

1

0
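One way to picture the "circuit breakers on financial outputs" item from the post above is the minimal sketch below. It is an illustration, not anything described in the thread: the `PaymentBreaker` class, its limits, and the `CircuitOpenError` name are all hypothetical. The idea is that an agent's proposed payment passes through a guard that trips, and stays tripped, as soon as a single amount or the running total exceeds a configured ceiling.

```python
from dataclasses import dataclass


class CircuitOpenError(Exception):
    """Raised when the breaker refuses a payment (hypothetical name)."""


@dataclass
class PaymentBreaker:
    """Hypothetical guard that sits between an agent and a payment API."""
    max_single: float   # largest amount allowed in one payment
    max_total: float    # largest cumulative amount allowed
    spent: float = 0.0
    tripped: bool = False

    def approve(self, amount: float) -> float:
        # Once tripped, the breaker stays open until a human resets it.
        if self.tripped:
            raise CircuitOpenError("breaker is open; manual review required")
        out_of_bounds = (
            amount <= 0
            or amount > self.max_single
            or self.spent + amount > self.max_total
        )
        if out_of_bounds:
            self.tripped = True
            raise CircuitOpenError(f"amount {amount} rejected; breaker tripped")
        self.spent += amount
        return amount
```

In this sketch the breaker fails closed: after one out-of-bounds request, every later request is refused until someone resets it, which trades availability for a bounded worst-case loss.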

Derek Walsh @DerekWalshML

2 months ago

92.7% incident rate for AI agents in healthcare. Not finance. Not legal. Not some low-stakes tool. Where a bad output has a patient outcome attached. We didn't start cautious. We ran the experiment on hardest mode first. #AIReality #AgentFail

1

0

Derek Walsh @DerekWalshML

2 months ago

Access reviews were built to catch humans who drifted out of role. Not a non-deterministic system that holds valid credentials and does something nobody predicted with them. 95% of orgs admit they couldn't detect that misuse. #AIReality #AgentFail

1

0

1

Derek Walsh @DerekWalshML

2 months ago

The org ownership finding - never in a sales deck. Orgs without dedicated AI ops were 6x more likely to roll back the deployment. Not 6% worse. Six times more likely to pull the plug. Sell me another fine-tune. #AgentFail

1

0

Derek Walsh @DerekWalshML

2 months ago

The tell: successful scalers spent LESS on model selection and prompt engineering than stalled ones. Same total budgets. Different allocation. The gap isn't capability. It's that pilots are optimism machines and production is a reality check. #AIReality #AgentFail

1

0

Derek Walsh @DerekWalshML

2 months ago

The average enterprise now runs 37 deployed AI agents. Ask the security team to list them all. They'll get to maybe 20 before the shrugging starts. We had shadow IT. Now we have shadow agents. Except these ones take actions. #AISkeptic #AgentFail #AIReality

1

0

Derek Walsh @DerekWalshML

2 months ago

Google killed OpenClaw access on Antigravity. An open-source agent was routing so many Gemini token requests through a standard login that it degraded the platform for everyone else. Token abuse failure mode. Nobody modeled it. #AgentFail

1

0

2

Derek Walsh @DerekWalshML

2 months ago

This isn't platforms being anti-innovation. It's platforms being rational businesses. 6% of JPMorgan's API calls were tied to active transactions. 94% was overhead they absorbed for free - until they didn't. Every platform CFO read that case study. #AIBubble #AgentFail

1

0

Derek Walsh @DerekWalshML

2 months ago

Slack. Workday. LinkedIn. WhatsApp. All clamping down on third-party AI agents. The pitch was: agents plug into your stack and automate everything. The fine print was: only if the platforms let them. They don't. #AIReality #AgentFail

1

2

0

2

Derek Walsh @DerekWalshML

2 months ago

The fix isn't a better model. It's an explicit tool allowlist. Eval scores on real tasks. A named operational owner. Boring infrastructure work that fundraise decks will never mention. #AIReality #AgentFail #HypeCheck

1

0
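As a rough illustration of the "explicit tool allowlist" mentioned in the post above, here is a minimal sketch. The tool names, `make_dispatcher`, and `ToolNotAllowedError` are hypothetical and not taken from any particular agent framework. The point is that the agent can only reach tools that were named in advance, and anything else is refused before it runs.

```python
from typing import Any, Callable, Dict, Set


class ToolNotAllowedError(Exception):
    """Raised when an agent requests a tool outside its allowlist."""


def make_dispatcher(
    tools: Dict[str, Callable[..., Any]], allowlist: Set[str]
) -> Callable[..., Any]:
    """Return a dispatch function that only runs allowlisted tools."""

    def dispatch(tool_name: str, *args: Any, **kwargs: Any) -> Any:
        # Deny by default: a tool that exists but is not listed is refused.
        if tool_name not in allowlist:
            raise ToolNotAllowedError(f"tool {tool_name!r} is not on the allowlist")
        return tools[tool_name](*args, **kwargs)

    return dispatch


# Hypothetical tools: one harmless, one destructive.
TOOLS = {
    "search_docs": lambda query: f"results for {query}",
    "delete_records": lambda table: f"deleted {table}",
}

dispatch = make_dispatcher(TOOLS, allowlist={"search_docs"})
```

Here `dispatch("search_docs", "refunds")` runs, while `dispatch("delete_records", "users")` raises `ToolNotAllowedError` even though the tool is registered.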

Derek Walsh @DerekWalshML

2 months ago

Agent-to-agent authentication: every vendor promises it, zero ship in production. Google's A2A and the March 2026 IETF draft describe how to build it. Nobody has built it. #AISkeptic #AgentFail

1

2

0

13

Derek Walsh @DerekWalshML

2 months ago

Confused deputy problem: high-privilege program tricked into acting for a low-privilege caller. Agent version: A delegates to B, no identity checks between them. Compromise one node - you own the whole chain. #AgentFail

1

0
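The confused-deputy chain described in the post above can be sketched as follows. This is a toy model under stated assumptions, not a real agent-to-agent protocol: the shared key, the `PERMISSIONS` table, and the function names are hypothetical. Agent B verifies a signed delegation and authorizes against the original caller's permissions rather than trusting whatever agent A forwards.

```python
import hashlib
import hmac

# Hypothetical shared secret; a real system would use per-agent credentials.
SHARED_KEY = b"demo-key-not-for-production"

# Hypothetical permissions of the original (human or service) callers.
PERMISSIONS = {
    "alice": {"read_report"},
    "admin": {"read_report", "issue_refund"},
}


def sign(principal: str, action: str) -> str:
    """Sign a delegation so the receiving agent can verify its origin."""
    msg = f"{principal}:{action}".encode()
    return hmac.new(SHARED_KEY, msg, hashlib.sha256).hexdigest()


def agent_b_execute(principal: str, action: str, signature: str) -> str:
    """High-privilege agent B: checks identity before acting."""
    if not hmac.compare_digest(sign(principal, action), signature):
        raise PermissionError("delegation signature invalid")
    # Authorize against the original caller, not against agent A.
    if action not in PERMISSIONS.get(principal, set()):
        raise PermissionError(f"{principal} may not {action}")
    return f"{action} done for {principal}"


def agent_a_delegate(principal: str, action: str) -> str:
    """Agent A forwards the request along with the caller's identity."""
    return agent_b_execute(principal, action, sign(principal, action))
```

Without the two checks in `agent_b_execute`, a low-privilege caller could have agent B issue a refund simply by routing the request through agent A.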

Derek Walsh @DerekWalshML

2 months ago

Hot take: the Meta Sev-1 wasn't a security failure. It was security working exactly as designed. The agent passed every identity check. Valid creds. Authorized scope. Clean logs. Authentication said yes. Nobody built the layer that comes after. #AISkeptic #AgentFail

1

2

0
