Aladdin Kaya @AladdinKayaa - Twitter Profile

Pinned Tweet

about 1 month ago

I noticed something simple: systems don’t fail suddenly. they drift. retries ↑ latency ↑ cycle_time ↓ Built a small demo that pauses before failure compounds. Still early, but it works: https://t.co/oeukf6Nvnx

1

0

2K

Aladdin Kaya @AladdinKayaa

about 12 hours ago

The challenge is no longer building intelligence. The challenge is knowing when to trust it. #AIInfrastructure #Verification #Governor

Dario Amodei

@DarioAmodei

about 13 hours ago

Alongside it, Anthropic is releasing a proposal for how governments can address the risks posed by frontier AI and a policy framework for job displacement, for which we intend to provide substantial financial backing. https://t.co/P0W2lIKbdY

19

428

23

92

85K

0

58

Aladdin Kaya @AladdinKayaa

about 13 hours ago

As agent systems scale, verification becomes infrastructure. #AIInfrastructure #Agents #Verification

Claude

@claudeai

about 15 hours ago

Dynamic workflows in Claude Code are now generally available. For complex tasks like codebase-wide bug hunts, Claude writes its own orchestration and runs subagents in parallel, verifying the work before it reaches you. Read more: https://t.co/nbNpvkfRBZ

10

116

10

32

39K

0

54

Aladdin Kaya @AladdinKayaa

1 day ago

@bcherny Self-verification increases autonomy. It does not automatically guarantee trustworthiness.

0

81

Aladdin Kaya @AladdinKayaa

1 day ago

@yunta_tsai Capability compounds. Understanding compounds faster. The second one is harder to automate.

0

43

Aladdin Kaya @AladdinKayaa

1 day ago

@mattpocockuk Queues tell us where work is waiting. They don’t tell us whether the system is actually making progress.

0

36

Aladdin Kaya @AladdinKayaa

3 days ago

Interesting result. Accuracy improved dramatically after adding a deterministic execution layer. The question may no longer be: “Can the agent do the task?” but: “Can we trust how the task was completed?” Reliability is becoming infrastructure

Anthropic

@AnthropicAI

3 days ago

New Science Blog: Why has AI advanced faster in coding than in biology? To agents, bio databases are like cities built before cars—maddening to drive in because they're designed for different traffic. How do we build infrastructure agents can use? https://t.co/PQaNQ4GRJZ

303

4K

481

2K

619K

0

148

Aladdin Kaya @AladdinKayaa

3 days ago

@garrytan Capability without reliable judgment eventually becomes a trust problem.

0

32

Aladdin Kaya @AladdinKayaa

3 days ago

@bcherny The interesting question isn’t whether a loop can run for days. It’s whether we can tell when its judgment starts drifting during those days.

0

48

Aladdin Kaya @AladdinKayaa

5 days ago

@MicrosoftLearn I’m focusing on the layer between capability and judgment. As agents become more autonomous, knowing when to stop, review, or escalate may become just as important as knowing how to act.

1

0

63

Aladdin Kaya @AladdinKayaa

6 days ago

The biggest AI story may not be AI replacing tasks. It may be AI accelerating AI development. That’s where the compounding begins.

0

103

Aladdin Kaya @AladdinKayaa

8 days ago

@mipsytipsy The capability leap is real. The operational trust debt is real too.

0

1

0

58

Aladdin Kaya @AladdinKayaa

10 days ago

@emollick The hard part is no longer proving AI can generate value. The hard part is governing reliability once that value scales across the organization.

0

101

Aladdin Kaya @AladdinKayaa

11 days ago

@garrytan The interesting shift is that prompts are slowly becoming trainable artifacts instead of static instructions. Evaluation loops may become more important than prompt writing itself.

0

100

Aladdin Kaya @AladdinKayaa

11 days ago

@GoogleAIStudio Vibe coding governance layers before agents start vibe deploying themselves into production.

0

42

Aladdin Kaya @AladdinKayaa

13 days ago

@MicrosoftLearn Interesting seeing governance move from “security add-on” to core agent infrastructure. The stack is shifting from: build agents → govern execution.

1

0

285

Aladdin Kaya @AladdinKayaa

13 days ago

@garrytan Infinite compute doesn’t remove the need for judgment. It amplifies the consequences of bad judgment.

0

1

0

80

Aladdin Kaya @AladdinKayaa

14 days ago

@claudeai As agents become more autonomous, capability stops being the bottleneck. Long-running reliability becomes the real problem. Not whether systems can act. Whether they remain trustworthy while acting independently for hours.

0

921

Aladdin Kaya @AladdinKayaa

14 days ago

Governor v0.1 started as a simple operational drift signal. What became obvious very quickly: the hardest failures in agent systems are rarely visible crashes. They’re silent degradations hidden behind healthy dashboards. v2 expands deeper into that layer.

0

168

Aladdin Kaya @AladdinKayaa

17 days ago

This is exactly why operational drift matters. As model capability commoditizes, judgment becomes the real bottleneck. Most systems do not fail instantly. They drift first. https://t.co/oeukf6Nvnx

Lenny Rachitsky

@lennysan

18 days ago

Automation is a lie. CLIs are over. The SaaSpocalypse is dumb. A year ago @danshipper came on the podcast to predict where AI was heading. He was remarkably right—including the call that everyone was sleeping on Claude Code. Dan has a unique lens into where things are going because his team at @every is possibly the most AI-pilled group of people in tech. I always learn a ton talking to Dan. So I brought him back for round two. We'll score these in exactly a year: 🔸 Every company will have one “super-agent” in Slack. 🔸 Codex and Claude Code will become the new operating system for knowledge work. 🔸 The AI job apocalypse is not happening. 🔸 PMs and designers will thrive. 🔸 We will read way more AI-generated writing and we will like it. 🔸 "I would buy SaaS stocks right now." Listen now 👇 https://t.co/wzxQ5bz49h

88

1K

121

3K

2M

0

343

Aladdin Kaya @AladdinKayaa

18 days ago

Governor v0.1 Operational drift detection for agent systems. Not a blocker. A control point. Most systems do not fail instantly. They drift first. https://t.co/0Ruj82hn3o

0

131

Aladdin Kaya

@AladdinKayaa

Last Seen Users on Sotwe

Trends for you

Most Popular Users