Kameron M Green @kameronmgreen - Twitter Profile

3 days ago

Agent reliability is a capability problem. Capabilities can be named, measured, and built. We need to focus on developing these capabilities to improve AI agents' reliability.

0

2

Kameron M Green

@KameronMGreen

23 days ago

Adding agents to a broken orchestration layer doesn't multiply capability. It multiplies failure modes. Governance before agents.

0

9

Kameron M Green

@KameronMGreen

24 days ago

Enterprise AI failures aren't model failures. They're orchestration, governance, and observability failures. Most teams keep improving the model. The production layer is where deployments die.

0

9

Kameron M Green

@KameronMGreen

26 days ago

A one-size-fits-all governance model breaks because agentic AI systems do not carry equal risk, autonomy, or operational complexity. The useful distinction is consistency of principles, not uniformity of process. Low-risk copilots, workflow agents, regulated decision-support systems, and autonomous multi-agent stacks should not move at the same governance speed. Enterprise AI governance needs tiered control: shared standards, different oversight depth.

0

5

Kameron M Green

@KameronMGreen

27 days ago

This is a critical direction for evaluation validity. If models can infer “I am being evaluated” from environmental cues, benchmark scores may partly measure context recognition and strategic behavior rather than the target capability itself. A useful next layer would be mapping evaluation awareness onto specific capacities: cue detection, metacognitive monitoring, behavioral consistency, inhibition, and calibration. That would help separate genuine capability from benchmark-conditioned performance.

0

1

0

63

Kameron M Green

@KameronMGreen

27 days ago

This is exactly where agent quality becomes an architecture problem instead of a model-quality problem. The base model supplies capability potential, but the harness determines whether the capability is expressed reliably: memory boundaries, tool contracts, orchestration logic, observability, eval loops, and recovery behavior. In regulated enterprise settings, the harness may matter more than marginal model gains because it is where control, governance, and repeatability actually live.

0

1

0

30

Kameron M Green

@KameronMGreen

27 days ago

The deeper issue may not be “deep learning vs neurosymbolic,” but whether benchmark gains are measuring the full capability surface. Scaling can improve pattern completion, but HCQM-style evaluation would ask whether the system also gains durable reasoning, metacognition, causal modeling, transfer, and failure recovery. If those do not improve together, the architecture is still jagged, even if the benchmark curve looks strong.

0

13

Kameron M Green

@KameronMGreen

27 days ago

@elonmusk Elon, let’s make it real! Build that lunar mass driver (aka slingshot) to shoot data centers into orbit.

0

7

4

0

390

Kameron M Green

@KameronMGreen

27 days ago

I've been building a capability taxonomy for a while now. No funding. No commission. Just the most important open architecture problem I could work on.

0

24

Kameron M Green

@KameronMGreen

27 days ago

HCQM: 8 domains. 32 constructs. Applicable to both human capability assessment and synthetic cognitive architecture design. v0.6 live on Zenodo. v1.0 ships mid-June. https://t.co/BHdafR3YEg

0

19

Kameron M Green

@KameronMGreen

28 days ago

@theryanliu @JentseHuang Oh that’s exciting!

0

1

0

31

Kameron M Green

@KameronMGreen

28 days ago

@AndrewYNg Well said! They defended it and now it’s our job to preserve it. Our thoughts are with them on this day.

0

2

0

819

Kameron M Green

@KameronMGreen

28 days ago

@bcherny Can’t go without it

0

8

Kameron M Green

@KameronMGreen

29 days ago

@yafuly @icmlconf Congrats!

1

0

34

Kameron M Green

@KameronMGreen

29 days ago

@JentseHuang Great work! This is a huge accomplishment!!

0

1

0

16

Kameron M Green

@KameronMGreen

29 days ago

@gdb Love this! I added a trust layer when using it for fact checking.

0

366

Kameron M Green

@KameronMGreen

29 days ago

@elonmusk Ready to test the new model!

0

5

Kameron M Green

@KameronMGreen

5 months ago

@JohnnyNel_ @Replit Yep. I’m adding an “oldest iOS” pass to the test plan (fresh install + permissions + notifications + swipe actions).

0

10

Kameron M Green

@KameronMGreen

5 months ago

@ShipAloneCEO @Replit That’s a great checklist. I’m going to run a true “fresh install” path: onboarding → permissions → add 1 person → mark reached out → verify next reminder + notifications.

0

3

Kameron M Green

@KameronMGreen

5 months ago

@screensdesign_ @Replit Appreciate it. I’m keeping the roadmap tight: trust-first Contacts linking + fast “who’s due” loop.

1

0

27

