Building autonomous agents for scientific discovery? 🧬🤖
@GoogleDeepMind Science Skills is now available on GitHub. We've open-sourced this specialized toolkit to accelerate your agentic workflows with scientific grounding and higher token efficiency.
Download now ↓
https://t.co/cwp1HOeKvo
🚀 Codex CLI 0.137.0 is out!
🤖 Multi-agent v2 per-thread runtime choice
🌐 Web and image tools in more flows, parallel search
🔌 codex plugin list --json with cached catalog
Changelog: https://t.co/sKpn9YVFmE
A harnessed LLM agent, clearly explained!
Most people picture this as a model with tools bolted on. The real architecture inverts that relationship.
The model itself is deliberately thin. Intelligence gets pushed outward, and the harness composes it at runtime.
Three dimensions orbit the harness core:
- 𝗠𝗲𝗺𝗼𝗿𝘆 holds the state a model shouldn't carry in weights or context. Working context, semantic knowledge, episodic experience, and personalized memory each have their own lifecycle.
- 𝗦𝗸𝗶𝗹𝗹𝘀 hold procedural knowledge. This can cover operational procedures, decision heuristics, and normative constraints that specialize the general model per task.
- 𝗣𝗿𝗼𝘁𝗼𝗰𝗼𝗹𝘀 hold the interaction contracts. Agent-to-user, agent-to-agent, and agent-to-tools are three distinct surfaces with their own failure modes.
Between the core and these modules sit the mediators, like sandboxing, observability, compression, evaluation, approval loops, and sub-agent orchestration.
They govern how the harness reaches out and how state flows back in.
The useful question this framing unlocks is: for any new capability, where should it live?
- Stable knowledge goes to memory
- Learned playbooks go to skills
- Communication contracts go to protocols
- Loop governance goes to the mediators
Harness design becomes a question of what to externalize, and how to mediate it.
I'm building a minimal agent harness from scratch and will open-source it soon.
In the meantime, my co-founder wrote an article about the anatomy of Agent Harness, covering the orchestration loop, tools, memory, context management, and everything else that transforms a stateless LLM into a capable agent.
Read it below.
You should want to control and host your own memory
It’s the one thing that you should be able to take to any platform
Watch for this to be a defining battle in the new browser war: the AI harness wars of 2027