Most agent frameworks put safety rules in the system prompt and hope the model follows them. Wrong layer โ one injection and they're gone.
OpenShell enforces policy via a YAML file.
A supervisor sits outside the agent and controls:
๐ network โ default deny
๐ filesystem โ sandboxed workspace
๐ง inference โ managed endpoint
๐ credentials โ injected at runtime
Compromise the agent, the policies still hold.
OpenShell is key for Agents - NemoClaw is a blueprint:
- Harness (swappable)
- Model (swappable)
- Runtime โ always OpenShell
The runtime is the part doing the security work. Swap everything else, that stays. That's the piece worth understanding.
DeepAgents can be that harness!
@NVIDIAAI@NVIDIAAIDev@LangChain
NVIDIA released Nemotron 3 Nano Omni.
Open with a full paper of recipes for their data and SFT.
One Open model. Text, images, video, audio โ all in one. Mamba-MoE backbone, Vision encoder, Parakeet audio encoder.
7 things every team needs locked down before shipping a multi-user agent to production.
Rogue agents can run up a $10K bill overnight. Agents can hallucinate for 100s of users. Most builders don't notice. Here's what you should be monitoring.
TL;DR ๐
๐ Model control - unified layer between your code and models, swap providers without breaking your stack
๐ Prompt registry - version your prompts, keep them out of the codebase, treat them like a second tier of code
๐ก๏ธ Guardrails - pre and post LLM, pre and post tool calls. Handle PII, PHI, prompt injection, and output filtering
๏ฟฝ๏ฟฝ๏ฟฝ Budget limiting - hard caps per model and project. The cloud providers won't do this for you
๐ง Tools & MCPs - central auth, granular permissions, test every one
๐ Monitoring & Tracing - trace every request, response, error, and latency spike
โ Evals - before go-live and after. Catch the silent failures before your users do
๐๐๐ฆ๐ฆ๐ ๐ has landed with an ๐๐ฉ๐๐๐ก๐ ๐.๐ ๐ฅ๐ข๐๐๐ง๐ฌ๐.
That alone would be news. But what's inside these models makes it a bigger deal.
๐ง๐;๐๐ฅ ๐
๐๏ธ 4 models across 2 tiers โ ๐๐จ๐ซ๐ค๐ฌ๐ญ๐๐ญ๐ข๐จ๐ง (๐๐ณ๐๐๐ฐ๐ ๐ ๐ข๐, ๐ฏ๐ญ๐ ๐ฑ๐ฒ๐ป๐๐ฒ) and ๐๐ฑ๐ด๐ฒ (๐๐ฎ๐, ๐๐ฐ๐)
๐ง 128 experts in the MOE, only 3.8B active per tokenย
๐ Native audio support on Edge models with a 50% smaller audio encoder than Gemma 3N(681M โ 305M params)
๐๏ธ New vision encoder with variable aspect ratios and resolutions โ much better OCR and document understanding
๐ค Built-in chain-of-thought ๐ฟ๐ฒ๐ฎ๐๐ผ๐ป๐ถ๐ป๐ด ๐ฎ๐ฐ๐ฟ๐ผ๐๐ ๐๐ฒ๐ ๐, ๐ถ๐บ๐ฎ๐ด๐ฒ๐, ๐ฎ๐ป๐ฑ ๐ฎ๐๐ฑ๐ถ๐ผ
๐ ๏ธ ๐๐๐ป๐ฐ๐๐ถ๐ผ๐ป ๐ฐ๐ฎ๐น๐น๐ถ๐ป๐ด ๐ฏ๐ฎ๐ธ๐ฒ๐ฑ ๐ถ๐ป from architecture level, not prompt-coaxed
๐ ๐ญ๐ฎ๐ด๐ ๐ฐ๐ผ๐ป๐๐ฒ๐ ๐ on edge, ๐ฎ๐ฑ๐ฒ๐ on workstation models
๐ 140 languages in pre-training, 35 in instruction tuning
@VentureBeat@Sam_Witteveen went deep on why this acquisition is the official obituary for the ChatGPT era.
Read the breakdown on @VentureBeat here: https://t.co/yvOUOlgJab
LangChain's CEO refused to let employees install OpenClaw on company laptops โ then called it one of the most important agent projects in years. The paradox explains everything about where enterprise AI is heading. https://t.co/9T36UkYGXD
It's true: the web is not built for AI agents. But that's changing. @Sam_Witteveen summarizes the emerging WebMCP toolset from @googlechrome, @Microsoft, et al.
Web browsing is expensive and inconsistently effective for agents. WebMCP changes that.
https://t.co/39zgvTVW0I
Anthropic launched Cowork last week.
One competing startup's response? Pivot their product and open-source EVERYTHING.
The tweet announcing it hit 1.7M views.
Here's the @Eigent_AI story ๐งต
Apple going with fine tuning of Gemini (Flash? / Flash Lite?) is probably smarter for them than jumping in to a fully "make your own models" strategy at this point in the game.
๐งต Google just dropped UCP (Universal Commerce Protocol) at NRF with @sundarpichai himself announcing it ๐
This is Google's play for agentic commerce - and it's built with Shopify, Etsy, Target, Walmart & Wayfair. Here's what you need to know ๐
@TRJ_0751@alarcon7a@mweinbach@tabGeeks Sam Witteveen tests the new "vibe coding" capabilities in AI Studioโfrom cloning classic games to building slick news sites.
See what Gemini 3 built in a single shot ๐
https://t.co/C6oEJEbgKk
Kimi K2 Thinking by @Kimi_Moonshot just dropped and it's very impressive. This model is now beating OpenAI, Anthropic, and Google on major benchmarks. The LLM landscape just shifted dramatically.