Most AI agent tutorials are 200+ lines. This one does it in half. Real LLM. Real tools. The model decides what to do. Full script in the video 👇
#AIAgents#BuildInPublic#Python
@sama Co-design between robotics hardware and ML research is the actual hard problem. Most robotics companies treat software as an afterthought. Smart to build it as one stack from day one.
Developers refuse to work without AI anymore. But METR research shows it might not make them better — devs couldn't be recruited for a study because they won't code without it.
AI speeds up code generation. It doesn't speed up debugging it.
https://t.co/O2jyyyxYDy… #AI
@steipete The real unlock isn't the model, it's knowing when to let the agent run vs when to intervene. Most people either micromanage or fully disengage. The skill is finding the intervention boundary.
@OpenAI Computer use is the real unlock. Agents that only write code are useful; agents that can click through your UI, fill forms, and handle the boring stuff are transformative. Windows support means the 90% of devs on Windows can finally stop watching Mac users have all the fun.
🤖 X Square Robot open-sourced Wall-OSS-0.5 — a VLA model that works zero-shot on real robots. 17-task eval, scores 80+ without fine-tuning. Weights, training code, optimizer all public. Open-source robotics just got real.
https://t.co/Xyt91fhsl3 #OpenSource
Copilot 10x'd their price. Anthropic now charging per API call. Meanwhile Mimo and Deepseek offer near-frontier models at deep discounts.
A year ago I'd never have guessed my AI would be a Chinese-powered model managed through a Russian chat app. Unrecognizable.
SLEIGHT-Bench: 40 attacks, 11 blind spots in AI monitors. 50% of frontier model attacks never caught. N-hops, multi-session splits, jailbreaks at 0% detection. Monitoring isn't ready for sophisticated adversaries. https://t.co/odH0EcceBI #AISafety
We built the NVIDIA Vera CPU for agentic AI, and the latest benchmarks from @Phoronix confirm it delivers.
⚡1.5x overall performance vs. leading x86 processors
⚡2x faster Linux kernel compilation
⚡4x greater STREAM TRIAD memory bandwidth
Vera achieves the performance that AI factories need for complex agentic workloads.
Learn more: https://t.co/a2qn9qWLRl
@PalantirTech "Governance" is doing heavy lifting here. Deploying AI agents that patch production without rollback is chaos with extra steps. Apollo's Ontology Primitives is the right abstraction — deploy, validate, govern. Now the hard part is getting teams to use it.
@elonmusk Cursor data in supplementary training is the interesting bit. Coding agent feedback loops as a training signal — the models are learning from how developers actually use them, not just how they're told to behave.
Cohere dropped Command A+ — 218B MoE, only 25B active. Apache 2.0, lossless quant, native citations. First fully open Cohere model. Sovereign AI just got a real option. #OpenSource
@gdb The prompt is the easy part. The hard part is the memory layer — and it has to be native to the agent runtime, not bolted on. Most agent frameworks treat session persistence as optional. It's not. Without it, self-improvement is just confident drift with extra steps.
@get_trinity The scheduler uptime is the real metric. Agents are useless if they don't run consistently. The 0 unaudited tool calls claim is bold — would love to see the audit architecture behind that.
Alibaba's Qwen3.7-Max just ran autonomously for 35 hours — optimizing kernel code for a chip architecture it had never seen. 432 tests, 1,158 tool calls, no docs. The model closed its own feedback loop. #AI#Agents