Truly unbelievable
GLM 5.2 just released and it's an open weights model you can run locally
The insane part is, it's just as good as Opus 4.8
Unlimited, free super intelligence running on your desk
In this video I cover how it works, and how to set up your first local model:
🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM
1/n
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
Jane Street AI Engineer revealed how they trained their own LLM for trading to make $22.5B/year
16 minutes. free. straight from tier-1 quants.
bookmark & watch - this is the most honest "AI inside a hedge fund" talk ever published.
forget the "AI trading bot" YouTube grifters. This is the real inside view: data, training, evals, integration.
then start building your own bot using post below.
Fun interactive science app ideas | Part 3
Played around with generating 3D biological structures and made an app to explore them interactively
UI Design
GPT Images 2
Code
Gemini 3.1 Pro
More demos ↓
Claude Code just got an update nobody is talking about. And it replaced my entire video crew.
You type one simple sentence.
Claude writes the script.
It uses ElevenLabs to clone your voice.
It calls Remotion to build the scenes.
It grabs HeyGen to make your avatar talk.
Ten minutes later, you have a finished video.
You never even turn on a camera.
This changes how we make content forever.
Jensen Huang: “If that $500,000 engineer did not consume at least $250,000 worth of tokens, I'm going to be deeply alarmed.”
The Nvidia CEO expects his highly paid engineers to be spending at least HALF their salaries on tokens to supercharge their abilities.
@Jason:
“ The conversation we've had on the pod a number of times is, ‘Oh my God, look at the token usage in our companies.’ It is growing massively.”
“And some people are asking, ‘Hey, when I join a company, how many tokens do I get? Because I want to be an effective employee.’”
“You've postulated, I believe, $75,000 in tokens for each engineer, something like that.”
“So are you spending, at Nvidia, $1 billion, $2 billion on tokens for your engineering team right now?”
Jensen:
“We're trying to.”
“Let me give you the thought experiment: Let's say you have a software engineer or AI researcher and you pay them $500,000 a year. We do that all the time.”
“That $500,000 engineer, at the end of the year, I'm going to ask them, how much did you spend in tokens?”
“If that person said, ‘$5,000,’ I will go ape… something else.”
“If that $500,000 engineer did not consume at least $250,000 worth of tokens, I'm going to be deeply alarmed.
“And this is no different than one of our chip designers who says, ‘Guess what? I'm just going to use paper and pencil, I don't think I'm going to need any CAD tools.’”
Jason:
“This is a real paradigm shift, to start thinking about these all-star employees, it almost reminds me of what we learned in the NBA when LeBron James started spending a million dollars a year just on his health and his body, like in maintaining it. Here he is at age 41, still playing.”
“These are incredible knowledge workers. Why wouldn't we give them superhuman abilities?”
We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity.
This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.
Introducing Claude Design by Anthropic Labs: make prototypes, slides, and one-pagers by talking to Claude.
Powered by Claude Opus 4.7, our most capable vision model. Available in research preview on the Pro, Max, Team, and Enterprise plans, rolling out throughout the day.
Today we're open sourcing https://t.co/p76KVdY7dG, a reference platform for cloud coding agents.
You've heard that companies like Stripe (Minions), Ramp (Inspect), Spotify (Honk), Block (Goose), and others are building their own "AI software factories". Why?
1️⃣ On a technical level, off-the-shelf coding agents don't perform well with huge monorepos, don't have your institutional knowledge, integrations, and custom workflows.
2️⃣ On a business level, the moat of software companies will shift from 'the code they wrote', to the 'means of production' of that code. The alpha is in your factory.
Open Agents deploys to our agentic infrastructure: Fluid for running the agent's brain, Workflow for its long-running durability, Sandbox for secure code execution, AI Gateway for multi-model tokens.
(Because of our focus on Open SDKs and runtimes, this codebase is a gem even if you're not hosting on Vercel.)
TL;DR: if you're building an internal or user-facing agentic coding platform, deploy this:
https://t.co/xdsc42nbDN