Launching our new paper on arXiv: we trained the largest multilingual food model ever built.
4.1M recipes. 7 languages. 1,790 ingredients. 300 dimensions.
All of human cooking compressed into 2 megabytes.
@cerebras This is complete absurdity. Any model can run hyper-fast if you isolate it and throw a massive amount of brute-force compute at it. But show us the actual cost. Gemini delivers that kind of speed at a fraction of your infrastructure cost—stop misleading people …
@cerebras This is complete absurdity. Any model can run hyper-fast if you isolate it and throw a massive amount of brute-force compute at it. But show us the actual cost. Gemini delivers that kind of speed at a fraction of your infrastructure cost—stop misleading people …
Today we're sharing our work on interaction models. A new class of model trained from scratch to handle real-time interaction natively, instead of gluing it onto a turn-based one.
https://t.co/MoS5s4cm60
Imagine how great life would be for us humans if we could offload so many of these repetitive, exhausting, and physically demanding tasks — like cooking, caregiving, cleaning, laundry, grocery runs, and yard work — to robots! @gs_ai_ Great work , all the best!
We are back. After one year of quiet building.
Introducing GENE-26.5, our first robotic brain that takes a major step toward human-level capability.
For years, robotics has struggled to learn from the world’s largest and valuable data source: Humans.
Solving it means rethinking the whole stack from the ground up:
- A robotics-native foundation model.
- A 1:1 human-like robotic hand.
- A noninvasive data collection glove for motion, force, and touch.
- A simulator that turns weeks of experiments into minutes.
GENE-26.5 is trained across language, vision, proprioception, tactile, and action. We designed a set of tasks to test how far we can go with this new paradigm.
Fully autonomous, 1x speed, one model, same weights. (Enjoy with sound on)
We are approaching the endgame for robotics.
And this is just a beginning.
South Korea's first humanoid robot monk made its debut at Jogye Temple in Seoul, ahead of Buddha's birthday. Gabi, the 130-centimeter-tall robot, wore a traditional grey-and-brown Buddhist robe and stood before monks as it pledged to devote itself to Buddhism
On June 1 and 2, under the aegis of International Big Cat Allaince, an initiative spearheaded by PM Shri @narendramodi ji, over 400 global stakeholders will deliberate on ways to bring big cat conservation into the mainstream of national development agendas.
🔗https://t.co/W7IjCXwE8t
Introducing Flue — The First Agent Harness Framework
Flue is a TypeScript framework for building the next generation of agents, designed around a built-in agent harness.
Flue is like Claude Code, but 100% headless and programmable. There's no baked in assumption like requiring a human operator to function. No TUI. No GUI. Just TypeScript.
But using Flue feels like using Claude Code. The agents you build act autonomously to solve problems and complete tasks. They require very little code to run. Most of the "logic" lives in Markdown: skills and context and AGENTS.md.
Flue is like Astro or Next.js for agents (not surprising, given my background 🙃). It's not another AI SDK. It's a proper runtime-agnostic framework. Write once, build, and deploy your agents anywhere (Node.js, Cloudflare, GitHub Actions, GitLab CI/CD, etc).
We originally built Flue to power AI workflows inside of the Astro GitHub repo. But then @_bgiori got his hands on it, and we realized that every agent needs a framework like Flue, not just us.
Check it out! It's early, but I'm curious to hear what people think. Are agents ready for their library -> framework moment?
AI co-clinician is our new research initiative to help explore how multimodal agents could better support healthcare workers and patients. 🩺
Here’s a snapshot of our progress 🧵
🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM
1/n