- <1B params
- supports 91 languages
- 5 pages/s on RTX 5090
- runs on CPU, GPU, MPS
- 83.3% olmocr bench score (top under 3B)
Surya OCR is a state-of-the-art model for document intelligence.
100% open-source.
A completely local agent that lives right inside your pocket. 📱
Watch Gemma 4 run 100% locally in the Google AI Edge Gallery app. It converts images into JSON schemas, transcribes audio, and uses agent skills to interact with apps, all entirely offline.
We are entering a new era of on-device automation. ✨
Watch Gemma 4 E4B navigate and drive an iOS simulator directly using Argent. Local models can handle complex interactions and software navigation autonomously.
Japan Airlines will trial humanoid robots for baggage handling and aircraft cleaning at Tokyo's Haneda Airport starting in May, citing workforce shortages and rising tourist numbers
Today we’re giving an update on ramping F.03 production at BotQ
In the last 120 days, Figure scaled manufacturing 24x - from 1 robot/day to 1 robot/hour
We will manufacture 55 humanoid robots this week
Run promptable segmentation with Ultralytics YOLO26-E! 🧠
Segment objects using natural language prompts with YOLO26-E, ideal for flexible visual search, rapid annotation, and interactive computer vision workflows.
Get started ➡️ https://t.co/7F2WlQqtAp
#Ultralytics#YOLO26 #AI #Research
Claude can actually do CAD now in @Onshape
Here it worked for an hour and built a 4-part monitor arm, starting only from a sketch and description. The trick was to give it the tools to look at its own work.
Introducing: Jarvis Onshape MCP
🤯BREAKING: Researchers just mathematically proved that AI layoffs will collapse the economy: and every CEO already knows it.
The AI Layoff Trap. A game theory paper from UPenn + Boston University is glaringly important!
100K+ tech layoffs in 2025. 80% of US workers exposed. And no market force can stop it.
→ Every company fires workers to cut costs
→ Every fired worker stops buying products
→ Revenue collapses across every sector
→ The companies that fired everyone go bankrupt
It's a Prisoner's Dilemma with math behind it. Automate and you survive short-term. Don't automate and your competitor kills you. But everyone automating destroys the demand that makes all companies viable.
UBI (universal basic income) won't fix it.
Profit taxes won't fix it.
The researchers found only one solution: a Pigouvian automation tax "robot tax"
The AI trap on the economy is here!
Friendly reminder that Google has an official app to run Gemma 4 on your phone.
- 100% open source
- Fully offline and private
- Multimodal with text/audio/image
- Works with Gemma E4B and E2B
And the app is available on both iOS and Android.
Steps and download below
Gemma 4 just dropped. I had it captioning video in real-time within an hour.
Running locally on a MacBook. No cloud. No API. Real-time scene understanding.
Oh and SAM3 is segmenting every object in the same frame. Same laptop.
Stop whatever you're doing, Bengaluru.
Announcing OpenAI's 1st Official Codex Hackathon in India. Over 100 builders, $100,000 in prizes. This April 16th, build ambitious ideas 👇🏻
We’re bringing together 100 of the most ambitious builders and developers who already use AI coding tools daily and are comfortable shipping production-grade code.
Build something new from scratch or push an existing open-source project further with Codex. Spin up subagents & parallel tasks, use Plugins, and leverage worktrees to operate at a higher level of velocity and ambition.
Excited to do this with good folks @gabrielchua, @yashrajnayak, @nhlhomer & @harshitm29
Perks
+ Assured API credits
+ Assured 1 month of ChatGPT Pro
+ Chance to win $100,000 in credits & subscriptions
Only 100 spots.
Apply from the link in the thread.
Vibe Coding XR: prompt → working XR app in <60s.
Built on Gemini + XR Blocks, it generates physics-aware WebXR experiences with spatial interactions out of the box.
No heavy setup. Just describe and test in headset or simulator.
https://t.co/tHTnw4VCtJ
You can now enable Claude to use your computer to complete tasks.
It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk.
Research preview in Claude Cowork and Claude Code, macOS only.