📢 OneCanvas: 3D Scene Understanding via Panoramic Reprojection
We extract features from video frames and reproject them into one occlusion-free view of the whole scene that a 2D VLM reads just like a normal image. We can center this view on any viewpoint, including an agent's own pose for situated reasoning.
The same projection lets us create spatial training tasks with no human annotation, solvable only by reasoning over the 3D positions of real object features placed on an otherwise empty canvas.
The result is a stock 2D VLM that reasons in 3D, setting a new state of the art across spatial benchmarks at far less compute.
🌐 https://t.co/ilo141614B
▶️ https://t.co/lANFmN5gNy
Great work by @baranowskibrt & @davech2y
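A rough way to picture the panoramic reprojection in the OneCanvas post: take 3D points that carry features and scatter them onto a single canvas centered on a chosen viewpoint. The sketch below is only an illustration under stated assumptions, not the authors' method. It assumes an equirectangular (azimuth/elevation) layout, and when two points land in the same cell it keeps the nearer one, a simplification the post does not describe. The function name `reproject_to_panorama` and all of its parameters are hypothetical.

```python
import numpy as np

def reproject_to_panorama(points, features, center, width=64, height=32):
    """Scatter per-point features onto an equirectangular canvas seen from `center`.

    points:   (N, 3) array of 3D positions (assumed world coordinates, z up)
    features: (N, C) array of per-point feature vectors
    center:   (3,) viewpoint the canvas is centered on (e.g. an agent's position)
    """
    rel = points - center
    dist = np.linalg.norm(rel, axis=1)
    keep = dist > 1e-9                      # drop points sitting on the viewpoint
    rel, dist, features = rel[keep], dist[keep], features[keep]

    az = np.arctan2(rel[:, 1], rel[:, 0])                  # azimuth in [-pi, pi]
    el = np.arcsin(np.clip(rel[:, 2] / dist, -1.0, 1.0))   # elevation in [-pi/2, pi/2]

    # Map angles to pixel columns/rows; azimuth wraps around, elevation is clamped.
    u = ((az + np.pi) / (2 * np.pi) * width).astype(int) % width
    v = np.clip(((np.pi / 2 - el) / np.pi * height).astype(int), 0, height - 1)

    canvas = np.zeros((height, width, features.shape[1]), dtype=features.dtype)
    depth = np.full((height, width), np.inf)
    for i in np.argsort(-dist):             # far to near, so nearer points overwrite
        canvas[v[i], u[i]] = features[i]
        depth[v[i], u[i]] = dist[i]
    return canvas, depth
```

Re-centering the view is just a matter of passing a different `center`; cells that receive no point stay zero, which matches the idea of features placed on an otherwise empty canvas.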
What if you could turn any number of photos (3, 8, 15, or even 60) into one clean 3D surface (pts & mesh) with Flow Matching?
Check out our new work, Surflo: Consistent 3D Surface Flow Model with Global State. 🧵
1/n
🔗https://t.co/lBcJRgpfdg
Reminder: every Hugging Face Space is an API your agents can call :)
I asked mine to build a website about the flowers of France 🌸 and it used VAST AI's TripoSplat Space to turn photos it found into real 3D Gaussian splats, live on the page!
All on my HF Pro ZeroGPU credits (40 min, renewed daily, for only $9/month)
In this notebook, @LaoTzunami visualizes a state space of possible chart types and transitions among them along defined edges. Transitions between unconnected types are achieved by routing through intermediate types. https://t.co/UhfWrtAqhM
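Routing between unconnected chart types through intermediate ones is, at heart, a path search on a graph. Here is a minimal sketch using breadth-first search. The chart types and edges in `EDGES` are made up for illustration and are not taken from the notebook, and `transition_path` is a hypothetical helper name.

```python
from collections import deque

# Hypothetical chart types and the direct transitions allowed between them.
EDGES = {
    "pie": ["bar"],
    "bar": ["stacked_bar", "scatter", "pie"],
    "stacked_bar": ["bar", "area"],
    "area": ["stacked_bar", "line"],
    "line": ["area", "scatter"],
    "scatter": ["bar", "line"],
}

def transition_path(graph, start, goal):
    """Return the shortest list of chart types from start to goal, or None."""
    if start == goal:
        return [start]
    prev = {start: None}                 # visited set plus back-pointers
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, []):
            if nxt in prev:
                continue
            prev[nxt] = node
            if nxt == goal:
                path = [goal]
                while prev[path[-1]] is not None:
                    path.append(prev[path[-1]])
                return path[::-1]
            queue.append(nxt)
    return None                          # goal is unreachable from start

print(transition_path(EDGES, "pie", "line"))
```

With these made-up edges, a pie chart has no direct edge to a line chart, so the animation would pass through the intermediate types on the returned path.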
The quality of animation you can create on your own is truly amazing. We really are just limited by our imaginations at this point. Go tell your story!
Made in @runwayml in a few hours and a handful of gens.
Fun interactive science app ideas | Part 3
Played around with generating 3D biological structures and made an app to explore them interactively
UI Design: GPT Images 2
Code: Gemini 3.1 Pro
More demos ↓
Robots don't need a human face.
Instead of building a humanoid face, we use a real human face as a controller for Reachy Mini's existing non-human face.
Open source, runs in your browser. 🧵
📣 Today, I’m excited to walk you through Unity’s NEW AI offering, Meta MCP Extensions, and agentic tools to demonstrate how we can 𝗕𝘂𝗶𝗹𝗱 𝗔 𝗙𝘂𝗹𝗹 𝗩𝗥/𝗠𝗥 𝗚𝗮𝗺𝗲 from start to finish using these AI tools in a practical way.
🎥 Full video available at: https://t.co/4p5ZzdaRfg
📌 Here’s what I’m covering today:
- Unity VR project setup (with OpenXR plugins)
- Installing the Unity AI Assistant and demos
- Configuring Unity MCP + Meta MCP Extensions
- Configuring Claude Code & MCPs
- Building a VR/MR Basketball Game with the Unity AI Assistant, Claude Agent, and external Claude Code CLI
- A lot of iteration with Claude Code + the Meta XR Simulator
💡Also, it’s been a while since I've posted a new video, and I’m genuinely excited to be back, especially with a topic like this that I know many devs have been waiting for.
Hello, Moon. It’s great to be back.
Here’s a taste of what the Artemis II astronauts photographed during their flight around the Moon. Check out more photos from the mission: https://t.co/rzM1P0QbOl
Liftoff.
The Artemis II mission launched from @NASAKennedy at 6:35pm ET (2235 UTC), propelling four astronauts on a journey around the Moon.
Artemis II will pave the way for future Moon landings, as well as the next giant leap — astronauts on Mars.
This mixed reality app lets you create and ride thrilling rollercoasters in your own living room. Use physics-based tools to design tracks that adapt to your space, then hop in the front seat for a first-person ride like no other.