Introducing #Rodin Gen-2.5🚀
🔥World’s 1st 10 MILLION polygon #3D GenAI — down to skin microstructures.
1️⃣1M-poly in 4s
2️⃣Adaptive thinking effort - just like LLMs
3️⃣3D-native textures, no blind spot
4️⃣Batch up to 10 results
💥Manual BANG to Parts & more...
🚨$1 for first mo!
Microsoft has released a 4B parameter model that turns any image into a 3D asset in 3 seconds.
It uses a new geometry format called O-Voxel that converts to a textured mesh in under 100ms on CUDA.
Outputs GLB files with full PBR textures, ready for Blender, Unity, and Unreal.
100% Open Source.
D-Rex by NVIDIA.
photorealistic digital humans that move and look correct under any lighting condition.
- built on EVA +Cosmos DiffusionRenderer
- supports facial expression,3D pose control under novel HDR lighting.
so, Hollywood-grade relightable characters without the usual technical headaches of traditional 3D rendering pipelines.
https://t.co/F0k8hwdL94
GPT2 + Seedance 2.0 → Capoeira Sequence.
Testing for more complex choreography.
(Full workflow + prompts in thread)
PROCESS:
+ GPT2: Choreography diagram
+ GPT2: Camera direction diagram
+ Midjourney: Setting base image
+ Seedance 2.0 Omni-Ref: Video gen
Didn't follow 100%.
Probably needs more than a 15s gen.
📢📢📢introducing 𝐏𝐨𝐰𝐞𝐫 𝐅𝐨𝐚𝐦
A 3D representation that can be ray traced or rasterized in real time, with NO COMPROMISE in quality.
- Project: https://t.co/LkmVQjkIt2
- arXiv: https://t.co/TtMbyKrvrp
Rasterized at 3DGS-class FPS
Ray traced at Radiant Foam speeds
A bunch of folks have been building machine learning models that turn a photograph into a 3D environment made of Gaussian splats (read: blobs of color floating in space).
Cool technology & a very admirable effort. But marketing these as "world models" seems wrong.
More accurate would be to say that they are a riff on the broader class of image-conditioned 3D generators, with a somewhat different flavor of condition image and output representation.
As far as world modeling, they don't make great predictions about how the natural world looks or behaves. (Even for, say, a chair behind a table.)
Again: I love the technology. Super cool creative stuff. I don't love the marketing and hype around it.
Took it further with @omma_ai, (@splinetool ) now it works on video, and everything working in a Web Browser.
Drop a video, Depth Anything v2 calculates the depth map frame by frame, and Three.js renders a fully lit 3D mesh in real time.
Depth estimation on video in the browser
+Automatic baking pass
+ Dynamic lighting reacting to the geometry
#vibecoding #threejs #webgl
🚀We just released Asset Harvester, an image-to-3D model and end-to-end pipeline that extracts real object assets from autonomous driving videos!
🌐 Website: https://t.co/vXnFVW1ui8
💻 Code: https://t.co/3q3vcRvojy
[1/5]
#AssetHarvester#AVSimulation#WorldModel #AutonomousDriving
The @playcanvas team has solved collision for 3D Gaussian splats. Install splat-transform via NPM to get a CLI tool + library that can output high quality voxel-based collision. Here you can see a splat navigated in first person mode with voxel rendering toggled on/off. 🧵
I’ve been building a lightweight, WebGL/WebGPU-powered alternative to AutoCAD
- Full DXF Pipeline: Seamless import and export modules.
- Workspace Support: Full implementation of Viewport, Layout, Paper space, and Model space.
- Draw Tools: polylines, colors, HATCH etc.
📢Face Anything: 4D Face Reconstruction from Any Image Sequence
Transformer model for 4D face reconstruction and dense tracking:
- predict canonical facial coordinates per pixel
- tracking as reconstruction in canonical space
- geometry + correspondences in one forward pass
Key idea: a shared canonical space across frames
- correspondences as nearest neighbors
- no motion or deformation estimation
Stable geometry and tracking, even under large expressions and viewpoint changes - check out our results!
🌐 https://t.co/VRF2UFYo6Y
▶️ https://t.co/qMv8IKpy6R
Great work by @UmutKocasa4344, @SGiebenhain, @richard_o_shaw
Depth Anything V2 is a total beast for real-time 3D.
Built a website that hits 30fps+ depth reconstructions from a single camera feed, extruding 12k voxels via WebGPU. Pure ML running local Vision Transformers with Three.js and zero backend lag.
Lmk in the comments if you want a copy! 🙌
Experience and music made with AI in @omma_ai (tool by @splinetool).
Introducing 3D Gaussian Splatting to Mesh 3.0 by KIRI Engine! I'd like to say it gives the best mesh quality from 3DGS by far in the market! Showcases: https://t.co/H7PCeyAz5p If you are getting trouble with bad results in phone scanning, try this out. Special thanks to paper GGGS (https://t.co/lgW94U8iu6), which inspired us a lot! #GaussianSplatting #3DScan #CVPR2026