🎉 After one year of teamwork, we are excited to release our 3D foundation model — LingBot-Map!
Unlike DA3/VGGT, LingBot-Map is a purely autoregressive model for streaming 3D reconstruction ⚡
It achieves ~20 FPS on 518×378 resolution over sequences exceeding 10,000 frames — and beyond 🚀
Two key insights behind LingBot-Map:
🔑 Keep SLAM's structural wisdom: build Geometric Context Attention with long-context modeling while maintaining a compact streaming state
🔑 Make everything end-to-end learnable — no optimization, no post-processing
Let's check out our demos 👇
This teaser update is unreal.
It uses AI to create videos from images and adds movement to specific objects.
Seriously some next-level stuff from Runway.