Vladimir Yugay

Verified account

@vyuga3d

Doing research in 3D Computer Vision. Ph.D. student at @UvA_Amsterdam. Previously at @Meta @TU_Muenchen. Top 100 at @LegionTD2

Amsterdam

Joined December 2022

67 Following

593 Followers

97 Posts

Pinned Tweet

9 months ago

📽️ Check out Visual Odometry Transformer! VoT is an end-to-end model for getting accurate metric camera poses from monocular videos. https://t.co/6tVXVt6mTx VoT requires neither camera calibration parameters nor post-optimization, and it operates in real time, capable of processing thousands of frames. It is trained on a vast amount of real-world indoor data, but can work just fine in outdoor scenarios. It uses only camera poses as supervision - no optical flow, intrinsics, point clouds, or tracks - making it broadly accessible. We experimented with different backbones, camera pose representations, scalability, and attention mechanisms. Our evaluation spans hundreds of full-length videos across various metrics, without aligning the predicted trajectory to the ground truth, to simulate a real-world application. Thanks to the team, @kienduynguyen94, @theogevers, @cgmsnoek, and @Martin_R_Oswald from the @UvA_Amsterdam!

5

266

43

201

16K

3 months ago

@pablovelagomez1 @rerundotio @Gradio Nice work! Adding splatting-based SLAM like MonoGS would be great.

1

1

0

0

254

3 months ago

@techfusionjb It doesn't handle in-view dynamics - only changes to the static scene itself. Lots of work ahead.

0

0

0

0

26

3 months ago

⏩ GaME code release! https://t.co/jnSgngCWJd Grab components for your 3D reconstruction pipeline: 🔹 Purely geometric out-of-view scene change detection 🔹 Outdated observations filtering 🔹 Evaluation videos of changing scenes Contributions welcome 🚀

2

55

9

45

4K

6 months ago

@gabriberton Didn't see much if any improvement for odometry

0

1

0

0

211

9 months ago

@bercankilic We'd love to throw more data in! Alas, academic compute is limited.

0

1

0

0

48

9 months ago

@bercankilic Difficult. Lots of dynamics happening in the view - this is quite different from the data it was trained on. However, if tuned on egocentric dynamic data, I'm pretty confident it would work

1

1

0

0

181

9 months ago

@bllchmbrs customer support is always online😂 ty!

0

1

0

0

105

9 months ago

@bllchmbrs Barely have time to sleep these days 😓

1

1

0

0

367

11 months ago

@gabriberton @changh95 @AjdDavison Exactly. Another aspect is that they were all relatively simple indoor scenes and we had input depth maps for extra verification

0

1

0

0

45

11 months ago

@changh95 @AjdDavison For me, even small DINO features worked better than dbovw for loop closure detection, and there's quite some stuff from @gabriberton

1

2

0

0

80

11 months ago

@gabriberton It's a very strong claim. It may be fine for robots, but for in-the-wild reconstruction (e.g. random phone videos), SLAM (both dense and sparse) is very, very far from being solved. It becomes even more obvious once you move a bit further from academic datasets.

1

1

0

0

167

12 months ago

@maikelborys No. We think that it will be easier for a person who knows ROS to adapt it to ROS than for a person who doesn't know ROS to adapt it to pure Python :)

0

1

0

0

19

over 1 year ago

⏩Code release for MAGiC-SLAM! https://t.co/GHUbY2s54U We vibe-coded hard to make the code as simple as possible. Here are some features you can seamlessly integrate into your 3D reconstruction pipeline right away:

5

256

43

197

20K

12 months ago

@kommentlezz Not really. SfM is typically done offline and operates on 3D point clouds, not 3D Gaussians.

0

0

0

0

22

about 1 year ago

Introducing “Gaussian Mapping of Evolving Scenes”! We present an RGBD mapping system with novel view synthesis capabilities that accurately reconstructs scenes that change over time. https://t.co/9xz7zvf0xx

1

99

18

59

9K

about 1 year ago

@AdamWHarley Absolutely love using the model

0

3

0

0

299

vyuga3d retweeted

about 1 year ago

One of our CVPR highlights 👉 Meet MAGiC-SLAM: multi-agent SLAM powered by rigidly deformable 3D Gaussians for novel view synthesis. New tracking, map-merge & loop-closure kill drift, align maps, and run faster + more accurately than 2-agent baselines on synthetic & real data. @vyuga3d @theogevers @Martin_R_Oswald #CVPR2025

2

86

12

46

7K

about 1 year ago

I will be presenting our previous work at CVPR Nashville. Drop by if you want to chat!

0

1

0

0

202

about 1 year ago

This work was conducted in collaboration with Kersten Thies, @lucacarlone1, @theogevers, @martinoswald, and Lukas Schmid at the Computer Vision Group of the @UvA_Amsterdam and @MIT Spark Lab. https://t.co/4JQEi9zXsp

1

2

0

0

232
