Roni Itkin @itkroni - Twitter Profile

Pinned Tweet

about 2 months ago

We introduce 🌍GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens.🌍 Most feed-forward 3DGS methods still start from pixel, voxel, or dense view-aligned primitives. We take a different route: align first, decode later. 🧵👇

1

44

17

7

3K

ItkRoni retweeted

Omer Benishu @omerbenishu

10 days ago

🚨 Excited to share our new paper: "PhyGenHOI: Physically-Aware 4D Generation of Dynamic Human-Object Interactions"! 🎉 We tackle generating photorealistic 4D interactions by deeply coupling generative human motion diffusion with physical simulation! 🧠🧵👇(1/4)

3

20

6

2

342

ItkRoni retweeted

Hadar Davidson @HadarDavidson

11 days ago

Excited to share Colored Noise Sampling (CNS)!🎉 Instead of injecting white noise, our SDE sampler exploits the inherent spectral bias of diffusion models. We dynamically color the injected noise to focus on frequencies where details are missing, substantially improving FID.🧵1/9

5

264

32

162

14K

ItkRoni retweeted

Anpei Chen @AnpeiC

12 days ago

We suggest going back to relative pose modeling, enabling efficient and robust 3D reconstruction with low memory overhead — in both streaming and offline settings. Project: https://t.co/Ji6Uipd5sL Paper: https://t.co/bUzsHX3XdQ

0

102

15

67

9K

ItkRoni retweeted

Guy Yariv @guy_yariv

21 days ago

Our paper: "LaMI: Augmenting Large Language Models via Late Multi-Image Fusion" has been selected for an Oral Presentation at #ACL2026! LaMI boosts LLM visual commonsense by generating complementary images from a text prompt and late-fusing their evidence into the prediction 🧵

5

62

17

16

3K

ItkRoni retweeted

Michael Baltaxe @MichaelBaltaxe

21 days ago

Happy to share that our work "RAD: Retrieval-Augmented Monocular Metric Depth Estimation for Underrepresented Classes" was accepted to #CVPR2026 (findings). Check out the project page: https://t.co/HwIMDBPxXd

2

75

16

36

5K

ItkRoni retweeted

Avital Shafran @AvitalShafran

about 1 month ago

Very excited to share the first paper from my postdoc, led by the talented @JieZhang_ETH . This was an extremely fun project with a great group of people 🥸

3

27

6

0

1K

ItkRoni retweeted

Guy Yariv @guy_yariv

about 1 month ago

✨Happy to share that our paper DyPE has been accepted to #ICML2026!

0

25

4

10

3K

ItkRoni retweeted

Xingyu Chen @RoverXingyu

about 2 months ago

Wonderful collaboration with @ItkRoni @IssacharNoam @YehonatanKe @RoverXingyu @AnpeiC @BenaimSagie !

0

4

1

0

244

ItkRoni retweeted

Anpei Chen @AnpeiC

about 2 months ago

GlobalSplat: Stop unprojecting, start decoding. 🛠️ We fuse all input views into a fixed set of Global Scene Tokens to build high-fidelity 3D assets without the pixel-wise redundancy. ✅ Higher quality ✅ Better spatial allocation 🔗 https://t.co/m6ROsAOck8 #3DGS

3

284

39

231

19K

ItkRoni retweeted

Zhenjun Zhao @zhenjun_zhao

about 2 months ago

GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens @ItkRoni, @IssacharNoam, @YehonatanKe, @RoverXingyu, @AnpeiC, @BenaimSagie tl;dr: all input views->a fixed number of latent scene tokens->decoder->explicit 3D Gaussians https://t.co/g038S2pRsz

3

40

5

19

3K

Roni Itkin

@ItkRoni

about 2 months ago

Joint collaboration with: @IssacharNoam @YehonatanKe @RoverXingyu @AnpeiC @BenaimSagie 📄 Paper: https://t.co/lP6r5FXWmh 🌐 Project: https://t.co/hvONwLkqsK

0

3

0

247

Roni Itkin

@ItkRoni

about 2 months ago

Strong efficiency-quality operating point:
24 views on A100:
1.79 GB peak GPU memory
77.88 ms inference
3.8 MB on disk
With as few as 2–32K Gaussians, 🌍GlobalSplat🌍 has better PSNR on RE10K than feed-forward 3DGS methods that use hundreds of thousands to millions of Gaussians. https://t.co/WvaL5ivxKZ

1

2

0

1

283
