Shunsuke Saito @psyth91 - Twitter Profile

Pinned Tweet

10 months ago

Relightable Codec Avatars is now extended to full-body! At #SIGGRAPH2025, we will present Relightable Full-body Gaussian Codec Avatars. Key contributions include learnable Zonal Harmonics and deferred learnable radiance transfer for specular! Check it out! https://t.co/m6REfulkWr

2

177

30

75

14K

psyth91 retweeted

Sadao Tokuyama

@tokufxug

about 2 months ago

Facebookやインスタ、VRのQuestやAIグラスのRay-Ban MetaのMetaによる新研究「LCA」。 100万本の動画学習により、スマホ撮影から表情や指の動きまで精密な3Dアバターを一瞬で生成可能に。服の揺れや照明変化も自然に再現され、将来的にスタジオ級の分身がスマホで作れる世界を目指してる模様。

6

451

64

388

33K

psyth91 retweeted

A.I.Warper

@AIWarper

about 1 month ago

Sapiens2 test. All things considered, probably the best pose I have used. Still heavily reliant on a good bbox detection though. I am out of the loop and do not know what the "Best" bbox detection is these days.

10

250

28

185

41K

psyth91 retweeted

merve

@mervenoyann

25 days ago

Meta silently dropped Sapiens2 last week 🔥 a family of high-res models trained on 1B human images > for pose estimation, body-part segmentation, surface normals, pointmaps (sota) > 6 sizes: 0.1B → 5B params (all ViT patch 16) > high-res: 1024×768 and 4K

10

442

47

404

30K

Who to follow

International Conference on 3D Vision (3DV)

@3DVconf

Official account for International Conference on 3D Vision (3DV) #3DV2027 Dates: April 6-9, 2027 Place: Thessaloniki, Greece 🇬🇷

Georgios Pavlakos

@geopavlakos

Assistant Professor at UT Austin @UTCompSci | Working on Computer Vision and Machine Learning

Angjoo Kanazawa

@akanazawa

Assistant Professor at @Berkeley_EECS, @berkeley_ai. KAIR, @nerfstudioteam. Amazon Scholar @ FAR. Previously advised @WonderDynamics and @LumaLabsAI. she/her.

psyth91 retweeted

Massimiliano Viola ✈️ CVPR @massiviola01

2 days ago

There used to be a time when a novel architecture would do it... But today, data is the one dictating the rules!😬 @Meta Reality Labs solved dense human-centric tasks with specialized foundation models, proving that they still hold a massive advantage over generalist ones.

massiviola01's tweet photo. There used to be a time when a novel architecture would do it...

But today, data is the one dictating the rules!😬

@Meta Reality Labs solved dense human-centric tasks with specialized foundation models, proving that they still hold a massive advantage over generalist ones. https://t.co/bRtAHV8Sv2

1

34

5

18

3K

psyth91 retweeted

Wojciech Zielonka @w_zielonka

about 1 month ago

I am happy to share that our STAR has been accepted to Eurographics 2026: “How to Build Digital Humans?” It introduces a novel taxonomy and a concise overview of the full creation pipeline, from face and body to hands, garments, and hair. https://t.co/E8YsdKpQGF

w_zielonka's tweet photo. I am happy to share that our STAR has been accepted to Eurographics 2026:

“How to Build Digital Humans?”

It introduces a novel taxonomy and a concise overview of the full creation pipeline, from face and body to hands, garments, and hair.

https://t.co/E8YsdKpQGF https://t.co/6h5gzxnIku

1

73

17

35

7K

psyth91 retweeted

Astrid Wilde 🌞

@astridwilde1

about 1 month ago

Sapiens2 is the highest quality ViT backbone that now exists in the public domain. It was pretrained on the equivalent of 1/2 of all human images on Flickr. First public release by a large lab that is non-trivial to replicate. Huge public service. Well done.

astridwilde1's tweet photo. Sapiens2 is the highest quality ViT backbone that now exists in the public domain. It was pretrained on the equivalent of 1/2 of all human images on Flickr. First public release by a large lab that is non-trivial to replicate. Huge public service. Well done. https://t.co/agMVsbXnHA

4

518

45

393

58K

psyth91 retweeted

Rawal Khirodkar

@rawal_khirodkar

about 1 month ago

Introducing Sapiens2 — the next generation of our human-centric vision models Pretrained at scale and at high resolution, Sapiens2 learns human semantics more effectively without losing fidelity, and generalizes strongly across human vision tasks. Paper: https://t.co/c7uTv3NIBP Accepted at ICLR 2026 Code: https://t.co/6JUUJJrV7W Demo: https://t.co/etxRWJAexF

10

278

42

208

33K

Shunsuke Saito @psyth91

2 months ago

@TkfmTktm 恐縮です！自信作です！

0

1

0

190

psyth91 retweeted

Junxuan Li @JunxuanL

2 months ago

LCA is accepted at CVPR 2026! 🚀 We introduce a pre/post-training paradigm for 3D avatars (1M in-the-wild videos ➡️ studio data). The result? High-fidelity full-body avatars with emergent relightability and zero-shot stylization. Project: https://t.co/liddQrFIqF #CVPR2026

2

52

17

16

3K

psyth91 retweeted

Rawal Khirodkar

@rawal_khirodkar

2 months ago

Large-scale Codec Avatars: learning photorealistic avatars from millions of videos. A massive team effort, and incredibly proud of how it turned out. - Project: https://t.co/XMZWMDuI0P - Paper: https://t.co/iIMbGpGoc8 #CVPR2026

10

295

56

235

26K

psyth91 retweeted

Pablo Vela

@pablovelagomez1

5 months ago

I've been working a lot with SAM3 and the Momentum Human Rig (MHR). I finally integrated it into the data I'm working with @rerundotio. The progression I've taken looks as follows SAM3 + SAM3D-body on 1. a single image 2. a set of multiple images 3. a single video 4. A multiview video capture I took inspiration from the SAM3D-body paper and built a multiview fitting optimization pipeline. This pipeline involves using the 2D keypoints from the single-view pipeline, triangulating them, and employing an L1 loss between the 2D/3D keypoints. The temporal stability isn't great, so that's the next portion I'm going to focus on. One really frustrating thing about SAM3D-body is the lack of per-joint confidence values. It makes it harder to deal with occlusions. I'm probably going to need to use a separate model, or maybe add a confidence head.

9

448

44

327

43K

psyth91 retweeted

ゴメパパ

@gomessdegomess

6 months ago

毎年お馴染みlevelsfyiの年度末レポートがやってきたので気になるところだけまとめてくメリカのトップ給与動向のまとめ

1

40

6

23

29K

psyth91 retweeted

AI at Meta

@AIatMeta

6 months ago

SAM 3D is helping advance the future of rehabilitation. See how researchers at @CarnegieMellon are using SAM 3D to capture and analyze human movement in clinical settings, opening the doors to personalized, data-driven insights in the recovery process. 🔗 Learn more about SAM 3D: https://t.co/WAtASpkTdY

34

486

88

125

67K

psyth91 retweeted

AI at Meta

@AIatMeta

7 months ago

Introducing SAM 3D, the newest addition to the SAM collection, bringing common sense 3D understanding of everyday images. SAM 3D includes two models: 🛋️ SAM 3D Objects for object and scene reconstruction 🧑‍🤝‍🧑 SAM 3D Body for human pose and shape estimation Both models achieve state-of-the-art performance transforming static 2D images into vivid, accurate reconstructions. 🔗 Learn more: https://t.co/yXcvts8Ogc

129

6K

1K

4K

857K

psyth91 retweeted

Kris Kitani @kkitani

7 months ago

Super excited to share the release of SAM 3D. It's been a year in the making. Two models for lifting object and people to 3D!

9

166

11

24

16K

Shunsuke Saito @psyth91

7 months ago

これはICCVの専門家を名乗ってもいいのではないだろうか。そんなのあるのか知らないが

ResearchPort @ResearchPort

8 months ago

「ICCV2025」トップカンファレンス定点観測 vol.19 https://t.co/3inRQTU2Qn #ICCV2025

0

26

7

18

15K

0

26

0

7

7K

psyth91 retweeted

Jihyun Lee @jyun_leee

8 months ago

I have two exciting career updates to share! 😃 1️⃣ After memorable years at KAIST, I recently joined Meta as a Postdoctoral AI Research Scientist! I’m thrilled to be part of the Codec Avatars Lab, working with Shunsuke Saito (@psyth91) — one of the few researchers I admired most during my PhD years — and his amazing team. I’m genuinely super excited about the next-generation avatar project we’re pushing forward! 2️⃣ I’m currently attending ICCV 🏖 and will be giving a keynote talk at the HANDS workshop this afternoon. If you’re interested, please join the talk at 13:40 in room 305B. If you’d like to connect or chat outside of the talk, also feel free to drop me a message!

18

461

15

82

41K

psyth91 retweeted

David Park @park_jinhyung1

9 months ago

Introducing ATLAS: A high-fidelity, parametric human body model enabling precise, independent control of surface and skeletal attributes for character creation. To be presented at #ICCV2025! Learn more about ATLAS here: https://t.co/Iz4nnhm0rB

park_jinhyung1's tweet photo. Introducing ATLAS: A high-fidelity, parametric human body model enabling precise, independent control of surface and skeletal attributes for character creation. To be presented at #ICCV2025!

Learn more about ATLAS here:
https://t.co/Iz4nnhm0rB https://t.co/ubwBRylBZB

6

187

34

103

26K

Shunsuke Saito @psyth91

10 months ago

Want Gaussian Avatar on mobile? Turns out the bottleneck is decoding of pose correctives. At #SIGGRAPH2025, we present a simple yet highly effective solution. We make *any* Gaussian avatars mobile-ready via linear distillation and corrective sharing. 👉https://t.co/pet4CzPK4P

2

104

12

38

15K

psyth91 retweeted

Kuroko @_c_he_

10 months ago

📢 #SIGGRAPH2025 I'll be presenting our paper "3DGH: 3D Head Generation with Composable Hair and Face". Swing by and let's talk about hair and head generation! #Meta #Yale ⏰ Monday, Aug 11 | 2:00pm - 3:30pm PDT 📍 West Building, Rooms 301-305 🔗 https://t.co/PFhQbd4vIK

1

16

4

1

1K

Shunsuke Saito

@psyth91

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users