Lei Yang @drowsyleilei - Twitter Profile

Lei Yang @drowsyleilei

about 1 month ago

@wchengad 谢谢，还得继续改进，在琢磨GPT Image 2是怎么能做到在米粒上写字的

0

42

Lei Yang @drowsyleilei

about 1 month ago

Would be interesting to turn all the recipes in #HowToCook into highly detailed visual guides. 🥘📷 🤗U1 for infographic:https://t.co/IZfAgzFdRj 📖Cook101 for coder: https://t.co/tqIbBKuItg

drowsyleilei's tweet photo. Would be interesting to turn all the recipes in #HowToCook into highly detailed visual guides. 🥘📷 🤗U1 for infographic:https://t.co/IZfAgzFdRj
📖Cook101 for coder: https://t.co/tqIbBKuItg https://t.co/e1uBFhd59k

SenseTime @SenseTime_AI

about 1 month ago

📢📢 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗮𝗻 𝗲𝗻𝗵𝗮𝗻𝗰𝗲𝗱 𝗺𝗼𝗱𝗲𝗹 𝗳𝗼𝗿 𝗶𝗻𝗳𝗼𝗴𝗿𝗮𝗽𝗵𝗶𝗰 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻! 📊 Posters, charts, recipe cards, postcards — even arXiv-style pages — all from 𝗦𝗲𝗻𝘀𝗲𝗡𝗼𝘃𝗮-𝗨𝟭-𝟴𝗕-𝗠𝗼𝗧-𝗜𝗻𝗳𝗼𝗴𝗿𝗮𝗽𝗵𝗶𝗰. 🚀 +6.8 / +18.2 on BizGenEval (hard) / IGenBench (Q-ACC) over base U1, plus 100+ diverse showcases. 🤗 https://t.co/sYjY1ne3JX 🖼️Showcases: https://t.co/7K0zET8gOt Try it out — we'd love to see what you build! @huggingface

SenseTime_AI's tweet photo. 📢📢 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗮𝗻 𝗲𝗻𝗵𝗮𝗻𝗰𝗲𝗱 𝗺𝗼𝗱𝗲𝗹 𝗳𝗼𝗿 𝗶𝗻𝗳𝗼𝗴𝗿𝗮𝗽𝗵𝗶𝗰 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻!

📊 Posters, charts, recipe cards, postcards — even arXiv-style pages — all from 𝗦𝗲𝗻𝘀𝗲𝗡𝗼𝘃𝗮-𝗨𝟭-𝟴𝗕-𝗠𝗼𝗧-𝗜𝗻𝗳𝗼𝗴𝗿𝗮𝗽𝗵𝗶𝗰.

🚀 +6.8 / +18.2 on BizGenEval (hard) / IGenBench (Q-ACC) over base U1, plus 100+ diverse showcases.

🤗 https://t.co/sYjY1ne3JX

🖼️Showcases: https://t.co/7K0zET8gOt

Try it out — we'd love to see what you build!

@huggingface

9

32

9

25

5K

2

6

0

1

341

drowsyleilei retweeted

zhiqian_joy

@lnzhqin16

about 1 month ago

🚀We just released SenseNova-U1-8B-MoT-Infographic! It brings aesthetic posters, charts, comics, even arXiv paper to life with dense text!📈SOTA on infographic benchmarks!📊 🤗Weights: https://t.co/PqagRLIcW5 🎨100+ Examples: https://t.co/4pWOod37bv Hope you have fun with it !!✨

lnzhqin16's tweet photo. 🚀We just released SenseNova-U1-8B-MoT-Infographic!
It brings aesthetic posters, charts, comics, even arXiv paper to life with dense text!📈SOTA on infographic benchmarks!📊
🤗Weights: https://t.co/PqagRLIcW5
🎨100+ Examples: https://t.co/4pWOod37bv
Hope you have fun with it !!✨ https://t.co/HFteL4FkVZ

2

3

1

77

drowsyleilei retweeted

Dahua Lin @lindahua

about 1 month ago

Another model of the SenseNova U1 series dropped. This one is enhanced specifically for infographics. Enjoy.

0

4

3

0

485

Who to follow

Yushi LAN

@GROS17121524

Ph.D. in CV & Graphics

Jiaqi Wang

@wjqdev

Research Director at JoyAI

Brian Li

@Brian_Bo_Li

Brian is building, something new @amilabs Prev works LLaVA-OneVision/LMMs-Eval/LMMs-Engine/OneVision-Encoder.

drowsyleilei retweeted

Wang Ruisi @Rui147000038622

about 1 month ago

We’re also releasing SenseNova-SI-8M,currently the largest spatial intelligence QA dataset available, to support future research. If you’re also attending CVPR this June, happy to chat in person (Poster Session 2-ID 66)! SenseNova-SI-8M: https://t.co/29eggyDGsP

0

4

2

0

500

drowsyleilei retweeted

Dahua Lin @lindahua

about 1 month ago

Proud to announce the release of the SenseNova U1 Tech Report — together with the a new set of model weights based on MoE. We hope this open release promotes transparency, reproducibility, and further innovation across the AI community. Huge thanks to the team for making this possible. 🚀

1

23

6

0

7K

drowsyleilei retweeted

Zhongang Cai @CVPR'26

@caizhongang

about 1 month ago

Excited to have contributed to the spatial intelligence capabilities of SenseNova-U1, surpassing strong baselines such as Qwen3.5 on key benchmarks including VSI-Bench. We’re also thrilled to open-source SenseNova-SI-8M, which is currently the largest spatial QA dataset to date. See you at CVPR this June, happy to chat in person! SenseNova-SI-8M: https://t.co/AHHjw5omzU

0

13

6

3

3K

drowsyleilei retweeted

Haiwen Diao @paranioar

about 1 month ago

🔥 New week, New SenseNova-U1-A3B-MoT Drop — and this one goes Deep!🔥 Technical Report is OUT — the detailed disclosure of how to build Native Multimodal Unified Models. Inside: ✨ Near-Lossless Visual Interface (no VEs, no VAEs, no Deep Decoders) ✨ Joint AR + Pixel-space Flow Matching ✨ Native Mixture-of-Transformers Backbone ✨ Training recipe + RL post-training + Distillation  📣 Paper: https://t.co/erw1PKbabE 🦁 Github: https://t.co/ANlWRuTkx0 🌟 Models: https://t.co/tOvuNPAMlD 🎮 Demo: https://t.co/R6cOj4FL6d

paranioar's tweet photo. 🔥 New week, New SenseNova-U1-A3B-MoT Drop — and this one goes Deep!🔥

Technical Report is OUT — the detailed disclosure of how to build Native Multimodal Unified Models.

Inside:
✨ Near-Lossless Visual Interface (no VEs, no VAEs, no Deep Decoders)
✨ Joint AR + Pixel-space Flow Matching
✨ Native Mixture-of-Transformers Backbone
✨ Training recipe + RL post-training + Distillation
 📣 Paper: https://t.co/erw1PKbabE
🦁 Github: https://t.co/ANlWRuTkx0
🌟 Models: https://t.co/tOvuNPAMlD
🎮 Demo: https://t.co/R6cOj4FL6d

0

25

8

14

3K

drowsyleilei retweeted

Ziwei Liu

@liuziwei7

about 1 month ago

🥳The Technical Report of #SenseNovaU1 Released🥳 📜We openly share our journey and observations of building this SOTA *native unified multimodal model* for both understanding and generation. - Enjoy reading all the arch, data, training details👇 📄https://t.co/8IRFAmXJs5

liuziwei7's tweet photo. 🥳The Technical Report of #SenseNovaU1 Released🥳

📜We openly share our journey and observations of building this SOTA *native unified multimodal model* for both understanding and generation.

- Enjoy reading all the arch, data, training details👇
📄https://t.co/8IRFAmXJs5 https://t.co/7OHuR51k3B

2

156

24

71

24K

drowsyleilei retweeted

Ziwei Liu

@liuziwei7

about 1 month ago

🤩Excited to see that our native unified multimodal model #SenseNovaU1 is listed as *trending models* @huggingface - Code: https://t.co/U2uOLI2gIj - Model: https://t.co/pOfp37ZYe9

liuziwei7's tweet photo. 🤩Excited to see that our native unified multimodal model #SenseNovaU1 is listed as *trending models* @huggingface

- Code: https://t.co/U2uOLI2gIj
- Model: https://t.co/pOfp37ZYe9 https://t.co/2Dxo0YXI9J

1

49

6

13

5K

drowsyleilei retweeted

Dahua Lin @lindahua

about 1 month ago

🚀 SenseNova-U1 update (May 6) 8-step distilled LoRA that reduces the inference time by 10x, and ComfyUI support! Much easier to deploy and use now.

0

13

5

2

4K

drowsyleilei retweeted

Zhongang Cai @CVPR'26

@caizhongang

about 2 months ago

Sora, we are sorry to see you go! 😭 Wait, Seedance 2.0 is just as good at reasoning? 👍 Check out more: https://t.co/8mZkFlmc1J

0

4

1

2

637

drowsyleilei retweeted

Wang Ruisi @Rui147000038622

about 2 months ago

RIP Sora 2, Hello Seedance 2.0! Seedance 2.0 achieves competitive score on VBVR-Bench. Take an early peek at Seedance 2.0’s performance on video reasoning tasks. 👉 https://t.co/YruVq6iOWb

0

4

3

6

797

Lei Yang @drowsyleilei

about 2 months ago

Thanks @_akhaliq for sharing! We’ve been exploring how to enable more human-like, interleaved thinking across text and images. U1 is our first step—still far from perfect, but we’re excited to open it up and collaborate with the community to improve it together.🤗

drowsyleilei's tweet photo. Thanks @_akhaliq for sharing! We’ve been exploring how to enable more human-like, interleaved thinking across text and images. U1 is our first step—still far from perfect, but we’re excited to open it up and collaborate with the community to improve it together.🤗 https://t.co/CCpbJG8sSd

AK

@_akhaliq

about 2 months ago

SenseNova U1 is out on Hugging Face https://t.co/22fCKWGQzx

1

25

7

10

7K

0

4

0

1

165

drowsyleilei retweeted

Ziwei Liu

@liuziwei7

about 2 months ago

🔥Native Unified Multimodal Model Open Sourced🔥 🚀SenseNova U1🚀 is the first native multimodal model that unifies multimodal understanding, reasoning, and generation within a monolithic architecture. - Code: https://t.co/56vtl0e6up - Model @huggingface: https://t.co/6XJ8s2VYqt 🤩Try Now: https://t.co/RzAAFct7Qg

liuziwei7's tweet photo. 🔥Native Unified Multimodal Model Open Sourced🔥

🚀SenseNova U1🚀 is the first native multimodal model that unifies multimodal understanding, reasoning, and generation within a monolithic architecture.

- Code: https://t.co/56vtl0e6up
- Model @huggingface: https://t.co/6XJ8s2VYqt
🤩Try Now: https://t.co/RzAAFct7Qg

3

135

30

54

134K

drowsyleilei retweeted

Dahua Lin @lindahua

about 2 months ago

SenseNova U1 has been released and open-sourced. This is a milestone worth commemorating—not because we have built a perfect model, but because we have taken a critical step into a new era. We believe that logical reasoning and visual intuition will become deeply integrated, and that the barriers between digital intelligence and physical intelligence will ultimately be broken down. Perhaps when we look back two years from now, this step we take today will prove to have been truly significant.

1

10

3

1K

Lei Yang @drowsyleilei

about 2 months ago

🤯Amazed by Image-2/NB 2, we kept asking: what’s the path behind them? We see native unified multimodal models as promising. 🚀Today, we open-source SenseNova U1, with solid und. & gen.(esp. Infographic & Interleaved). Hope to improve with the community toward Agentic Learning.🤝

Haiwen Diao @paranioar

about 2 months ago

SenseNova U1 with NEO-Unify just dropped 👀 • 🚫 No VE / VAE • 🔗 End-to-end pixel–word modeling • 🧠 Native multimodal reasoning (efficient & unified) Moving from “multimodal integration” → “true unification” Strong signal toward the next paradigm. https://t.co/iTEyHh0N5m

paranioar's tweet photo. SenseNova U1 with NEO-Unify just dropped 👀

• 🚫 No VE / VAE
• 🔗 End-to-end pixel–word modeling
• 🧠 Native multimodal reasoning (efficient & unified)

Moving from “multimodal integration” → “true unification”

Strong signal toward the next paradigm.
https://t.co/iTEyHh0N5m https://t.co/RhhbrYLAof

1

127

26

88

28K

0

15

5

9

3K

drowsyleilei retweeted

Dahua Lin @lindahua

about 2 months ago

We promised to open the models built on top of NEO-Unify. Now, they are coming …

1

14

4

3

2K

drowsyleilei retweeted

Ziqi Huang

@ziqi_huang_

about 2 months ago

Presenting our poster on ViMoGen at #ICLR2026 @iclr_conf this afternoon! 🇧🇷 Stop by to chat about generalizable human motion generation, video generation, and the evaluation & data. 📍 Pavilion 4, P4-#3704 🕒 Today, 3:15 PM – 5:45 PM BRT 🔗 https://t.co/vwihLSW06N

ziqi_huang_'s tweet photo. Presenting our poster on ViMoGen at #ICLR2026 @iclr_conf this afternoon! 🇧🇷

Stop by to chat about generalizable human motion generation, video generation, and the evaluation & data.

📍 Pavilion 4, P4-#3704
🕒 Today, 3:15 PM – 5:45 PM BRT
🔗 https://t.co/vwihLSW06N https://t.co/zzC3GEK3Zm

0

29

7

3

4K

drowsyleilei retweeted

Wenjia Wang @WenjiaWang_HKU

4 months ago

🚀 Excited to share our #CVPR2026 paper: EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents. EmbodMocap, a portable yet affordable solution requiring only two moving iPhones—no calibrated multi-view camera studio, motion capture suits, or LiDAR sensors needed. With our fully automated optimization pipeline, you can effortlessly obtain high-precision scene meshes, human interaction motions, RGBD images, and camera parameters. The captured data is ready for training human-scene reconstruction models (like TRAM, pi3, etc.) and humanoid control policies (like deepmimic, AMP, etc.). What you need to do: 1. Borrow or buy two iPhone 12 Pros from eBay (600 USD in total). 2. Find 2 friends, then capture the sequences. 3. Deploy our repo, run our code, and get the results! The code and data will be released within 1 week. (Just come back to work from the Chinese Spring Festival, Happy Chinese New Year!) 📷 Project page: https://t.co/1soqR44lCJ 📷ArXiv: https://t.co/F2qp2deQ2m 📷Code: https://t.co/ZvdeiA3y8Z

10

285

47

196

17K

Lei Yang

@drowsyleilei

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users