Long Zhao @garyzhao9012 - Twitter Profile

24 days ago

Made by Grok Imagine, this is movie-trailer quality. Multimodal is for entire humanity, this is how humans perceive the world, connect with one another, and communicate ideas. Grok Imagine is built for that reality: not just to create stunning visuals, but to make imagination truly useful for everyone. It won’t be long before multimodal becomes truly indispensable.

30

206

42

8

26K

Long Zhao

@garyzhao9012

about 2 months ago

@archanfel_anoth 🐐🫡🫡

0

2

0

335

Long Zhao

@garyzhao9012

about 2 months ago

@JackCaiXun It's so sad to see you leave. Wish you all the best, Jack!

1

4

0

892

garyzhao9012 retweeted

Xuhui Jia

@jia_xuhui

2 months ago

We’re hiring experts in multimodal data crawling. please reach out if this fits your expertise.

3

117

7

14

37K

Who to follow

NEC Labs America

@NECLabsAmerica

@NEC Labs America delivers high-impact #technology #research. Located in Princeton, NJ & San Jose, CA. #AI #MachineLearning #DataScience #OpticalNetworking

Zifeng Wang

@ZifengWang315

Research Scientist @Google, PhD in Machine Learning @Northeastern. Large Language Models, Continual learning, Data & Parameter-efficient learning.

Xuhui Zhou

@nlpxuhui

PhD student @LTIatCMU. Previously, @openhandsdev, @allen_ai, @UWNLP, @Apple, @UCBerkeley; Social Intelligence in language +X.

garyzhao9012 retweeted

Xuhui Jia

@jia_xuhui

5 months ago

Nano Banana has truly redefined what's possible with image generation models, pushing the boundaries of people's imagination when it debuted Today, we're excited to introduce Grok-Imagine-Image: a new model that's both faster and better than Nano Banana. Through this journey, we've built many of the essential building blocks needed to unlock the next generation of models and to keep fueling the growth and prosperity of the visual AI community. Stay tuned... something incredible is coming very soon! But today, hello world, grok-imagine-image!

41

327

29

59

163K

garyzhao9012 retweeted

Google Research

@GoogleResearch

about 1 year ago

At 4:00 today, stop by the #CVPR2025 Google booth where Ting Liu will demo a model for video creation by demonstration that can generate physically plausible video that continues naturally given a context scene. Find sample videos at https://t.co/VmfjfuxDgR

1

36

4

3K

Long Zhao

@garyzhao9012

over 1 year ago

We are delighted to share our latest work: Video Creation by Demonstration (https://t.co/7TQOWA9tND)! See our intereseting results here: https://t.co/vjLBnwmYbF

Ting Liu

@_tingliu

over 1 year ago

Introducing our latest work Video Creation by Demonstration, a novel video creation experience. Paper: https://t.co/YZFCLKj5aM Project: https://t.co/o9inp7qScE Huggingface: https://t.co/Lg5h7kvr70

0

34

7

14

15K

0

6

0

1

222

Long Zhao

@garyzhao9012

over 1 year ago

Happy to share our recent work "Epsilon-VAE", an effective autoencoder that turns single-step decoding into a multi-step probabilistic process. Please check our paper for more detailed results! arXiv page: https://t.co/TcZf6FzyX6

0

12

3

1

1K

Long Zhao

@garyzhao9012

about 2 years ago

Super excited to be featured by Google AI! We are also happy to share that our VideoPrism paper has been accpted by ICML 2024. Looking forward to meeting you guys in Vienna! Paper: https://t.co/to6je51VsR Blog: https://t.co/fOgROlqN4b

Google AI

@GoogleAI

about 2 years ago

Introducing Long Zhao, a Senior Research Scientist at Google, who worked to build VideoPrism: A Foundational Visual Encoder for Video Understanding. Read the blog to explore innovations in video understanding tasks and more →https://t.co/MnfeIMAohS

20

382

60

129

130K

0

5

0

1

416

Long Zhao

@garyzhao9012

over 2 years ago

Our team will present our paper "Unified Visual Relationship Detection with Vision and Language Models" (https://t.co/Vtqy2I3PLi) at #ICCV2023 in Paris next week. Please join our poster session on Wednesday (Oct. 4th, 2023) 02:30 PM-04:30 PM to learn more!

0

4

1

2

259

garyzhao9012 retweeted

Honglu Zhou @zhou_honglu

almost 3 years ago

📢 Our #SMART101 challenge is now open! 🎉 Join the brightest minds in multimodal reasoning and cognitive models of intelligence to drive AI progress. 🚀 Don't miss out! Challenge closes on Sept. 1. Winning teams will receive prizes! 🏆 https://t.co/asTC5oscJh #VLAR #ICCV2023 #AI

zhou_honglu's tweet photo. 📢 Our #SMART101 challenge is now open! 🎉 Join the brightest minds in multimodal reasoning and cognitive models of intelligence to drive AI progress. 🚀 Don't miss out! Challenge closes on Sept. 1. Winning teams will receive prizes! 🏆 https://t.co/asTC5oscJh
#VLAR #ICCV2023 #AI https://t.co/NoPXla90XP

1

17

20

1

3K

garyzhao9012 retweeted

Google AI

@GoogleAI

over 4 years ago

The Visual Transformer has helped advance many core computer vision applications, e.g., image classification, but training can be inefficient and models lack interpretable designs. Learn how the Nested Hierarchical Transformer addresses these challenges → https://t.co/JGYUJzW7BL