Lu Jiang @roadjiang - Twitter Profile

9 months ago

How do we generate videos on the scale of minutes, without drifting or forgetting about the historical context? We introduce Mixture of Contexts. Every minute-long video below is the direct output of our model in a single pass, with no post-processing, stitching, or editing. 1/4

22

604

98

473

156K

Lu Jiang @roadjiang

12 months ago

@OverPowere13959 @CeyuanY This one is https://t.co/OkJXplb8Bo Others did not get the approval.

0

1

0

68

Lu Jiang @roadjiang

12 months ago

@katedeyneka Thank you for attending. I am glad that you like it.

0

1

0

54

roadjiang retweeted

Ceyuan Yang

@CeyuanY

about 1 year ago

Glad to share Seaweed-7B, a cost-effective foundation model for video generation. Our tech report highlights the key designs that significantly improve compute efficiency and performance given limited resources, achieving comparable quality against other industry-level models. To unleash the power of the foundation model, Seaweed-7B further enables a wide range of downstream applications including image-to-video generation, human video generation, subject-consistent video generation, video-audio joint generation, long video generation and storytelling, real-time generation, super-resolution generation, camera controlled generation. Check out our webpage and report for more details: Webpage: https://t.co/5s9Af4FQCb Paper: https://t.co/GHVs4cvELt It's a wonderful journey of the last year. Thanks to all teammates for their contributions, sincerely.

34

512

96

273

77K

Who to follow

Jie Huang

@jefffhj

Building intelligence @xAI. Grok-2🍍, 3🍫, 4🫐, Video Gen🪄. PhD from UIUC CS.

Jordi Pont-Tuset

@jponttuset

Research Scientist @ Google DeepMind Zürich.

Hector Liu

@waterluffy

Building Institute of Foundation Models (https://t.co/eSDH1CD8ly) Views Mine. @LLM360, LLM, (del)NLP, Computational Linguistic(del)

Lu Jiang @roadjiang

about 1 year ago

@rtk254 Ronen, interesting discussion! We recently have a work showing that training on synthetically generated CGI videos can indeed help models learn to generate videos that better respect physical constraints: https://t.co/HmOmX3uMEP @ronen

0

2

0

236

Lu Jiang @roadjiang

about 1 year ago

@_akhaliq Thanks for posting the video from our work. More information can be found at: https://t.co/hPYMIM7bnL

1

10

2

7

9K

Lu Jiang @roadjiang

about 1 year ago

@dreamingtulpa Thanks for reporting our work and discussion. Like mentioned in the paper's abstract: while the model still lacks a deep understanding of physics, it offers one of the first empirical demonstrations that synthetic video enhances physical fidelity in video synthesis.

0

57

roadjiang retweeted

AK

@_akhaliq

about 1 year ago

Synthetic Video Enhances Physical Fidelity in Video Synthesis A turtle swimming in a green background. + video matting illustration

4

99

14

36

17K

roadjiang retweeted

Ceyuan Yang

@CeyuanY

about 1 year ago

We propose Long Context Tuning (LCT) for scene-level video generation to bridge the gap between current single-shot generation and real-world narrative video productions. Homepage: https://t.co/1kA5LrNY8W Report: https://t.co/8GF2hTSOXn

4

104

23

43

47K

roadjiang retweeted

Junfei Xiao

@never1andd

over 1 year ago

Want the deep dive? • arXiv: https://t.co/2HLdzMyDEH • Project Page: https://t.co/AUhbJmrwem See how VideoAuteur + CookGen are shaping long narrative video generation. Big shout out to my co-authors and advisors: @fncheng2333 @liangkegui @YuilleAlan @roadjiang

0

2

1

0

494

roadjiang retweeted

AK

@_akhaliq

over 1 year ago

Seaweed APT Diffusion Adversarial Post-Training for One-Step Video Generation Existing diffusion and autoregressive generative models require repeated neural network evaluations. It is extremely slow for the high-resolution video generation task, as a few-second video can take many minutes to generate. Our work is the first to demonstrate the generation of an entire video using a single neural function evaluation (1NFE) by using our proposed adversarial post-training technique. Our model generates 2 seconds of 1280x720 24fps videos in real-time. We showcase some of the results below:

9

203

34

116

21K

Lu Jiang @roadjiang

almost 2 years ago

@windx0303 @IJCAIconf promising field

0

39

Lu Jiang @roadjiang

over 2 years ago

Interesting comparison between our VideoPoet and other competitive models. The comparison is incredibly helpful and reinforces my belief that VideoPoet excels in generating larger motions. We know the exact reasons for this and are working on improving single frame quality.

Anu Aakash @anuaakash

over 2 years ago

Google VideoPoet, Runway, Pika & Genmo Google recently announced Video Poet. Google's VideoPoet is a large language model (LLM) that is capable of a wide variety of video generation tasks, including: - text-to-video - image-to-video - video stylization - video inpainting and outpainting - video-to-audio. I tried some of their text-to-image prompts (from their demo) in Pika, Runway and Genmo. Here are the results: 10 examples 1/10 Two teddy bears holding hands, walking down rainy 5th avenue.

11

400

108

331

50K

0

6

0

1

1K

Lu Jiang @roadjiang

over 2 years ago

@anuaakash VideoPoet co-author here. Thanks a ton! Due to policy constraints, we weren't able to perform such comparisons. Your analysis is incredibly helpful and reinforces my belief that VideoPoet excels in creating larger motions. Its per frame quality can be further improved.

1

0

36

Lu Jiang @roadjiang

over 2 years ago

Excited to be at #NeurIPS2023 this week! Can't wait to reconnect with colleagues and make new connections. If you're up for a coffee chat, feel free to reach out. Find me at our spotlight/posters. https://t.co/QHpEz66JyP Tue 12 5:15 p.m. https://t.co/mFX52fktOm Wed 13 10:45 a.m

0

5

1

0

904

roadjiang retweeted

Agrim Gupta

@agrimgupta92

over 2 years ago

We introduce W.A.L.T, a diffusion model for photorealistic video generation. Our model is a transformer trained on image and video generation in a shared latent space. 🧵👇

49

1K

247

625

431K

Lu Jiang @roadjiang

over 2 years ago

#plagiarism #AcademicTwitter #aaai

0

1

0

436

Lu Jiang @roadjiang

over 2 years ago

😲While preparing the meta-review for #aaai24, I stumbled upon a new form of parallelism. It wasn't about the paper's concepts, but rather in the review comments, where two reviewers listed identical comments, word for word, over 200 matching words. #PeerReview #AIResearch

1

4

0

853

Lu Jiang @roadjiang

almost 3 years ago

📢 Call for Papers! International Journal of Computer Vision (IJCV) invites submissions for its special issue on "Generative Models for Content Creation and Manipulation." 🗓️ Manuscript Submission Deadline: February 28, 2024 🔗 Check it out here: https://t.co/S81We2MU7E

0

4

1

0

534

Lu Jiang @roadjiang

almost 3 years ago

@k_saifullaah It seems relevant and a common problem we can try to reduce the human-supervision. Thanks for sharing!

0

1

0

104

Lu Jiang @roadjiang

almost 3 years ago

Fascinating research by Google reveals the power of Language Models (LLMs) like PaLM or GPT in tackling visual tasks using in-context learning. This novel method enables LLMs to perform image generation tasks without requiring any parameter updates. #palm #GPT4 #LLMs

roadjiang's tweet photo. Fascinating research by Google reveals the power of Language Models (LLMs) like PaLM or GPT in tackling visual tasks using in-context learning. This novel method enables LLMs to perform image generation tasks without requiring any parameter updates. #palm #GPT4 #LLMs https://t.co/m2EiDj35rv

2

249

67

174

150K

Lu Jiang

@roadjiang

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users