eisneim @eisneim - Twitter Profile

Pinned Tweet

about 1 year ago

I'm working on a Flux dev based model that can relight a photo conditioned on time (eg. 6AM, 7AM ) without changing the background unlike ic-light and LBM model

2

21

0

7

594

eisneim retweeted

Wildminder

@wildmindai

15 days ago

LTX-2.3 OmniCine V1 LoRA - Anatomy fix - Director controls: - Better lip-sync and facial nuance + it finally stops burnt-in subtitles from ruining the video. - Objects and characters don't warp when things get fast or chaotic. - Handles 2D Anime, 3D CGI, photorealism. https://t.co/i2eNhXLck8

3

206

19

203

9K

eisneim retweeted

Wildminder

@wildmindai

about 1 month ago

LTX2.3 Seamless Transitions IC-LoRA perfect loops every time https://t.co/fZkruqx1lV

0

73

6

56

4K

eisneim retweeted

Gorden Sun

@Gorden_Sun

about 1 month ago

Warp-as-History：仅用一条视频就能实现交互式视频生成用单条带标注的视频做轻量LoRA微调后，即可让通用视频模型实现跟随视角生成视频。项目里用的这条视频是来自DAVIS数据集里的car-roundabout.mp4。原理是：把相机轨迹产生的变形(warp)伪装成视频模型原生的"历史帧"输入，无需额外的相机编码器或控制分支，就能让预训练视频生成模型跟随指定视角运动。 Github：https://t.co/a2z3jwyS7H

5

118

17

88

11K

eisneim retweeted

Gong Junmin

@junmingong

about 1 month ago

Khala 1.0 just dropped — a music generation model from the Central Conservatory of Music in Beijing. Paper, code, weights, and demo all open-sourced. I gave a talk there recently on ACE-Step and got an early look at Khala. Excited to see it officially out. Open-source music gen is thriving. 💻 https://t.co/iYQt9e1mMy 📝 https://t.co/fqwqtvHfP1 🎧 https://t.co/XAxqLEYGft

15

463

77

435

26K

eisneim retweeted

Mickmumpitz @mickmumpitz

2 months ago

I've been working on a bigger AI VFX pipeline and needed audio-driven vid2vid lip sync for @LTXStudio LTX 2.3. Couldn't find a workflow for it, so I built this one in @ComfyUI. More examples, free guide and free workflows below! 👇

14

476

53

415

26K

eisneim retweeted

Kai He @Kai__He

2 months ago

We open-sourced the code and model for UniRelight! 🎉 Given an input video and a target lighting configuration, our method jointly predicts a relit video and its corresponding albedo. Code: https://t.co/4zF94saWvo Model: https://t.co/d8i66UyvhU

7

277

45

191

32K

eisneim retweeted

Wildminder

@wildmindai

about 1 month ago

Wan2.2 again. SwiftI2V: Efficient 2K I2V video gen with 21GB VRAM. - uses 200x less GPU-time than CineScale - exact image fidelity - decoupled processing no models yet. https://t.co/UmfRrwq3IY

1

226

20

225

16K

eisneim @eisneim

about 1 month ago

https://t.co/VVVriWJdT0 I converted SAM2.1 to mlx so you can run video segmentation on Mac using apple Silicon GPU, no cuda gpu needed

0

2

90

eisneim retweeted

Mickmumpitz @mickmumpitz

2 months ago

Another test with the LTX 2.3 vid2vid lip sync workflow. I've been finding the inpainting mode works more reliably overall, so I'd actually recommend turning it on even for close-ups.

4

145

12

101

9K

eisneim retweeted

Brie Wensleydale🧀🐭

@SlipperyGem

2 months ago

Yet another amazing-lookingIC lora for LTX 2.3 lands on the scene. Its v2v and text prompted. Does editing, removal, replacement and restyle. Personally, I would REALLY like to know if it can handle a first frame as a reference. I'm guessing now though. https://t.co/8Ymjmd0KQl

3

162

14

128

8K

eisneim retweeted

Purz.ai

@PurzBeats

2 months ago

LTX 2.3 IC LoRA - EditAnything by Alisson Pereira

9

102

7

56

8K

eisneim retweeted

A.Robot

@100PercentRobot

2 months ago

Just discovered frame injection in LTX-2.3, so of course I did something weird

12

150

9

77

13K

eisneim retweeted

⚡AI Search⚡

@aisearchio

about 2 months ago

Another open source image generator & editor LLaDA2.0-Uni https://t.co/i8ohSPHx1z

3

128

12

80

8K

eisneim @eisneim

2 months ago

https://t.co/9vrV2Kkedu i created a new Repo for faster and better image to video generation using LTX 2.3 with triple stage sampling

0

1

171

eisneim @eisneim

3 months ago

https://t.co/T2HPMrbtpy new video model: 15B-parameter, 40-layer Transformer that jointly processes text, video, and audio via self-attention only. No cross-attention, no multi-stream complexity. Achieves 80.0% win rate vs Ovi 1.1 and 60.9% vs LTX 2.3

0

5

1

2

168

eisneim retweeted

Tongyi Lab @Ali_TongyiLab

4 months ago

1/2 Qwen3.5 is here. The next frontier of Native Multimodal Agents is open. 🚀 We are thrilled to release Qwen3.5-397B-A17B, our flagship open-weight vision-language model. Built for the future of coding, reasoning, and seamless multimodal interaction. Key Highlights: Inference Efficiency: A massive 397B total parameters, but only 17B active—delivering flagship power at a fraction of the cost. Hybrid Architecture: Innovative Gated Delta Networks (Linear Attention) + Sparse MoE for extreme speed. True Multimodality: Exceptional performance across GUI interaction, video comprehension, and agentic workflows. Global Scale: Qwen3.5 now supports over 200 languages. Empowering developers and enterprises to build smarter, faster, and more versatile AI agents.

44

2K

184

369

450K

eisneim retweeted

Ivan Fioravanti ᯅ

@ivanfioravanti

4 months ago

OpenCode + MLX + Qwen3.5-397B-A17B-4bit. Video is 8x, but the goal is showing that It works! This is something unimaginable just few months ago. MLX Team is pushing like crazy and M5 Ultra will do the rest 🚀

24

520

46

283

49K

eisneim retweeted

Wildminder

@wildmindai

4 months ago

Capybara? 14B model for T2V, T2I, TV2V, TI2I. - based on HunyuanVideo1.5; - byt5-small, Glyph-SDXL-v2, SigLIP; - 480p-1080p; 16.7GB model, 5GB VAE.. mostly for video editing. https://t.co/N34iJ4gC0K

0

145

21

111

17K

eisneim retweeted

Gorden Sun

@Gorden_Sun

4 months ago

BitDance：字节大年初一开源的AI绘画模型最大的亮点是速度快，使用高压缩视觉分词器，将图像映射为紧凑的二值Token序列，并且每一步扩散过程并行预测64个Token。所以即使模型大小有14B，生成图片的速度也非常快。模型：https://t.co/6FtVbAU4uk Github：https://t.co/ux7k7xVvbA

Gorden_Sun's tweet photo. BitDance：字节大年初一开源的AI绘画模型
最大的亮点是速度快，使用高压缩视觉分词器，将图像映射为紧凑的二值Token序列，并且每一步扩散过程并行预测64个Token。所以即使模型大小有14B，生成图片的速度也非常快。
模型：https://t.co/6FtVbAU4uk
Github：https://t.co/ux7k7xVvbA https://t.co/KdS1nbW90Q

3

120

28

124

15K

eisneim retweeted

Wildminder

@wildmindai

5 months ago

Self-Refining Video Sampling: inference-time method using a video generator as its own refiner to correct physics and motion. no retraining needed; scores >70% human preference; is validated on Wan2.2 & Cosmos. https://t.co/NGdxcTUNeX

4

259

39

207

32K

eisneim

@eisneim

Last Seen Users on Sotwe

Trends for you

Most Popular Users