Sumith Kulal @sumith1896 - Twitter Profile

Sumith Kulal

@sumith1896

2 days ago

@anupamsidhant @bfl_ai 🌲🌲🌲

0

3

0

215

Sumith Kulal

@sumith1896

2 days ago

Excited to have Martin Scorsese as an advisor. Stories will always have to be personal, but they can be woven together with new tools. That's what we strive to build at Black Forest Labs.

Black Forest Labs @bfl_ai

3 days ago

Martin Scorsese is an advisor to Black Forest Labs. He's spent six decades shaping how the world sees stories. Now he's helping us shape visual intelligence with human taste and craft at the center. We sat down with him for a working storyboarding session using FLUX.

298

3K

287

1K

3M

15

47

4

6

4K

sumith1896 retweeted

Robin Rombach

@robrombach

3 days ago

Seeing Martin Scorsese using FLUX for storyboarding and scene exploration was absolutely insane. Experiencing how one of the absolute masters of cinema & filmmaking uses the technology that we developed, his curiosity and creativity, and the way he prompted our models, was humbling. I am grateful to call Martin Scorsese an advisor to BFL, and to explore the next, multimodal and interactive phases of visual AI with him.

51

1K

124

665

85K

Sumith Kulal

@sumith1896

5 days ago

@baaadas @LumaLabsAI great run Jiaming!

0

1

0

881

Sumith Kulal

@sumith1896

7 days ago

@AnjneyMidha @amppublic @AnthropicAI @DarioAmodei @NotTomBrown 🔥🔥🔥

0

70

sumith1896 retweeted

Anjney Midha

@AnjneyMidha

24 days ago

today, we @amppublic are announcing a $500M profit pool we have set aside to help local communities navigate the AI transition over the next few years

we'd love to hear ideas for the best way to distribute these funds

anyone can submit here: https://t.co/dpFyZldbXy https://t.co/vjr6YYtHtx

28

491

32

177

42K

Sumith Kulal

@sumith1896

2 months ago

success and competence need repetition and volume.

John Coogan

@johncoogan

about 1 year ago

Five years ago, I recorded my first YouTube video. Today, I’m going full-time on @tbpn. My time at Founders Fund was incredible. Here’s 10,000 hours in 48 seconds:

240

2K

54

373

444K

0

7

0

1

742

Sumith Kulal

@sumith1896

2 months ago

@dps @hbarra @alcor @dreamer @natfriedman Congrats @dps, excited for the team!

0

876

sumith1896 retweeted

Anjney Midha

@AnjneyMidha

3 months ago

https://t.co/enjUwUDRij

53

609

69

470

272K

sumith1896 retweeted

Patrick Esser

@pess_r

3 months ago

Fixed vision encoders like DINO have driven impressive progress in more learnable representations for generative modeling - but there is no universal variant across modalities, and they do not scale with the generative model.

We introduce our self-supervised framework, Self-Flow, that builds learnability directly into flow models, working in a unified and scalable way across image, video and audio.

Particularly excited about the gains on video-action prediction: Beyond the overall success rate improving substantially, more complex tasks - like "Open and Place" - see some of the clearest gains. So many interesting research questions to explore to make 🤖 go brrr

Super glad to be working with my amazing colleagues @hila_chefer, Dominik, @dustin_podell, Vikash, @Vinh_Suhi, Antonio and @robrombach - as well as the whole @bfl_ml team!

arxiv: https://t.co/eP7ip58Tff
project page: https://t.co/GNShpBMEQ1

4

225

24

138

36K

Sumith Kulal

@sumith1896

3 months ago

@iamsashasax @AnthropicAI Congrats Sasha!!!

0

1

0

173

sumith1896 retweeted

Hila Chefer

@hila_chefer

3 months ago

New research from @bfl_ml 🥳
Meet Self-Flow: our self-supervised framework for image, audio, video & world models 🤖
https://t.co/AshY8IkSEe

Do generative models really need DINO to learn strong representations? We propose teaching them directly via a joint framework instead 🧵 https://t.co/wofHy9mmGT

11

280

61

109

67K

Sumith Kulal

@sumith1896

3 months ago

hot off the press — representation learning done right 🚀

Black Forest Labs @bfl_ai

3 months ago

We present a research preview of Self-Flow: a scalable approach for training multi-modal generative models.

Multi-modal generation requires end-to-end learning across modalities: image, video, audio, text - without being limited by external models for representation learning. Self-Flow addresses this with self-supervised flow matching that scales efficiently across modalities.

Results:
• Up to 2.8x faster convergence across modalities.
• Improved temporal consistency in video
• Sharper text rendering and typography

This is foundational research for our path towards multimodal visual intelligence.

15

900

136

514

146K

0

17

0

1K

Sumith Kulal

@sumith1896

3 months ago

@weihua916 @Anthropic woohoo, congrats Weihua!

0

1

0

923

Sumith Kulal

@sumith1896

3 months ago

future is wild, really excited for coding and agentic workflows with blazing fast speed ⚡️ kudos to @StefanoErmon @adityagrover_ @volokuleshov and the whole @_inception_ai team for pushing hard on the new paradigm!

Stefano Ermon

@StefanoErmon

3 months ago

Mercury 2 is live 🚀🚀 The world’s first reasoning diffusion LLM, delivering 5x faster performance than leading speed-optimized LLMs. Watching the team turn years of research into a real product never gets old, and I’m incredibly proud of what we’ve built. We’re just getting started on what diffusion can do for language.

319

4K

576

2K

1M

4

9

0

810

Sumith Kulal

@sumith1896

3 months ago

@StefanoErmon congrats team, very impressive!

0

1

0

81

sumith1896 retweeted

fal @fal

6 months ago

🚨 FLUX.2 from @bfl_ml is here on fal - day 0 release! 🎨 Generate and edit images with incredible quality 🎯 HEX codes and JSON prompts for better control 🎬 Pro and Flex: High-fidelity images and text rendering ⚡ Dev: LoRA training for customization https://t.co/N1Ezd8zhZO

21

265

35

71

112K
