Rowan Zellers

24 days ago

We are so back!

24 days ago

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://t.co/AFJZ5kH7Ku

462

16K

2K

12K

8M

37

546

18

60

53K

16 days ago

Grants for research on interactivity, realtime video/audio full duplex evals, and safety

professor at Stanford, researcher at NVIDIA, adventurer at heart

16 days ago

We are offering grants of $100,000 + Tinker credits to researchers advancing the field of human-AI interactivity. Submit your proposals by June 19th! https://t.co/907HfBy7g3

51

2K

196

2K

590K

4

111

10

50

26K

rown retweeted

MichiganAI @michigan_AI

19 days ago

Big congratulations to Dr. @ziqiao_ma, well deserved! 🎉👏 Excited for your new chapter at @thinkymachines!

1

32

2

1

5K

rown retweeted

Martin Ziqiao Ma

@ziqiao_ma

24 days ago

P.S. The demo is basically my life at thinky: I start to cut coffee, @liliyu_lili is visually prompt-injecting my human intelligence with sweet snack every day, and I've gained weight since joining TML.

6

135

10

14

19K

Who to follow

Yejin Choi

@YejinChoinka

Yi Tay

@YiTayML

research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.

Bill Yuchen Lin

@billyuchenlin

RL for coding @xAI @SpaceX Affiliate Assistant Prof @UW. Ex: @allen_ai; Google, Meta FAIR.

rown retweeted

Zixian Ma@CVPR

@zixianma02

24 days ago

Congrats Rowan and Thinky team on the cool release! I remember you mentioned having a v different vision of multimodal interactions a few weeks ago @rown so this is what that looks like! 🆒 It’s exciting to see this release going beyond just a single model, showcasing truly different native multimodal interactions too. A couple things from the nicely written blog really resonate with me: 1. people are most effective when they can collaborate with AI the same way they do with other people 2. existing interfaces limit human inputs (esp multimodal ones) to the model, and this input limit needs to be lifted to unlock much better interactivity The blog also reminds me of the fun and challenging discussions with @shannonzshen and others on what “scaling collaboration” can look like. we made an initial attempt describing our vision: https://t.co/YEHvWeH7LR It’d be great to see more human centric evaluations of the model/system/interface too — looking forward to it🥂

zixianma02's tweet photo. Congrats Rowan and Thinky team on the cool release!

I remember you mentioned having a v different vision of multimodal interactions a few weeks ago @rown so this is what that looks like! 🆒

It’s exciting to see this release going beyond just a single model, showcasing truly different native multimodal interactions too.

A couple things from the nicely written blog really resonate with me:
1. people are most effective when they can collaborate with AI the same way they do with other people
2. existing interfaces limit human inputs (esp multimodal ones) to the model, and this input limit needs to be lifted to unlock much better interactivity

The blog also reminds me of the fun and challenging discussions with @shannonzshen and others on what “scaling collaboration” can look like. we made an initial attempt describing our vision: https://t.co/YEHvWeH7LR

It’d be great to see more human centric evaluations of the model/system/interface too — looking forward to it🥂

0

65

6

13

7K

rown retweeted

Mira Murati

@miramurati

24 days ago

We started Thinking Machines to advance human-AI collaboration, and this is our first bet on what that looks like. Most labs treat autonomy as the goal and interactivity as scaffolding around a turn-based core. We think the way we work with AI matters as much as how smart it is. Interactivity has to be in the model, and it has to scale with intelligence rather than trail behind it. https://t.co/U4c0uC7tnT

34

802

42

135

57K

rown retweeted

Lilian Weng

@lilianweng

24 days ago

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

lilianweng's tweet photo. In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book.

Turns out human-human collaboration is important to improving human-AI collaboration. 😊 https://t.co/kPucxYvXM4

43

934

47

284

171K

rown retweeted

Aurick Qiao

@aurickq

24 days ago

Very excited to share a preview of what we’ve been working on!

1

25

1

2K

rown retweeted

Long Lian

@LongTonyLian

24 days ago

Thinky’s new interaction models perform search in the background when listening and responding so you don’t notice! Also per request: Spoiler Alert 🚨

2

25

1

3

3K

Brandon Trabucco @brandontrabucco

24 days ago

@brandontrabucco @thinkymachines it's been super fun working together on human AI collaboration towards this release @brandontrabucco !

0

3

0

136

rown retweeted

Mu Cai

@MuCai7

24 days ago

My first share since joining @thinkymachines. Fun working with this team on real-time multimodal interaction. Vision in turn-based models felt like flipping through photos — continuous video is a different problem. Visual proactivity is essential — grateful to have worked on this alongside @liliyu_lili, @rown , and the rest of the team!

6

158

6

15

11K

rown retweeted

24 days ago

I'm excited to share some of our work at @thinkymachines. As models get more intelligent, the bottleneck is increasingly how quickly and seamlessly we can access their intelligence, and today we are sharing a preview of how we think about human-AI collaboration.

2

81

2

4

5K

24 days ago

@liliyu_lili @saurabh_garg67 @AndreaMadotto If you're interested in working on realtime video+speech specifically, or human AI collaboration more generally, please reach out!

0

25

1

4

1K

24 days ago

Our interaction model is the first general video+speech model that's visually proactive. It was super fun working on this with @liliyu_lili / @saurabh_garg67 / @AndreaMadotto and others - after countless versions it was amazing when visual interruptions suddenly worked!

Lili Yu

@liliyu_lili

24 days ago

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

8

74

5

4

16K

6

135

6

12

11K

rown retweeted

Lili Yu

@liliyu_lili

24 days ago

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

8

74

5

4

16K

rown retweeted

24 days ago

Lili and Martin get some help controlling themselves.

12

587

21

88

162K

24 days ago

@shaunralston https://t.co/LvvQJS7VOo

24 days ago

We are so back!

37

546

18

60

53K

1

3

0

113

rown retweeted