Tobias Weyand @0xtob - Twitter Profile

Tobias Weyand @0xtob

7 days ago

@AljosaOsep @ylecun According to Schmidhuber, every ML paper is stolen from Schmidhuber.

0

2

0

94

0xtob retweeted

Lucas Beyer (bl16)

@giffmana

13 days ago

Can't believe we're getting this before GTA 6

17

2K

83

321

227K

Tobias Weyand @0xtob

about 2 months ago

@giffmana Yeah, agents really take away many headaches from Linux. Another awesome use case I recently discovered is letting them deal with tricky git issues like resolving rebase conflicts.

1

2

0

194

Tobias Weyand @0xtob

7 months ago

@giffmana I recently let Gemini guide me through setting up a raspberry pi as a time machine backup server which was a total breeze! How much longer until we just give LLMs root and let them deal with all our sysadmin stuff?

0

31

Who to follow

Yash Patel

@yashvarpatel

Senior Research Scientist at FAIR - Meta SuperIntelligence Labs

Noah Snavely

@Jimantha

3D vision fanatic. Professor @cornell_tech & Researcher @GoogleDeepmind. He or they. https://t.co/m7Rs5xUFfG

Kwang Moo Yi

@kwangmoo_yi

Assistant Professor of Computer Science at the University of British Columbia. I also post my daily finds on arxiv. I also created https://t.co/mr7bj9zpDH

Tobias Weyand @0xtob

7 months ago

@ahmetius @SachitMenon Here's our video about the paper: https://t.co/nwmVaYcLkj

0

3

1

0

181

Tobias Weyand @0xtob

7 months ago

Attending #ICCV2025? Come chat with us about our Minerva dataset that tests if models can truly reason about videos! 🕵️‍♀️ @ahmetius and @SachitMenon will be presenting the dataset at Poster Session 5 tomorrow (Thurs, Oct 23) morning. Find them at poster #391.

Tobias Weyand @0xtob

about 1 year ago

We're excited to release Minerva 🕵️‍♀️, a benchmark to evaluate if AI can truly reason about videos, from spotting game-changing moments in sports 🏀 to understanding character motivations in short films 🍿. We provide the "why" behind the answers! Pointers below 👇

1

12

1

2

1K

1

5

3

0

2K

Tobias Weyand @0xtob

9 months ago

Our team is hiring! If you have experience in video understanding and/or generation, join us @GoogleDeepMind and help push the frontiers with Veo and Gemini!

Mikhail Sirotenko @sirotenko_m

9 months ago

We're hiring at @GoogleDeepMind! Looking for a talented Research Engineer to help build the future of Video generation and undrestanding (Veo and Gemini). Apply here: https://t.co/hYCj2jgvgw

0

3

1

0

369

0

1

0

130

Tobias Weyand @0xtob

10 months ago

@giffmana Same goes for punting. My daughter actually once responded to a question with "Sorry, I'm just a language model"

1

6

0

999

Tobias Weyand @0xtob

12 months ago

Excited that our Minerva and Neptune datasets are both featured in the Gemini 2.5 tech report! Minerva is among the most challenging video benchmarks with a large gap between SotA (Gemini 2.5 Pro, 67.6%) and humans (92.5%). https://t.co/mWROj5JXSz

Antoine Yang @AntoineYang2

12 months ago

The newly generally available Gemini 2.5 Flash and Pro are even better at video understanding than the versions we shared in the blog a month ago, see more details in the tech report 😀

AntoineYang2's tweet photo. The newly generally available Gemini 2.5 Flash and Pro are even better at video understanding than the versions we shared in the blog a month ago, see more details in the tech report 😀 https://t.co/dHuB8BlpHV

2

102

16

12K

0

6

1

0

384

0xtob retweeted

Boqing Gong @BoqingGo

12 months ago

Excited! VideoPrism-Base/Large are publicly available now: https://t.co/g5BNiA5O05 Check it out if you need a versatile video encoder for video-language or video-native tasks. Feedback appreciated!

0

21

3

5

2K

Tobias Weyand @0xtob

about 1 year ago

Gemini 2.5 Pro sets the state of the art on our newly released Minerva video reasoning benchmark by scoring 63.5%. 📜 Paper: https://t.co/nEWfr1SbqA 📊 Dataset: https://t.co/JengJVEgH6

JB Alayrac @jalayrac

about 1 year ago

A lot of work went to make Gemini 2.5 SOTA at video understanding, check out this 🧵 for more details! Looking back at where we were a year ago, the progress really feels phenomenal! So many things to unlock and enable from video 🎥 and we are only getting started!

5

146

11

29

35K

0

18

3

8

5K

Tobias Weyand @0xtob

about 1 year ago

Listen to the @agi_breakdown episode on Minerva here: https://t.co/oabPUYfYvw

0

3

0

106

Tobias Weyand @0xtob

about 1 year ago

We're excited to release Minerva 🕵️‍♀️, a benchmark to evaluate if AI can truly reason about videos, from spotting game-changing moments in sports 🏀 to understanding character motivations in short films 🍿. We provide the "why" behind the answers! Pointers below 👇

1

12

1

2

1K

Tobias Weyand @0xtob

about 1 year ago

The newly released Gemini 2.5 Pro (Preview 05/06) sets the state-of-the art on Minerva with 63.5% accuracy. Human accuracy is 92.5%. https://t.co/qrDY5qqp2P

1

3

0

176

Tobias Weyand @0xtob

over 1 year ago

Excited to share Long-Video Masked Autoencoder (LVMAE) our team just published at @NeurIPSConf! We boost the context length of video models using an adaptive decoder and a dual-masking strategy and achieve SotA on several video benchmarks. Paper: https://t.co/XeBME5RvFX

Google AI

@GoogleAI

over 1 year ago

Training video understanding models on longer contexts is computationally intensive. To address this, we present a novel approach that reduces the computational load while also improving the quality of the learned representations. More at: https://t.co/56Vj3kOzOl

GoogleAI's tweet photo. Training video understanding models on longer contexts is computationally intensive. To address this, we present a novel approach that reduces the computational load while also improving the quality of the learned representations. More at: https://t.co/56Vj3kOzOl https://t.co/f0PtWAVG7f

11

324

86

81

31K

0

4

0

321

Tobias Weyand @0xtob

over 1 year ago

Thank you @JeffDean , very much appreciate the boost! This is really a team effort with my amazing colleagues @NagraniArsha, Mingda Zhang, @raminia, Rachel Hornung, @nitesh_ai, @under_fitting, Austin Meyers, @zhouxy2017, @BoqingGo, @CordeliaSchmid, @sirotenko_m, @ZhuZhu66595.

Jeff Dean

@JeffDean

over 1 year ago

A nice new benchmark for long video understanding by Tobias Weyand @0xtob and others. This is likely to be one of the new frontiers of capabilities for large-scale multimodal models, and it's great to have a new benchmark to assess others in this area.

4

168

17

51

42K

0

27

6

4

17K

Tobias Weyand @0xtob

over 1 year ago

Excited that our work on Long video understanding is being featured by @GoogleAI !

Google AI

@GoogleAI

over 1 year ago

Can #AI truly understand long videos? Tobias Weyand & the Google Research team are testing the limits w/ Neptune, an open-source benchmark for long video understanding. Dive into the details & see how AI tackles temporal reasoning, cause & effect, & more →https://t.co/jNkgEYkdFA

11

165

31

52

51K

0

10

2

0

844

Tobias Weyand @0xtob

over 1 year ago

@ikostrikov

0

2

0

90

Tobias Weyand @0xtob

over 1 year ago

The other day I let my kids talk to Gemini live. Today my 3 year old asked my 6 year old: "Can you tell me a joke?" - 6 year old: "Sorry, I'm just a language model."

0

8

0

405

Tobias Weyand

@0xtob

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users