Vincent François-Lavet @VinFL - Twitter Profile

Vincent François-Lavet @VinFL

16 days ago

@no_reward_for_u Also related to this paper for additional ref: https://t.co/61aMR9NAu9

0

1

0

127

VinFL retweeted

Google DeepMind @GoogleDeepMind

29 days ago

Build your next story with Gemini Omni.

51

1K

172

284

213K

VinFL retweeted

Rony

@Ronycoder

about 2 months ago

Instead of watching an hour of Netflix, watch this 30-minute speech by the Head of Anthropic’s Coding Agents research team. It will teach you more about vibe coding than 100 paid courses.

53

3K

342

8K

465K

VinFL retweeted

Anthropic

@AnthropicAI

2 months ago

You can read a detailed technical report on the software vulnerabilities and exploits discovered by Claude Mythos Preview here: https://t.co/AgU6ltV2qW

77

2K

192

642

703K

Who to follow

Marc G. Bellemare

@marcgbellemare

Modelling @ Cohere. Ex RL research lead at Google Brain, DeepMind. Textbook author. Co-founder, Reliant AI.

Jakob Foerster

@j_foerst

Associate Prof in ML @UniofOxford. Something Something Research Scientist @MetaAI. Something @FLAIR_Ox. Always #teamhuman. Opinions belong to the world.

Katja Hofmann

@katjahofmann

At Microsoft Research. Lead of https://t.co/c1veO6CHsI - we drive innovation in machine learning with applications in games. https://t.co/Z7M9atHCja Board.

VinFL retweeted

Demis Hassabis

@demishassabis

3 months ago

Gemma 4 outperforms models over 10x their size! (note the x-axis is log scale!)

145

3K

237

297

218K

VinFL retweeted

Jeff Dean

@JeffDean

3 months ago

Today we're releasing Gemma 4, our new family of open foundation models, built on the same research and technology as our Gemini 3 series. These models set a new standard for open intelligence, offering SOTA reasoning capabilities from edge-scale (2B and 4B w/ vision/audio) up to a 26B parameter MoE model and a 31B dense model. By releasing Gemma 4 under the Apache 2.0 license, we hope to enable more innovation across the research and developer communities. Our earlier Gemma 3 models were downloaded 400M times and over 100,000 variants of those models have been published, so we're excited to see what the community will do with the even better Gemma 4 models! Learn more at https://t.co/BW6O3Gr8bc and https://t.co/8M0XSQSP4u Great work by everyone involved! #Gemma4 #AI #OpenSource #ML

55

1K

177

176

100K

VinFL retweeted

Jacob Freeman

@GeForce_JacobF

3 months ago

DLSS 5 is completely mind blowing. The neural rendering model with photoreal lighting and materials is a generation step up in visual fidelity. Gaming with DLSS 5 feels like future tech, but its possible now. It is truly incredible. 🤯

326

1K

91

210

413K

VinFL retweeted

Bryan Catanzaro

@ctnzr

3 months ago

Announcing NVIDIA Nemotron 3 Super! 💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell 💚36 on AAIndex v4 💚up to 2.2X faster than GPT-OSS-120B in FP4 💚Open data, open recipe, open weights Models, Tech report, etc. here: https://t.co/CAYpP1iK3i And yes, Ultra is coming!

ctnzr's tweet photo. Announcing NVIDIA Nemotron 3 Super!

💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell
💚36 on AAIndex v4
💚up to 2.2X faster than GPT-OSS-120B in FP4
💚Open data, open recipe, open weights

Models, Tech report, etc. here:
https://t.co/CAYpP1iK3i

And yes, Ultra is coming! https://t.co/QuguMQaC8S

62

1K

205

452

208K

VinFL retweeted

ARC Prize

@arcprize

4 months ago

Gemini 3 Deep Think (2/26) Semi Private Eval - ARC-AGI-1: 96.0%, $7.17/task - ARC-AGI-2: 84.6% $13.62/task New ARC-AGI SOTA model from @GoogleDeepMind

arcprize's tweet photo. Gemini 3 Deep Think (2/26) Semi Private Eval

- ARC-AGI-1: 96.0%, $7.17/task
- ARC-AGI-2: 84.6% $13.62/task

New ARC-AGI SOTA model from @GoogleDeepMind https://t.co/mN8PFAWk4A

55

2K

165

255

267K

VinFL retweeted

Waymo @Waymo

5 months ago

The age of autonomous mobility at scale is here. Waymo has raised $16B to bring the world’s most trusted driver to more cities. ✅ $126B valuation ✅ 20M+ lifetime rides ✅ 90% reduction in serious injury crashes Read more from our co-CEOs: https://t.co/Fc5I33WpYB

101

745

115

72

249K

VinFL retweeted

Ankesh Anand

@ankesh_anand

6 months ago

"how can flash beat pro??" -> the answer is RL! flash is not just a distilled pro. we've had lots of exciting research progress on agentic RL which made its way into flash but was too late for pro. can't wait to finally bring them to pro👀

113

4K

257

818

954K

VinFL retweeted

Oleksii Kuchaiev

@kuchaev

6 months ago

One under-appreciated (so far) aspect of Hybrid-MoE architecture such as in Nemotron 3, is that it is a better fit for reasoning. Its throughput advantage over “plain” transformer grows with batch size and generation length. Which is what happens in reasoning RL loops.

kuchaev's tweet photo. One under-appreciated (so far) aspect of Hybrid-MoE architecture such as in Nemotron 3, is that it is a better fit for reasoning. Its throughput advantage over “plain” transformer grows with batch size and generation length. Which is what happens in reasoning RL loops. https://t.co/CQU2NWtl5M

2

55

6

11

3K

VinFL retweeted

Bryan Catanzaro

@ctnzr

6 months ago

Today, @NVIDIA is launching the open Nemotron 3 model family, starting with Nano (30B-3A), which pushes the frontier of accuracy and inference efficiency with a novel hybrid SSM Mixture of Experts architecture. Super and Ultra are coming in the next few months.

ctnzr's tweet photo. Today, @NVIDIA is launching the open Nemotron 3 model family, starting with Nano (30B-3A), which pushes the frontier of accuracy and inference efficiency with a novel hybrid SSM Mixture of Experts architecture. Super and Ultra are coming in the next few months. https://t.co/v7MKIy7Oe4

41

1K

221

400

506K

VinFL retweeted

Anirudha Majumdar

@Majumdar_Ani

6 months ago

Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator https://t.co/ZjvpYXFddZ 🧵👇

27

580

88

298

237K

VinFL retweeted

JFPuget 🇫🇷🇺🇦🇨🇦🇬🇱

@JFPuget

6 months ago

Some people may be disappointed that we won the @arcprize competition with a LLM (QWEN 4B) and not with a more reasoning oriented or code oriented method. I'd answer this: @kaggle compute limits forces us to focus on efficiency. We tried reasoning models but they were too slow. Same for code generation at test time. We had to move the heavy lifting (generative LLM for puzzle understanding + code gen) to pretraining. We used heavy models there: Claude and GPT OSS 120B. We still used some fancy scaffolding at test time, including the ARChitects decoding and TRM. But no reasoning nor code generation. If we relax the Kaggle compute limits then we can, and maybe should, use both reasoning and code generation at test time. But this comes at two orders of magnitude more cost.

3

187

16

52

18K

VinFL retweeted

ARC Prize

@arcprize

6 months ago

Announcing the ARC Prize 2025 Top Score & Paper Award winners The Grand Prize remains unclaimed Our analysis on AGI progress marking 2025 the year of the refinement loop

arcprize's tweet photo. Announcing the ARC Prize 2025 Top Score & Paper Award winners

The Grand Prize remains unclaimed

Our analysis on AGI progress marking 2025 the year of the refinement loop https://t.co/Lbap0VVFs9

25

311

48

122

223K

VinFL retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

11 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference "Seed Diffusion Preview achieves an inference speed of 2,146 token/s over H20 GPUs while maintaining competitive performance across a sweep of standard code evaluation benchmarks, significantly faster than contemporary Mercury and Gemini Diffusion, establishing new state of the art on the speed-quality Pareto frontier for code models."

iScienceLuvr's tweet photo. Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

"Seed Diffusion Preview achieves an inference speed of 2,146 token/s over H20 GPUs while maintaining competitive performance across a sweep of standard code evaluation benchmarks, significantly faster than contemporary Mercury and Gemini Diffusion, establishing new state of the art on the speed-quality Pareto frontier for code models."

2

254

46

104

15K

VinFL retweeted

Graham Neubig

@gneubig

11 months ago

Summary of GPT-OSS architectural innovations: 1. sliding window attention (ref: https://t.co/m0iS23mN8y) 2. mixture of experts (ref: https://t.co/1F6KtdfDg0) 3. RoPE w/ Yarn (ref: https://t.co/5P2jUcxo4E) 4. attention sinks (ref: streaming llm https://t.co/vZw7q2dplG)

11

2K

349

2K

119K

VinFL retweeted

OpenAI

@OpenAI

11 months ago

We released two open-weight reasoning models—gpt-oss-120b and gpt-oss-20b—under an Apache 2.0 license. Developed with open-source community feedback, these models deliver meaningful advancements in both reasoning capabilities & safety. https://t.co/PdKHqDqCPf

315

10K

2K

3M

VinFL retweeted

Demis Hassabis

@demishassabis

11 months ago

Official results are in - Gemini achieved gold-medal level in the International Mathematical Olympiad! 🏆 An advanced version was able to solve 5 out of 6 problems. Incredible progress - huge congrats to @lmthang and the team! https://t.co/pp9bXF7rVj

193

6K

732

605

1M

Vincent François-Lavet

@VinFL

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users