Vinay @vinaygode - Twitter Profile

Vinay

@vinaygode

3 days ago

Amazing work in cv

SkalskiP

@skalskip92

4 days ago

supervision just hit 40,000 GitHub stars! it now powers over 6.5k open-source computer vision projects, including all my demos like basketball AI link: https://t.co/xXMRaS4ejS

53

6K

421

8K

870K

0

1

0

53

Vinay

@vinaygode

about 2 months ago

https://t.co/NPDNd0AoqH

How To AI

@HowToAI_

about 2 months ago

Yann LeCun was right the entire time. And generative AI might be a dead end. For the last three years, the entire industry has been obsessed with building bigger LLMs. Trillions of parameters. Billions in compute. The theory was simple: if you make the model big enough, it will eventually understand how the world works. Yann LeCun said that was stupid. He argued that generative AI is fundamentally inefficient. When an AI predicts the next word, or generates the next pixel, it wastes massive amounts of compute on surface-level details. It memorizes patterns instead of learning the actual physics of reality. He proposed a different path: JEPA (Joint-Embedding Predictive Architecture). Instead of forcing the AI to paint the world pixel by pixel, JEPA forces it to predict abstract concepts. It predicts what happens next in a compressed "thought space." But for years, JEPA had a fatal flaw. It suffered from "representation collapse." Because the AI was allowed to simplify reality, it would cheat. It would simplify everything so much that a dog, a car, and a human all looked identical. It learned nothing. To fix it, engineers had to use insanely complex hacks, frozen encoders, and massive compute overheads. Until today. Researchers just dropped a paper called "LeWorldModel" (LeWM). They completely solved the collapse problem. They replaced the complex engineering hacks with a single, elegant mathematical regularizer. It forces the AI's internal "thoughts" into a perfect Gaussian distribution. The AI can no longer cheat. It is forced to understand the physical structure of reality to make its predictions. The results completely rewrite the economics of AI. LeWM didn't need a massive, centralized supercomputer. It has just 15 million parameters. It trains on a single, standard GPU in a few hours. Yet it plans 48x faster than massive foundation world models. It intrinsically understands physics. It instantly detects impossible events. We spent billions trying to force massive server farms to memorize the internet. Now, a tiny model running locally on a single graphics card is actually learning how the real world works.

HowToAI_'s tweet photo. Yann LeCun was right the entire time. And generative AI might be a dead end.

For the last three years, the entire industry has been obsessed with building bigger LLMs. Trillions of parameters. Billions in compute.

The theory was simple: if you make the model big enough, it will eventually understand how the world works.

Yann LeCun said that was stupid.

He argued that generative AI is fundamentally inefficient.

When an AI predicts the next word, or generates the next pixel, it wastes massive amounts of compute on surface-level details.

It memorizes patterns instead of learning the actual physics of reality.

He proposed a different path: JEPA (Joint-Embedding Predictive Architecture).

Instead of forcing the AI to paint the world pixel by pixel, JEPA forces it to predict abstract concepts. It predicts what happens next in a compressed "thought space."

But for years, JEPA had a fatal flaw.

It suffered from "representation collapse."

Because the AI was allowed to simplify reality, it would cheat. It would simplify everything so much that a dog, a car, and a human all looked identical.

It learned nothing.

To fix it, engineers had to use insanely complex hacks, frozen encoders, and massive compute overheads.

Until today.

Researchers just dropped a paper called "LeWorldModel" (LeWM).

They completely solved the collapse problem.

They replaced the complex engineering hacks with a single, elegant mathematical regularizer.

It forces the AI's internal "thoughts" into a perfect Gaussian distribution.

The AI can no longer cheat. It is forced to understand the physical structure of reality to make its predictions.

The results completely rewrite the economics of AI.

LeWM didn't need a massive, centralized supercomputer.

It has just 15 million parameters.

It trains on a single, standard GPU in a few hours.

Yet it plans 48x faster than massive foundation world models. It intrinsically understands physics. It instantly detects impossible events.

We spent billions trying to force massive server farms to memorize the internet.

Now, a tiny model running locally on a single graphics card is actually learning how the real world works.

433

12K

2K

12K

1M

0

76

vinaygode retweeted

Chroma

@trychroma

3 months ago

Introducing Chroma Context-1, a 20B parameter search agent. > pushes the pareto frontier of agentic search > order of magnitude faster > order of magnitude cheaper > Apache 2.0, open-source

141

4K

403

4K

1M

Vinay

@vinaygode

4 months ago

@NaveenSpark @clawzempic The song 🙂

0

1

0

118

Who to follow

ZkSurvivors (∎, ∆) 𝕏

@ZkSurvivors

Hello Everyone ! 2222 GENESIS PASS Will Be Launched For Our Community Members Become Verified Zksurvivor

Vibe AI Labs

@vibe_ai_labs

Vibe AI Labs builds practical AI solutions that help businesses automate, scale, and innovate faster.

4 months ago

@zaraistastys Oh how true!

0

2

0

11

Vinay

@vinaygode

4 months ago

@trashh_dev Crime don’t climb

0

3

Vinay

@vinaygode

4 months ago

@dystopiabreaker Just keep in mind if you spin up too many polecats you’ll want an agent to manage them 🤣

0

244

Vinay

@vinaygode

4 months ago

@rvivek Work hard now or never get the opportunity to work again

0

7

Vinay

@vinaygode

4 months ago

Another Deepseek moment

MiniMax (official) @MiniMax_AI

4 months ago

Introducing M2.5, an open-source frontier model designed for real-world productivity. - SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work. - Optimized for efficient execution, 37% faster at complex tasks. - At $1 per hour with 100 tps, infinite scaling of long-horizon agents now economically possible MiniMax Agent: https://t.co/aIzrFYcfUz API: https://t.co/fHRdSV7BwZ CodingPlan: https://t.co/FDhZBBjQrX

MiniMax_AI's tweet photo. Introducing M2.5, an open-source frontier model designed for real-world productivity.

- SOTA performance at coding (SWE-Bench Verified 80.2%), search (BrowseComp 76.3%), agentic tool-calling (BFCL 76.8%) & office work.

- Optimized for efficient execution, 37% faster at complex tasks.

- At $1 per hour with 100 tps, infinite scaling of long-horizon agents now economically possible

MiniMax Agent: https://t.co/aIzrFYcfUz
API: https://t.co/fHRdSV7BwZ
CodingPlan: https://t.co/FDhZBBjQrX

454

9K

1K

8K

5M

0

1

0

148

Vinay

@vinaygode

4 months ago

@bindra_dhruv Open source and swap out the MIT license

0

189

Vinay

@vinaygode

4 months ago

@shauseth Okay but is it small batch and ethically sourced

0

1

0

55

Vinay

@vinaygode

4 months ago

Starting to dive into agent lightning. This could be a great unlock for making agents have better reasoning and improving tool use https://t.co/M70581tF79