Samoulox @D_Jango21 - Twitter Profile

6 days ago

Introducing Sakana Fugu: A full multi-agent orchestration system accessible via a single model API. Our ‘Fugu Ultra’ model matches the performance of Fable and Mythos, delivering frontier capability without the risk of export controls. Try it: https://t.co/hhO6qTawgb 🐡

1K

38K

6K

30K

26M

D_Jango21 retweeted

Marcel Samba @_Marcel_Samba_

28 days ago

« Ça, c’est pour les juifs » Paris est tragique.

55

836

299

140

93K

D_Jango21 retweeted

OpenAI

@OpenAI

about 1 month ago

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

1K

27K

4K

9K

14M

Samoulox @D_Jango21

about 1 month ago

@enriconion @TribunePop23 Du coup tu boycott pas la Chine pour ce qu’ils font aux Ouïghours ?

0

54

Who to follow

D_Jango21 retweeted

Google DeepMind @GoogleDeepMind

about 2 months ago

We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️ These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵

461

9K

1K

3K

2M

D_Jango21 retweeted

Anuj

@anujcodes_21

about 2 months ago

Claude FULL COURSE 1 HOUR (Build & Automate Anything)

10

1K

244

1K

81K

D_Jango21 retweeted

PitchSavant

@PitchSavant

about 2 months ago

En vrai, je peux payer 40 € par mois pour des analyses comme ça qui durent 10 minutes.

73

17K

2K

4K

600K

D_Jango21 retweeted

AlAudhli 𝕏 العوذلي @alaudhli

about 2 months ago

Joshua Van vs Tatsuro Taira FULL UFC FIGHT at #UFC328 https://t.co/afVqoHmxUH

9

1K

107

429

89K

D_Jango21 retweeted

Anthropic

@AnthropicAI

about 2 months ago

New Anthropic research: Natural Language Autoencoders. Models like Claude talk in words but think in numbers. The numbers—called activations—encode Claude’s thoughts, but not in a language we can read. Here, we train Claude to translate its activations into human-readable text.

593

17K

2K

9K

2M

D_Jango21 retweeted

BFM

@BFMTV

2 months ago

Soldat français tué au Liban: "Le Hezbollah a visé nos soldats", affirme Emmanuel Macron

101

143

35

16

22K

D_Jango21 retweeted

LEON le média

@leonlemedia

2 months ago

🗨️ "Sale génocidaire", "tu tues les Palestiniens" : Emma raconte sa scolarité dans un lycée public du Val-de-Marne, en tant qu'élève identifiée comme juive. "Ce qui m'a le plus marquée, c'est les croix gammées dessinées sur mes tables". Témoignage recueilli avec l'aide de l'@ULJF_officiel.⤵️

364

3K

1K

518

249K

D_Jango21 retweeted

iDoser

@doser_i85668

3 months ago

How did you come up with the phrase "He rushed to the stage in slow motion"?! 🤣 Love your work 😍

47

12K

902

2K

98K

D_Jango21 retweeted

Avi Chawla

@_avichawla

3 months ago

https://t.co/HTVp6zvP3v

28

2K

409

5K

1M

D_Jango21 retweeted

Giuliano Liguori

@ingliguori

3 months ago

8 specialized AI model types 👇 LLM → text generation LCM → semantic reasoning LAM → action-oriented agents MoE → expert routing VLM → vision + language SLM → lightweight edge models MLM → masked token learning SAM → image segmentation AI is moving from “one big model” to specialized architectures. #AI #LLM #MoE #VLM #MachineLearning

ingliguori's tweet photo. 8 specialized AI model types 👇

LLM → text generation
LCM → semantic reasoning
LAM → action-oriented agents
MoE → expert routing
VLM → vision + language
SLM → lightweight edge models
MLM → masked token learning
SAM → image segmentation

AI is moving from “one big model”
to specialized architectures.

#AI #LLM #MoE #VLM #MachineLearning

35

2K

450

1K

55K

D_Jango21 retweeted

Andrej Karpathy

@karpathy

5 months ago

New art project. Train and inference GPT in 243 lines of pure, dependency-free Python. This is the *full* algorithmic content of what is needed. Everything else is just for efficiency. I cannot simplify this any further. https://t.co/HmiRrQugnP

646

25K

3K

29K

5M

D_Jango21 retweeted

Math Cafe

@Riazi_Cafe_en

4 months ago

EPFL's "Optimization for Machine Learning". PDF (2023): https://t.co/yXTer78n6k Video (2021): https://t.co/bMYJ37amWt

1

441

65

408

17K

D_Jango21 retweeted

Tech with Mak

@techNmak

4 months ago

This free CUDA course is worth more than most CS degrees. 12 hours that separate library users from GPU engineers. I watched senior devs struggle with concepts taught in hour 3. What makes it different: No hand-waving. No "just use this library." You build an MLP trainer FOUR times: → PyTorch (the easy way) → NumPy (getting harder) → C (now we're cooking) → CUDA (chef's kiss) Same model. Same dataset. Four implementations. By the end, you understand WHY PyTorch is fast. The curriculum nobody else teaches: ➡️ GPU architecture (not just "it's parallel") ➡️ Writing kernels that don't suck ➡️ Profiling at kernel AND system level ➡️ When cuBLAS helps (and when it doesn't) ➡️ CUDA vs Triton (the comparison you need) ➡️ PyTorch extensions (actually useful ones) Real talk: ➡️ After this course, you'll read PyTorch source code and understand it. ➡️ You'll optimize models other engineers can't touch. ➡️ You'll be the person teams hire to make things fast. Created by @elliotarledge 💪 12 hours. Free. No excuses. Who's starting this weekend? (I will put the details in the comments.)

techNmak's tweet photo. This free CUDA course is worth more than most CS degrees.

12 hours that separate library users from GPU engineers.

I watched senior devs struggle with concepts taught in hour 3.

What makes it different:

No hand-waving. No "just use this library."

You build an MLP trainer FOUR times: → PyTorch (the easy way) → NumPy (getting harder) → C (now we're cooking) → CUDA (chef's kiss)

Same model. Same dataset. Four implementations.

By the end, you understand WHY PyTorch is fast.

The curriculum nobody else teaches:
➡️ GPU architecture (not just "it's parallel")
➡️ Writing kernels that don't suck
➡️ Profiling at kernel AND system level
➡️ When cuBLAS helps (and when it doesn't)
➡️ CUDA vs Triton (the comparison you need)
➡️ PyTorch extensions (actually useful ones)

Real talk:
➡️ After this course, you'll read PyTorch source code and understand it.
➡️ You'll optimize models other engineers can't touch.
➡️ You'll be the person teams hire to make things fast.

Created by @elliotarledge 💪

12 hours. Free. No excuses.

Who's starting this weekend?
(I will put the details in the comments.)

5

701

113

935

63K

D_Jango21 retweeted

alex zhang

@a1zhang

6 months ago

Much like the switch in 2025 from language models to reasoning models, we think 2026 will be all about the switch to Recursive Language Models (RLMs). It turns out that models can be far more powerful if you allow them to treat *their own prompts* as an object in an external environment, which they understand and manipulate by writing code that invokes LLMs! Our full paper on RLMs is now available—with much more expansive experiments compared to our initial blogpost from October 2025! https://t.co/x47pIfIkTb