Charis Mitsakis @cmitsakis - Twitter Profile

Charis Mitsakis

@cmitsakis

about 2 hours ago

@xeophon I wish Zcode was MIT licensed as well, but it's closed source I think

0

1

0

121

Charis Mitsakis

@cmitsakis

about 3 hours ago

@arthurmensch All day I argue with the conspiracy theorists and the fat-shamers that deny the existence of Le Chaton Fat 🐱 (yes denying its existence is the ultimate form of fat-shaming)

0

9

0

5K

Charis Mitsakis

@cmitsakis

about 5 hours ago

@paw_lean @pizzaboy exactly! smart guy🧠

0

2

0

24

Charis Mitsakis

@cmitsakis

about 6 hours ago

@julien_c @huggingface are you low-key fat-shaming Le Chaton Fat? 🐱

0

20

1

0

3K

Charis Mitsakis

@cmitsakis

about 6 hours ago

@AN0NYME4EU @kimmonismus of course. I never believed the denialists conspiracy theorists

0

1

0

12

Charis Mitsakis

@cmitsakis

about 8 hours ago

VibeThinker-3B, post-trained upon Qwen2.5-Coder-3B base, scored 94.3 on AIME26, with a performance similar to DeepSeek V3.2, GLM-5, and Gemini 3 Pro. Small models are the future for agents because they can use tools to get the knowledge they lack and they can run fast and cheap.

Francesco Bertolotti @f14bertolotti

about 15 hours ago

Stellar performance from a 3B model. These results were achieved primarily through post-training refinements on Qwen2.5-Coder. The paper doesn't provide many details, but it appears they distill from RL ckpts and then do a final RL-based instruct RL. 🔗https://t.co/FmdRwGNMOg

f14bertolotti's tweet photo. Stellar performance from a 3B model. These results were achieved primarily through post-training refinements on Qwen2.5-Coder. The paper doesn't provide many details, but it appears they distill from RL ckpts and then do a final RL-based instruct RL.

🔗https://t.co/FmdRwGNMOg https://t.co/QPez8Ddbgp

17

350

51

308

150K

0

4

0

1

195

Charis Mitsakis

@cmitsakis

about 9 hours ago

denying the existence of Le Chaton Fat is the newest form of fat-shaming

1

7

1

0

53

Charis Mitsakis

@cmitsakis

about 9 hours ago

@cargoshortdad64 Recursive Self Fattening reaching galactic dimensions 🐱

0

127

Charis Mitsakis

@cmitsakis

about 9 hours ago

@scaling01 you called Le Chaton Fat a joke? 🐱

1

0

66

Charis Mitsakis

@cmitsakis

about 9 hours ago

@kimmonismus I think small models are the future for agents because they can use tools to get the knowledge and the can run fast and cheap

4

32

0

3

3K

Charis Mitsakis

@cmitsakis

about 10 hours ago

@financialjuice is this for Le Chaton Fat? 🐱

0

31

Charis Mitsakis

@cmitsakis

about 11 hours ago

The haters want to ban Le Chaton Fat, but they don't understand it's futile because Mistral implemented Recursive Self Fattening, and nothing can stop the exponential.

GLIF

@heyglif

1 day ago

The French Goverment needs to STOP Le Chaton Fat before it's too late. This is the FAT takeoff we've been warned against for years!

62

2K

152

285

287K

0

2

0

1

164

Charis Mitsakis

@cmitsakis

about 11 hours ago

@techmeditator it's not just fat, but it's getting fatter through Recursive Self Fattening. The details of the RSF implementation are described on the paper.

1

2

0

7